Section 13.2 Export Package Definition

WIB™ Review supports various types of exports and formats. The Export Package Definition includes a Name, Description, Delimiter, and Text Qualifier. It also defines what is exported and the format for some components of the package, for example image formats such as JPEG or PDF and single-page or multi-page output.

Name the export package in a manner that is standardized and allows for sorting and/or quickly identifying the content. For example, including the date in the Name will allow for sorting the Export Packages in date order. Keep the Name short and leverage the description for more detailed information about the export.

Date Created is a default metadata field for Export Packages, so the date is not required as part of the Package Name to sort by date order. The date in the example above is for demonstration only.

Section 13.2.3 Export File Format

The metadata can be exported to a CSV (Comma Separated Values) file, an Excel spreadsheet, or JSON (JavaScript Object Notation). If you need a format that is not available, please contact support@radixdata.com.

Section 13.2.3.1 CSV File Format Settings

Section 13.2.3.1.1 Column Delimiter

The delimiter in the metadata file is the character (comma or otherwise) that separates the data in your file into distinct fields.

Section 13.2.3.1.2 Text Qualifier

A text qualifier is a character that marks where the contents of a text field begin and end. Say you need to import a text file that is comma delimited (separated by commas), and one of the fields is a description that could contain a comma. You can use a text qualifier to show that the comma is part of the text field, not a separator.
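As a minimal sketch of how the Column Delimiter and Text Qualifier interact, the Python snippet below writes one record with the standard-library csv module. The field names and values are invented for illustration and are not taken from WIB Review; the delimiter and quotechar arguments stand in for the two settings.

```python
import csv
import io

def write_rows(rows, delimiter=",", text_qualifier='"'):
    """Serialize rows to delimited text, qualifying only the fields that need it."""
    buffer = io.StringIO()
    writer = csv.writer(
        buffer,
        delimiter=delimiter,
        quotechar=text_qualifier,
        quoting=csv.QUOTE_MINIMAL,
        lineterminator="\n",
    )
    writer.writerows(rows)
    return buffer.getvalue()

# Hypothetical header and record; the description contains a comma.
rows = [
    ["BoxIdentifier", "Description"],
    ["BX-0001", "Invoices, 2019"],
]
csv_text = write_rows(rows)
print(csv_text)

# Reading the text back with the same settings recovers two fields per row.
parsed = list(csv.reader(io.StringIO(csv_text), delimiter=",", quotechar='"'))
```

In this sketch the description is written as "Invoices, 2019" with the qualifier around it, so a reader that uses the same settings sees two fields rather than three. With a pipe delimiter the same value needs no qualifier, because the comma is no longer a separator.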

Section 13.2.4 Retention Days

Retention Days sets the length of time an export package is kept. The retention period should allow for the quality check and, if required, the import of the package into another platform. A notification can be configured to alert specific individuals when Export Packages are about to expire.
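The sketch below illustrates one way to reason about a retention period. It assumes the expiry date is the creation date plus the retention days, and it uses a hypothetical notice_days window for the expiry notification; neither detail is confirmed by this section.

```python
from datetime import date, timedelta

def expiry_date(created, retention_days):
    """Assumed rule: a package expires retention_days after it was created."""
    return created + timedelta(days=retention_days)

def is_expiring_soon(created, retention_days, today, notice_days=7):
    """True when the package expires within the next notice_days (hypothetical window)."""
    expires = expiry_date(created, retention_days)
    return today <= expires <= today + timedelta(days=notice_days)

# Hypothetical package created on 1 January 2024 with 30 retention days.
created = date(2024, 1, 1)
expires = expiry_date(created, 30)
```

When choosing a value, count the days needed for the quality check and any downstream import, then add a margin so the package does not expire mid-process.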

Section 13.2.6 Include Images

The metadata for images can be exported without the images. This feature is useful when the images for a given box have not changed but the metadata or attributes may have changed through user review or as a retroactive update to the Taxonomy.

Section 13.2.7 Attribute Classification

The system has two groups of attributes: extracted and user defined. Extracted attributes are those that WIB Review recognizes automatically from a Phrase List or a Regular Expression. User defined attributes are those that a user accepts as correct from the Extracted value or enters manually on the Review page. You can select which group(s) of attributes are included in the Export Configuration. To include both groups, select Both; to include only one of the two groups, select the appropriate radio button.

Section 13.2.8 Attribute Inclusion

Attribute inclusion determines which attribute classification is exported. There are four (4) options available, and each option is detailed below.

Section 13.2.8.1 Both

Selecting Both will export all attribute classes and system attributes.

Section 13.2.8.2 User

Selecting User will export the User class attributes and system attributes.

Section 13.2.8.3 Extracted

Selecting Extracted will export the Extracted class attributes and system attributes.

Section 13.2.8.4 Logical

Selecting Logical will logically determine which class attributes to export with the system attributes. The following logic determines which attribute class is exported:

IF a user defined attribute is not null, then the user attribute is exported.

IF a user defined attribute is null and an extracted attribute is defined, then the extracted attribute is exported.

IF both attribute classes contain a value, the user defined attribute takes precedence and will be exported.

IF both attribute classes are null the user class attribute is exported as a null value.
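The four rules above can be sketched as a small Python function. This is an illustration only: it assumes "null" corresponds to None (whether an empty string also counts as null is not stated here), and the function name and return shape are hypothetical.

```python
def select_logical(user_value, extracted_value):
    """Return (attribute_class, value) exported for one attribute under Logical."""
    # A non-null user value always wins, including when both classes have values.
    if user_value is not None:
        return ("user", user_value)
    # The user value is null, so fall back to a defined extracted value.
    if extracted_value is not None:
        return ("extracted", extracted_value)
    # Both are null: the user class attribute is exported as a null value.
    return ("user", None)

# Hypothetical values for a single attribute.
both_set = select_logical("Finance", "Finanse")
only_extracted = select_logical(None, "Finance")
neither = select_logical(None, None)
```

Because the user value is checked first, a reviewer's correction always overrides what was extracted automatically, and the extracted value fills the column only where no reviewer has supplied one.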

Section 13.2.9 Include OCR

The images and OCR are separate components unless the images are exported as a PDF. You can choose to use Radix Data OCR results or run the OCR through an engine in another application.

Section 13.2.10 Include Metadata

If WIB™ is used as a processor of images, no data is extracted, and the capture metadata is not needed, the images can be exported without the metadata. In the context of export, system generated attributes, extracted data, and user attributes are all defined as metadata.

Section 13.2.11 Attribute Types

The system has two groups of attributes: extracted and user defined. Extracted attributes are those that WIB Review recognizes automatically from a Phrase List or a Regular Expression. User defined attributes are those that a user accepts as correct from the Extracted value or enters manually on the Review page. You can select which group(s) of attributes are included in the Export Configuration. To include both groups, select Both; to include only one of the two groups, select the appropriate radio button.

Section 13.2.12 Grayscale Images

Images are captured in color, but you may need to export them in grayscale. This option converts the color images to grayscale during the export process. Grayscale images are exported in JPEG format.

Section 13.2.13 PDF

Portable Document Format (PDF) is one of the available image export formats. This option creates a searchable PDF that combines the OCR text and the image in a single file. PDFs can be exported as single-page or multi-page.

Section 13.2.13.1 Single-Page

One PDF file is exported for each image in a container.

Section 13.2.13.2 Multi-Page

All images for a container are combined into a single PDF file.
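To make the difference between the two options concrete, the sketch below plans which images go into which PDF file for one container. The file-naming scheme, the container identifier, and the function name are all hypothetical; the actual names WIB Review produces are not described in this section.

```python
from pathlib import PurePosixPath

def plan_pdf_files(container_id, image_names, multi_page):
    """Return a list of (pdf_name, source_images) pairs for one container."""
    if multi_page:
        # Multi-Page: every image in the container goes into one PDF.
        return [(f"{container_id}.pdf", list(image_names))]
    # Single-Page: one PDF per image, named after the image file (assumed naming).
    return [(f"{PurePosixPath(name).stem}.pdf", [name]) for name in image_names]

# Hypothetical container with two captured images.
images = ["BX-0001_001.jpg", "BX-0001_002.jpg"]
single = plan_pdf_files("BX-0001", images, multi_page=False)
multi = plan_pdf_files("BX-0001", images, multi_page=True)
```

Here the single-page plan yields two PDF files with one image each, and the multi-page plan yields one PDF file containing both images.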

Section 13.2.13.3 DPI

DPI is a printer resolution measured in dots per inch (dpi). The higher the dpi, the finer the printed output you’ll get. Most inkjet printers have a resolution of approximately 720 to 2880 dpi. Printer resolution is different from, but related to, image resolution.

The DPI setting can be set when creating a PDF file. If exporting to other image formats, DPI cannot be set.
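As a worked example of how resolution relates pixels to physical size, the sketch below divides pixel dimensions by the DPI value to get the size in inches. The relationship itself is general; how WIB Review applies the DPI setting when it builds a PDF is not documented here, and the function name is hypothetical.

```python
def page_size_inches(width_px, height_px, dpi):
    """Physical size in inches of an image rendered at the given DPI."""
    if dpi <= 0:
        raise ValueError("dpi must be a positive number")
    return (width_px / dpi, height_px / dpi)

# A 2550 x 3300 pixel scan at 300 DPI fills a US Letter page.
letter = page_size_inches(2550, 3300, 300)

# The same pixels at 150 DPI cover twice the width and height.
doubled = page_size_inches(2550, 3300, 150)
```

The first call gives 8.5 by 11 inches and the second gives 17 by 22 inches, which shows why a lower DPI value makes the same image larger on the page.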

Section 13.2.14 Column Mapper

The column mapper allows you to select which attributes are in the export and how those attributes are named. You can rename the attributes to match the field names of another system for seamless imports. To exclude an attribute, make sure the selector is not checked. To rename an attribute, enter the new name in the Exported Header field next to the Attribute Name.

Section 13.2.14.1 Include System Attributes

System attributes are automatically included in the export. To change the header name or the order in which the system attributes appear in the export, select Include System Attributes.

Section 13.2.14.2 Map a single attribute to two (2) fields in the Export

Add the attribute to the Column Mapper a second time and change the Exported Header name.

Section 13.2.15 Column Order

You can set the order in which the attributes appear in the export. To change the order, select an attribute and drag it to the correct position. Repeat this for each attribute until the Column Order matches the order you want in the export.
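The effect of the Column Mapper and Column Order settings can be sketched as an ordered list of (attribute name, exported header) pairs. The attribute names, headers, and function below are hypothetical and only illustrate renaming, excluding, duplicating, and ordering columns.

```python
def build_export_rows(records, column_map):
    """Build a header row and data rows from an ordered column mapping."""
    # The order of column_map is the order of the columns in the export.
    header = [exported_header for _, exported_header in column_map]
    rows = [
        [record.get(attribute, "") for attribute, _ in column_map]
        for record in records
    ]
    return header, rows

# Hypothetical mapping: "Department" is mapped twice under different headers,
# and "Notes" is left out, so it is excluded from the export.
column_map = [
    ("Box Identifier", "BoxID"),
    ("Department", "Dept"),
    ("Department", "CostCenter"),
]
records = [
    {"Box Identifier": "BX-0001", "Department": "Finance", "Notes": "not mapped"},
]
header, rows = build_export_rows(records, column_map)
```

In this sketch the export has three columns, BoxID, Dept, and CostCenter, and the Department value appears under both of its headers.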

Section 13.2.16 XML Character Encoding

You can replace special characters in the export using the XML Character Encoding setting by adding each Character and its replacement Representation. These characters are replaced in the file containing the metadata. Some examples include, but are not limited to, the following:

Character Name                        Character    Encoded Representation
Ampersand                             &            &amp;
Less than (Left angle bracket)        <            &lt;
Greater than (Right angle bracket)    >            &gt;
Double quotation                      "            &quot;
Single quotation (apostrophe)         '            &apos;
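As an illustration of the replacement, the sketch below maps the five characters listed above to the standard predefined XML entities and applies the mapping to a metadata value. The function name and sample value are hypothetical, and the mapping configured in an actual Export Package may differ.

```python
# Standard predefined XML entities for the characters listed above.
XML_ENCODINGS = {
    "&": "&amp;",
    "<": "&lt;",
    ">": "&gt;",
    '"': "&quot;",
    "'": "&apos;",
}

def encode_value(text, encodings=XML_ENCODINGS):
    """Replace each configured character with its encoded representation."""
    # A single pass over the characters means the "&" introduced by a
    # replacement is never encoded a second time.
    return "".join(encodings.get(ch, ch) for ch in text)

# Hypothetical metadata value containing three special characters.
encoded = encode_value('Smith & Sons <"A">')
```

Replacing characters one at a time in a single pass avoids the double-encoding that can occur when replacements are applied one after another with the ampersand processed last.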

Section 13.2.17 Query

Displays a JSON view of the query that generated the search results, or the JSON for the active box record in Review.

Section 13.2.18 Export History

The export history includes the following information: Box Identifier, Image Filename, Export Package, Configuration, Username, Boolean export status for the OCR Text, Image, and metadata, the capture date of the image, the date of export, and the expiry date for the export package. Please note that the expiry date only applies to the export package and not to the images in storage.

Select Include Export History to include the history with the export. To export ONLY the history, select Include Export History and leave all other export options off. This configuration can then be scheduled to export the history.

