Section 13.2 Export Package Definition
WIB™ Review supports various types of exports and formats. The Export Package Definition includes a Name, Description, Delimiter, Text Qualifier and defines what is exported and the format for some components of the package i.e., Image formats such as JPEG, PDF, Single-Page, Multi-page, etc.Name the export package in a manner that is standardized and allows for sorting and/or quickly identifying the content. For example, including the date in the Name will allow for sorting the Export Packages in date order. Keep the Name short and leverage the description for more detailed information about the export.
Date Created is a default metadata field for Export Packages and is not required as part of the Package Name to sort by data order. Date is used for demonstrative purposes only.
Section 13.2.3 Export File Format
The metadata can be exported to a CSV (Comma Separated Values), Excel Spreadsheet, or JSON (JavaScript Object Notation. If you need a format that is not available please contact support@radixdata.com.
Section 13.2.3.1 CSV File Format Settings
Section 13.2.3.1.1 Column Delimiter
The delimiter in the metadata file is the character (comma or otherwise) that separates the data in your file into distinct fields.
Section 13.2.3.1.2 Text Qualifier
A text qualifier is a character used to distinguish the point at which the contents of a text field should begin and end. Say you need to import a text file that is comma delimited (separated by commas), and one of fields is a description that could potentially contain a comma. You can use a text qualifier to show that the comma is meant to be included within the text field-- not to be used as a separator.
Section 13.2.4 Retention Days
The length of time an export package should be kept. The retention period should allow for the quality check and if required, the import of the package into another platform. A notification can be configured to notify specific individuals of the Export Packages about to expire.
Section 13.2.5 Retention Notification
Section 13.2.6 Include Images
The metadata for images can be exported without the images. This feature is useful when the images for a given box have not changed but the metadata or attributes may have changed through user review or as a retroactive update to the Taxonomy.
Section 13.2.7 Attribute Classification
The system has two groups of attributes, extracted and user defined. Extracted attributes are those that are automatically recognized by WIB Review from a Phrase List or a Regular Expression and those that a user accepts as correct from the Extracted value or are manually entered into the Review Page. You can select which group(s) of attributes are included in the Export Configuration. To include both groups select Both to include only one of the two groups select the appropriate radial button.
Section 13.2.8 Attribute Inclusion
Attribute inclusion determines which attribute classification is exported. There are four (4) options available, and each option is detailed below.
Section 13.2.8.1 Both
Selecting Both will export all attributes classes and system attributes.
Section 13.2.8.2 User
Selecting User will export the User class attributes and system attributes.
Section 13.2.8.3 Extracted
Selecting Extracted will export the Extracted class attributes and system attributes.
Section 13.2.8.4 Logical
Selecting Logical will logically determine which class attributes to export with the system attributes. The following logic determines which attribute class is exported:
IF a user defined attribute is not null, then the user attribute is exported.
IF a user defined attribute is null and an extracted attribute is defined then the extracted attribute is exported.
IF both attribute classes contain a value, the user defined attribute takes precedence and will be exported.
IF both attribute classes are null the user class attribute is exported as a null value.
Section 13.2.9 Include OCR
The images and OCR are separate components unless the images are exported as a PDF. You can choose to use Radix Data OCR results or run the OCR through an engine in another application.
Section 13.2.10 Include metadata
If WIB™ us used as a processor of images and no data is extracted and the capture metadata is not needed, the images can be exported without the metadata. In the concept of export, metadata, or system generated attributes, extracted data, and user attributes are all defined as metadata.
Section 13.2.11 Attribute Types
The system has two groups of attributes, extracted and user defined. Extracted attributes are those that are automatically recognized by WIB Review from a Phrase List or a Regular Expression and those that a user accepts as correct from the Extracted value or are manually entered into the Review Page. You can select which group(s) of attributes are included in the Export Configuration. To include both groups select Both to include only one of the two groups select the appropriate radial button.
Section 13.2.12 Grayscale images
Images are captured in color. However, there may arise the need to export images in grayscale. This option converts the color images to grayscale during the export process. Grayscale images are in JPEG image format.
Section 13.2.13 PDF
Portable Document Format (PDF) is an image format. This option creates a PDF which combines the OCR and image into a combined file that is searchable. PDFs can be exported as single-page or multi-page.
Section 13.2.13.1 Single-Page
One PDF file is exported for each image in a container.
Section 13.2.13.2 Multi-Page
All images for a container are combined into a single PDF file.
Section 13.2.13.3 DPI
DPI is a printer resolution measured in dots per inch (dpi). The higher the dpi, the finer the printed output you’ll get. Most inkjet printers have a resolution of approximately 720 to 2880 dpi. Printer resolution is different from, but related to, image resolution.
The DPI setting can be set when creating a PDF file. If exporting to other image formats, DPI cannot be set.
Section 13.2.14 Column Mapper
The column mapper allows you to select which attributes are in the export and how those attributes are named. You can rename the attributes to match another system name for seamless imports. To exclude an attribute, make sure the selector is not checked. To rename the attribute enter the new name in the Exported Header field next to the Attribute Name.
Section 13.2.14.1 Include System Attributes
System attributes are automatically included in the export. To change the header name of the order the system attributes appear in the export, select include System Attributes.
Section 13.2.14.2 Map a single attribute to two (2) fields in the Export
Add the attribute to the Column Mapper a second time and change the Exported Header name.
Section 13.2.15 Column Order
You can set the order the attributes appear in the export. To change the order, select an attribute and drag it to the correct position. Do this for each attribute until the Column Order appears in the order you want them to appear in the export.
Section 13.2.16 XLM Character Encoding
You can replace special characters in the export using the XML Character Encoding by adding the Character and the replacement Representation. These characters will be replaced in the file containing the metadata. Some examples include but are not limited to the following:
Character Name
Character
Encoded Representation
Ampersand
&
&
Less than (Left angle bracket)
<
<
Greater than (Right angle bracket)
>
>
Double quotation
“
"
Single quotation (apostrophe)
‘
'
Section 13.2.17 Query
JSON view of the query that generated the results in the search or the JSON for the active box record in Review.
Section 13.2.18 Export History
The export history includes the following information: Box Identifier, Image Filename, Export Package, Configuration, username, Boolean export status for the OCR Text, Image, and metadata, the capture date of the image, the date of export, and the expiry date for the export package. Please note that the expiry date only applies to the export package and not the images in storage.
Select Include Export History to include the history with the export or select only the ‘Include Export History’ and leave all other export options off to export ONLY the history. This configuration can then be scheduled to export the history.