Document Understanding User Guide
Validation Station
The Validation Station is a user interface that can be used to validate and correct data extracted from documents using the Document Understanding framework.
Validation Station can be used both as a stand-alone tool and integrated with Orchestrator/Automation Cloud Orchestrator.
It can also be used either as an attended activity, through Present Validation Station, or as an action in Action Center (as part of Orchestrator Tasks), through the Create Document Validation Action and Wait for Document Validation Action and Resume activities.
Certain inputs need to be provided for the Validation Station to function. Here is a list with the minimum required inputs in order to configure the Validation Station as an attended activity:
- Provide a Taxonomy needed for processing the document. It can be created directly in the workflow by using the Taxonomy Manager wizard, and it can be loaded using the Load Taxonomy activity. It should be provided as a DocumentTaxonomy variable.
- Provide the Document Path for the file you want to validate.
- Provide a Document Object Model for the file, which can be obtained as an output of the Digitize Document activity.
- Provide the Document Text of the file, which can be obtained as an output of the same Digitize Document activity.
- Provide the Automatic Extraction Results, which represent the data automatically extracted by the robot that now needs human validation. This is the output of the Data Extraction Scope activity.
Note: If no variable is provided in the AutomaticExtractionResults property field, or if no classification is published in the provided variable, the Validation Station opens in full manual processing mode.
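The checklist above can be sketched as a small pre-flight check. This is an illustrative Python model, not UiPath code: the `ValidationInputs` class, its attribute names, and the `"review"` / `"manual"` mode labels are hypothetical, and the real activity receives these values as workflow properties. The sketch also ignores the second case in the note (a variable that is provided but contains no classification).

```python
from dataclasses import dataclass
from typing import Any, List, Optional


@dataclass
class ValidationInputs:
    """Hypothetical container mirroring the inputs listed above."""
    taxonomy: Optional[Any] = None                # DocumentTaxonomy variable
    document_path: Optional[str] = None           # file to validate
    document_object_model: Optional[Any] = None   # output of Digitize Document
    document_text: Optional[str] = None           # output of Digitize Document
    automatic_extraction_results: Optional[Any] = None  # output of Data Extraction Scope


def missing_required(inputs: ValidationInputs) -> List[str]:
    """Return the names of the inputs that were not provided."""
    required = ["taxonomy", "document_path", "document_object_model", "document_text"]
    return [name for name in required if getattr(inputs, name) is None]


def processing_mode(inputs: ValidationInputs) -> str:
    """Without extraction results, validation falls back to full manual processing."""
    if inputs.automatic_extraction_results is None:
        return "manual"
    return "review"
```

Running `missing_required` before presenting the document makes a missing digitization output visible early, instead of surfacing as a failure when the window opens.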
You can use the Present Validation Station activity anywhere in your workflows.
An attended activity is functional only if the file that needs to be processed is available locally on the machine that executes the process.
At run-time, a window opens and displays the Validation Station so that the user can interact with it.
Increase your productivity by creating an orchestration process that adds document validation actions in Action Center, in either on-premises Orchestrator or its Automation Cloud Orchestrator counterpart. This approach reduces the need to store documents locally, to have a robot installed on each machine operated by a human user, or to have the robot wait for human users to finish validation.
This approach requires you to have your robot connected to your Orchestrator and to build a project of type Orchestration Process in UiPath Studio. You can publish your project to Orchestrator once it is ready.
When running a project with UiPath Orchestrator, you need to use the Create Document Validation Action and Wait for Document Validation Action and Resume activities. These activities create the connection between the unattended robots that execute the automatic part and the Knowledge Workers who pick up actions from Action Center (on-premises or cloud-based).
In this scenario, the Validation Station UI is displayed directly in UiPath Orchestrator's Action Center but has the same functionalities as the one run as an attended activity.
The Validation Station is a powerful tool that enables users to review and correct automatically extracted data from files, or manually process files for data extraction.
The Validation Station, once opened, presents all extracted information along with the file being processed:
The fields that are visible in the Validation Station are the ones defined in the Taxonomy used in your workflow.
The right area of the Validation Station contains an interactive version of the original document, in which text or document sections can be selected, and words can be clicked based on the output of the digitization process.
This area also contains options for zooming in and out, selecting and rotating pages, searching through the document, or switching to text view.
Here are the options available in the right part of the Validation Station screen:
Option | Description |
---|---|
Toggle | Toggles between the text view and image view of the document. |
Zoom In | Zooms in on the document. |
Reset Zoom | Resets the zoom level on the document. This option is enabled only if the document was previously zoomed in or out. |
Zoom Out | Zooms out on the document. |
Rotate | Rotates the current page clockwise. |
Search | Initiates a search within the document used by the Validation Station. |
Keyboard Shortcuts | Displays all the keyboard shortcuts supported by the Validation Station. |
Here is an example of how to use the Toggle, Zoom In, Zoom Out and Reset Zoom options:
To zoom in or out, you can also use the CTRL+scroll mouse wheel combination: CTRL+scroll up to view a specific section of the document; CTRL+scroll down to view a larger section of the document.
The below example illustrates the Rotate option:
The next example explains how to use the Search option:
There are many keyboard shortcuts that can be used to optimize the human interaction with the Validation Station. We encourage you to use them as much as possible. These can be reviewed in the Keyboard Shortcuts popup.
Shortcut | Description |
---|---|
n | Moves to the next field |
p | Moves to the previous field |
f v | Marks a value as validated |
f c | Changes the extracted value |
f z | Reverts to the previous value |
f a | Adds an additional value |
f s | Toggles between suggestions |
ESC | Exits edit mode (for Fields and Tables); collapses the derived parts (for Fields); deselects a line or exits table selection mode (for Table Selection) |
DEL | Removes the selected value (for Fields); removes the selected line (for Table Selection) |
d + | Zooms in |
d - | Zooms out |
d 0 | Resets zoom |
d r | Rotates the page clockwise |
d t | Toggles the text mode |
/ | Initiates a search |
→ | Moves to the right cell |
← | Moves to the left cell |
↑ | Moves to the top cell |
↓ | Moves to the bottom cell |
t v | Marks a cell as validated |
t c | Changes the extracted cell |
t z | Reverts to the previous cell value |
t DEL | Removes the selected cell |
s ← | Moves the selected line to the left |
s → | Moves the selected line to the right |
s ↑ | Moves the selected line up |
s ↓ | Moves the selected line down |
s d | Duplicates the selected line |
s v | Selects vertical lines |
s f | Selects horizontal lines |
s a | Auto-detects lines by mouse movement |
s t | Hand tool selection - moves and deletes lines |
? | Opens the keyboard shortcuts list |
! | Reports the document as an exception |
CTRL Enter | Saves all data |
CTRL DEL | Discards all current changes |
1 2 3 4 5 6 7 8 9 q w e r y u i o a g h j k l b m @ # $ % ^ & * ( ) [ ] { | Assigns values to a field; selects the corresponding field |
Visualize the available keyboard shortcuts:
The left area displays the document type you have selected for the current validation and enables you to select the state of each element and link it to its corresponding word or area in the document.
You can use the field shortcuts to assign values to a field or to toggle between fields. Once a value is assigned to a field, it is highlighted in accordance with the color of the selected field.
The Document Type field is a special field that you can act upon:
- If the extraction results contain a document type, and that document type is correct, then no action is required.
- If the extraction results contain a document type, and that document type is incorrect, then you have to select the correct one and provide evidence for it from within the document.
- If no extraction result is provided and only one document type exists in the taxonomy, then that document type is pre-selected but needs evidencing.
- If no extraction result is provided and there are multiple document types in the taxonomy, then you have to manually select the desired document type and provide evidence for it.
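The four cases above form a small decision table, sketched here in Python. The function name, its parameters, and the returned strings are hypothetical and only restate the rules in the list; they are not part of the product.

```python
from typing import List, Optional


def document_type_action(extracted_type: Optional[str],
                         extracted_type_is_correct: bool,
                         taxonomy_types: List[str]) -> str:
    """Return what the user has to do with the Document Type field."""
    if extracted_type is not None:
        if extracted_type_is_correct:
            return "no action required"
        return "select the correct type and provide evidence"
    # No extraction result: the outcome depends on how many types the taxonomy defines.
    if len(taxonomy_types) == 1:
        return "type is pre-selected; provide evidence"
    return "select a type manually and provide evidence"
```

For example, `document_type_action(None, False, ["Invoice"])` corresponds to the third case, where the single document type is pre-selected but still needs evidence.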
All fields that contain information have an Options drop-down menu that can be accessed by clicking on the button. A drop-down list becomes visible, displaying multiple editing options.
The menu includes the following options:
- Change extracted value - Changes the automatically extracted value with a manually selected one. This field is active only when one or multiple values are selected from the document and are different from the original value.
- Revert to previous value - Resets the field's value to its last state. This option is active only when a value was previously altered or deleted.
- Mark as missing - Marks a field as missing if the information is not available in the document.
There are several ways of selecting text while using the Validation Station wizard. Using them allows you to quickly navigate through the entire document and easily select the desired words for validating a field.
Here is a list of all the available selection options:
- Select one word - Click the desired word.
- Select consecutive words - Click the first word, then SHIFT+click the last word in the range.
- Select multiple disparate words - Click the first word, then CTRL+click the rest of the desired words.
- Combine multiple selections - Click the first word, then SHIFT+click the last word from a range for the first selection, then hold CTRL+click and SHIFT+click to add another range, until you've completed your selections.
- Area selection - Make a selection and choose the selection type:
  - Tokens - Selects all words in the selected area.
  - Custom area - Captures only the area and not the words in it.
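The click-based selection modes above can be modeled with a set of word indices and an anchor. This is a toy Python sketch under stated assumptions: words are identified by their reading-order index, the `WordSelection` class is hypothetical, and CTRL+click only adds a word (whether it also toggles a selected word off is not stated here). Area selection is not modeled.

```python
from typing import Optional, Set


class WordSelection:
    """Toy model of word selection; words are identified by reading-order index."""

    def __init__(self) -> None:
        self.selected: Set[int] = set()
        self.anchor: Optional[int] = None

    def click(self, index: int) -> None:
        """Plain click: start a new selection with a single word."""
        self.selected = {index}
        self.anchor = index

    def ctrl_click(self, index: int) -> None:
        """CTRL+click: add one more word and make it the new anchor."""
        self.selected.add(index)
        self.anchor = index

    def shift_click(self, index: int) -> None:
        """SHIFT+click: add every word between the anchor and the clicked word."""
        if self.anchor is None:
            self.click(index)
            return
        low, high = sorted((self.anchor, index))
        self.selected.update(range(low, high + 1))
```

Combining selections then reads as `click(1)`, `shift_click(3)`, `ctrl_click(7)`, `shift_click(9)`, which leaves words 1 to 3 and 7 to 9 selected.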
Area selection can be observed below:
Edit the field's value - Changes the content of a field by positioning the mouse on that field, clicking the value, and adding the desired input.
The option - Reverts the field to its prior state. One click on this option takes you one step back, meaning that if you made several changes to that field, multiple clicks might be required to return to a certain value. This option is active only when a value was previously modified or deleted.
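The one-step-back behavior described above matches a simple history stack. The following Python sketch is an assumption about how such behavior can be modeled, not a description of the product's internals; the `FieldValue` class and its methods are hypothetical.

```python
from typing import List, Optional


class FieldValue:
    """Toy field whose edits can be undone one step at a time."""

    def __init__(self, initial: Optional[str]) -> None:
        self.value = initial
        self._history: List[Optional[str]] = []

    @property
    def can_revert(self) -> bool:
        """The revert option is active only after a change or deletion."""
        return bool(self._history)

    def change(self, new_value: Optional[str]) -> None:
        """Remember the current value, then replace it."""
        self._history.append(self.value)
        self.value = new_value

    def delete(self) -> None:
        """Deleting is modeled as a change to an empty value."""
        self.change(None)

    def revert(self) -> None:
        """Go back exactly one step; do nothing if there is no history."""
        if self._history:
            self.value = self._history.pop()
```

After two changes, two calls to `revert()` are needed to get back to the original value, which mirrors the multiple clicks mentioned above.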
The option - Adds a value to the field by using the Custom area or Tokens selection. The button becomes available when a selection is made in the document and differs from the one in the field. The selection can be made for multi-value fields at all times, and for single-value fields only if no value is present for that field.
The option - Confirms the information in the field. Once confirmed, the check box turns blue.
The interface of the Validation Station is interactive, meaning that when a field is selected on the left side, the right side moves the focus to it by highlighting it.
The option - Enables you to select and add additional values from the document to a specific field.
The option - Enables you to add a value to a field without requiring reference from the document.
Below you can find explanations for all the options available for a table field:
| Icon | Description |
|---|---|
| | Indicates the table's area in the document. |
| | Opens a drop-down list of options for the table field. |
| | When selected, confirms the data accuracy. |
| | Enabled when the field's value is missing. |
| | Enabled when the initial extracted value had been altered or deleted. When selected, it restores the previous value. |
| | Removes the value, in this case, the existing table. |
All table fields have a drop-down menu with the following options:
- Change extracted value - Selects a new value from the document and replaces the initial one. This option becomes available only when the newly selected value is different from the original selection.
- Revert to previous value - Replaces the current value with the previous one. This option is active only after the initial value has been altered.
- Mark as missing - Marks a field as missing if the information is not available in the document.
The drop-down menu available at the end of each row has the following options:
- Make it the table header - Transforms the selected row into the table's header. The row no longer appears in the table's body, but its information can be viewed any time a header field is selected.
- Insert row above - Adds a new row above the selected one. The row is added to the table automatically, with all fields marked as Not extracted. You have to manually select each value from the document and add it by using the Add extracted value option.
An example of how to use this option is illustrated below:
- Insert row below - Works the same way as the Insert row above option, the only difference being that the row is inserted below the current selection and not above it.
- Delete row - Deletes the selected row.
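The row-level options above can be pictured as operations on a list of rows. This Python sketch is illustrative only: the `Table` class and its method names are hypothetical, and cells are plain strings rather than extracted values with document references.

```python
from typing import List

NOT_EXTRACTED = "Not extracted"


class Table:
    """Toy table with a header row and a list of body rows."""

    def __init__(self, header: List[str], rows: List[List[str]]) -> None:
        self.header = header
        self.rows = rows

    def _empty_row(self) -> List[str]:
        # New rows start with every cell marked as Not extracted.
        return [NOT_EXTRACTED] * len(self.header)

    def make_header(self, index: int) -> None:
        """Move a body row into the header; it leaves the table's body."""
        self.header = self.rows.pop(index)

    def insert_row_above(self, index: int) -> None:
        self.rows.insert(index, self._empty_row())

    def insert_row_below(self, index: int) -> None:
        self.rows.insert(index + 1, self._empty_row())

    def delete_row(self, index: int) -> None:
        del self.rows[index]
```

In this model, inserting a row never copies data from neighboring rows, which is why each new cell still has to be filled from the document afterwards.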
You can define the table header when using the Extract new table option by enabling the Extract header function, by selecting the information from the document, or by transforming one of the existing rows into the table's header.
- Extract the table with a defined header by enabling the Extract header function once the Extract new table option is active.
- Confirm the table's header by positioning yourself on the header and selecting its equivalent from the table. Once the value is selected, click Change extracted value in the drop-down menu.
- Transform an existing row into the table's header by positioning yourself on one of the rows and selecting the Make it the table header option from the drop-down menu.
A table can be manually selected and defined straight from the Validation Station wizard. If no table is selected, or if you are not happy with the automatic selection, you can use the options available in the drop-down menu found at the end of the first row.
- Extract new table - Replaces the existing table with the new selection. You need to mark all rows and columns. Please keep in mind that the first row becomes the header of the table.
- Extract rows from here - Enables the custom selection tool and offers you the possibility to manually select new rows and add them below the currently selected one. Once the area of the new rows is selected, you must define each column. You can do this by using the available options presented in the table below.
- Highlight row - Highlights the selected row in the table.
- Highlight table - Highlights the entire extracted table area.
- Delete all rows - Deletes the existing table.
| Icon | Function |
|---|---|
| | Removes all lines visible in the selection. |
| | Removes only the selected line from the selection. |
| | Enables horizontal lines in the selection. |
| | Enables vertical lines in the selection. |
| | Enables you to adjust the line's direction using the mouse. |
| | Enables the selection, rearrangement, and removal of lines. |
Below you can see an example of how to extract a new table row and how to use the available options. The Extract header function is not available when appending rows.
Here is how you can extract a new table by using the Extract new table option:
You can click the Save new table button to automatically confirm all fields, or you can decline by clicking the No button, return to the table selection, and manually confirm each field.
Number, date, address, and name fields allow you to review and correct formatted (parsed) parts of a specific value. The editable parts are as follows:
Field Type | Editable Formatted Parts |
---|---|
Number | Value |
Date | |
Address | |
Name | |
When you extract or correct a value for a field of any of these types, the Validation Station tries to automatically parse the value into its formatted components.
The language setting displays the prevalent language detected in the document during the digitization process. This enables the Validation Station to parse numbers and dates more accurately, according to the language of the document. You can change the language setting by using the drop-down menu. When you then manually extract or change a date or number value, the Validation Station first tries to format the selected string according to the selected language, and falls back to English (US) if that parsing fails. The formatting function only applies to the editable formatted parts of a value, not to the original string value.
To ensure the best automatic formatted value detection, we recommend you check the detected language and correct it if necessary.
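The try-selected-language-then-fall-back behavior can be illustrated for numbers with a short Python sketch. Everything here is an assumption made for illustration: the `SEPARATORS` table, the two language codes, and the matching rules are hypothetical and much simpler than real locale-aware parsing, and dates are not covered.

```python
import re
from decimal import Decimal
from typing import Dict, Optional, Tuple

# (thousands separator, decimal separator) per language; a tiny illustrative subset.
SEPARATORS: Dict[str, Tuple[str, str]] = {
    "en-US": (",", "."),
    "de-DE": (".", ","),
}


def _parse(text: str, language: str) -> Optional[Decimal]:
    """Parse text using one language's separators, or return None if it does not fit."""
    separators = SEPARATORS.get(language)
    if separators is None:
        return None
    thousands, decimal_sep = separators
    t, d = re.escape(thousands), re.escape(decimal_sep)
    # Either digits grouped in threes, or plain digits, with an optional fraction.
    pattern = rf"-?(?:\d{{1,3}}(?:{t}\d{{3}})+|\d+)(?:{d}\d+)?"
    text = text.strip()
    if not re.fullmatch(pattern, text):
        return None
    return Decimal(text.replace(thousands, "").replace(decimal_sep, "."))


def parse_number(text: str, language: str) -> Optional[Decimal]:
    """Try the selected language first, then fall back to English (US)."""
    result = _parse(text, language)
    if result is None and language != "en-US":
        result = _parse(text, "en-US")
    return result
```

The string `1.234` shows why the language setting matters: with `de-DE` selected this sketch reads it as one thousand two hundred thirty-four, while with `en-US` it reads it as a value just above one.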
You have the option to report a document as an exception. If this situation occurs, the Present Validation Station activity throws an exception that should be caught by the RPA workflow and treated separately. The exception message displays the Reason for Exception filled in by the user.
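The catch-and-treat-separately pattern can be sketched in Python as follows. This is not UiPath code: the `DocumentReportedAsException` class and the `present_validation_station` stub are hypothetical stand-ins, and in a real workflow the equivalent would be a Try Catch around the activity.

```python
from typing import Any, Dict, Optional


class DocumentReportedAsException(Exception):
    """Hypothetical stand-in for the exception thrown when a user reports a document."""


def present_validation_station(document_path: str,
                               reported_reason: Optional[str] = None) -> Dict[str, Any]:
    """Stub that simulates the user either validating or reporting the document."""
    if reported_reason is not None:
        # The message carries the reason filled in by the user.
        raise DocumentReportedAsException(reported_reason)
    return {"document": document_path, "validated": True}


def process(document_path: str, reported_reason: Optional[str] = None) -> Dict[str, Any]:
    """Validate a document and route reported documents to a separate branch."""
    try:
        results = present_validation_station(document_path, reported_reason)
    except DocumentReportedAsException as exc:
        return {"status": "exception", "reason": str(exc)}
    return {"status": "validated", "results": results}
```

Keeping the reported reason in the returned record lets the exception branch log or escalate the document without reopening it.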
Here is how you can report a document as an exception:
You have the option to manually or automatically confirm all fields. For manual confirmation, you need to select the check box of each field. Any field whose check box you have not selected is confirmed automatically when you click the Save button and then confirm the action by clicking the Continue & save button.
The following options are available at the bottom of the Validation Station:
| Button | Function |
|---|---|
| | Displays and allows users to correct the automatically detected prevalent language within the document. |
| | Saves the confirmed fields. |
| | Saves and closes the table selection area. The button is enabled only when the table field is active. |
| | Reports the document as being an exception. |
| | Enabled only when not all values are manually confirmed. Selecting it automatically confirms and saves all data. |
| | Enabled when no change has been done to the table. |
| | Enabled after a change has been done to the table. |