Subscribe

UiPath Document Understanding

UiPath Document Understanding

Document Understanding

Document Understanding is the main starting point for creating new projects. You can use it for structured or semi-structured documents, and also along with pre-trained models. You can always start a training session from scratch, validate your documents, and customize the projects as needed.

1920

Overview page

Here you can find a list of all the created projects, with specific details. You can sort the projects alphabetically, or by date, create a new project, and customize the page view.

Projects

Presents the list of all created projects. There are three types of sorting for Projects:

  • Alphabetically in ascending order
  • Alphabetically in descending order
  • By creation date.
    The default sorting order is by the creation date.
1280

Once a project is created you can select the type of the document. There are two options:

  • Using Forms AI (fixed layout format) - intended for Forms AI
  • Using Semi-Structured AI - intended for Document Manager
1920

If you want to delete a document type, then open your project, select the document type that you want to delete, open the Action Menu and click on the Delete option.

1280

Columns

Use this function to customize the amount of details available for the Projects list. Here's the full list of details that can be displayed for each created project.

  • Project - Displays the name of the project.
  • Document types - Displays the type of documents used for each project.
  • Extractors - Displays the number of extractors used for each project.
  • Pages processed - Displays the number of processed pages for each project.
  • Date - Displays the creation date for each project.
1280
  • Reset - Checks all fields as visible, no matter of the previous selections.
1280

New project

Create a new project by using the New project button. Clicking on the New project opens a new popup window.

807

When a new project is created, the following information is required:

OptionDescriptionField status
NameProvide a name for the new project.Mandatory
DescriptionProvide more details about the project.Optional
OCR methodSelect the OCR method for the new project.
Choose between the following options:
- UiPath Document OCR
- UiPath Chinese, Japanese, Korean OCR
- Google Cloud Vision OCR
- Google Cloud Vision for Japanese
-Microsoftt Read OCR
Mandatory
OCR URLProvide the OCR URL corresponding to the chosen OCR method.
Here's the list of the OCR URLs corresponding to the UiPath OCR method.
Mandatory
OCR API KeyProvide the OCR API Key for the chosen OCR method.
If the OCR method is UiPath OCR, then the value of this field is available on the cloud platform by going to Admin/Licenses/Robots & Services/Document Understanding
Optional
Apply OCR on PDFEstablishes if the OCR process should be applied or not to PDF documents. If set to Yes, the OCR is applied to all PDF pages of the document.
If set to No, the OCR is not applied to any pages and returns only the text embedded in the PDF . When set to Auto OCR applies only to the scanned pages of the document. The default value is Auto.
Mandatory

📘

Note:

UiPath OCR API Key is also available on the cloud platform by going to Admin/Licenses/Robots & Services/Document Understanding and copying the available key.

Project page

Here you can find more in depth details about the created project, edit the name of the project, or create a new document type.

The first row of information presents the following details:

  • Project name - The name of the project.
  • Date - Details about the project time creation.
  • Settings - Opens the popup menu from the Overview page. More details about it can be found here.

Document types and Extractors

Below the general details, you can find two tabs:
Document type - Provides a list of all the available document types for the selected project. Defining a document type is mandatory for creating an extractor or classifier. A document type can be an invoice, a purchase order, or a standardized form.

Document types menuDescription
NamePresents the name list of all created document types.
Extraction methodPresents the used type of extraction method for the selected document type.
PagesPresents the total number of pages found in each document type.
RefreshRefreshes the list of projects.

Extractors - Provides the list of extractors created for the document types of the selected project. Use extractors to train the document types.

Extractors types menuDescription
NamePresents the name list of all extractors from the created document types.
TypePresents the type of used extractor.
Creation datePresents the creation date for each extractor.
Pages processedPresents the number of processed pages by each extractor.
StatusPresents the status of each extractor.
RefreshRefreshes the list of extractors.
Extractor's menuOffers the possibility to copy the Extractor's URL link.

Both Document types and Extractors have three types of sorting the information:

  • Alphabetically in ascending order.
  • Alphabetically in descending order.
  • By creation date.
    The default sorting order is by the creation date.

📘

Note:

Document types are mandatory in a project, while extractors are optional. You can choose to add an extractor for any of the document types.

Other options

The following options are applicable throughout the entire interface of Document Understanding.

Search option enables you to initiate a search among the available list of projects, document types, or extractors. Search is active separately for each selected tab, meaning that if you want to search for an extractor, you should select the Extractors tab. Same applies for Document types. Start a search by typing the name of the project inside the Search bar.

1280

Refresh - Refreshes the list of projects.
Remove project - Deletes the selected project. The Remove project button becomes visible only after opening the action menu.

1280

📘

Note:

When deleting a project, all containing document types and extractors are deleted along with the project.

Page scrolling - Easy scroll through the list of project pages, document types, or extractors. Go page by page or skip directly to the first/last page.

1280

Items per page - Select the number of projects, document types, or extractors displayed per project page.

1280

REST API - Opens the REST API framework capabilities.

Updated 12 days ago


Document Understanding


Suggested Edits are limited on API Reference Pages

You can only suggest edits to Markdown body content, but not to the API spec.