Document Understanding User Guide

DELIVERY:

Last updated Feb 4, 2025

About ML Packages

Using a Document Understanding ML Package involves these steps:

Collect document samples and the requirements of the data points that need to be extracted.
Label documents using Data Manager.
Data Manager itself will connect to an OCR Service.
Export labeled documents as a Training data set and upload that exported folder to AI Center Storage.
Export labeled documents as a Testing data set and upload that exported folder to AI Center Storage.
Run a Training Pipeline on AI Center.
Evaluate the model performance with an Evaluation Pipeline on AI Center.
Deploy the trained model as an ML Skill in AI Center.
Query the ML Skill from an RPA workflow using the UiPath.DocumentUnderstanding.ML activity package.
Note: Remember that using Document Understanding ML Packages requires that the machine on which AI Center is installed can access https://du-metering.uipath.com.

Important: When creating a UiPath.DocumentUnderstanding.ML.Activities Package in AI Center, the package name should not be any python reserved keyword, such as class , break, from, finally, global, None, etc. Note that this list is not exhaustive since the package name is used for class <pkg-name> and import <pkg-name>.

These are out-of-the-box Machine Learning Models to classify and extract any commonly occurring data points from semi-structured or unstructured documents, including regular fields, table columns, and classification fields, in a template-less approach.

Document Understanding contains multiple ML Packages split into four main categories:

UiPathDocumentOCR

This is a non-retrainable model which can be used with the UiPath Document OCR engine activity as part of the Digitize Document activity. To be used, it must first be made public so that a URL can be copy-pasted into the UiPath Document OCR engine activity.

UiPathDocumentOCR requires access to the Document Understanding metering server at https://du.uipath.com/metering if the ML skill is running on an AI Center on-premises regular deployment. No internet access is needed on AI Center on-premises air-gapped deployments.

The UiPathDocumentOCR ML Package in AI Center is optimized for running on GPU, so we strongly recommend using it on GPU. If no GPU is available, we recommend using the standalone docker container.

DocumentUnderstanding

This is a generic, retrainable model for extracting any commonly occurring data points from any type of structured or semi-structured documents, building a model from scratch. This ML Package must be trained. If deployed without training first, deployment fails with an error stating that the model is not trained.

Out-of-the-box Pre-trained ML Packages

These are retrainable ML Packages that hold the knowledge of different Machine Learning Models.

They can be customized to extract additional fields or support additional languages using Pipeline runs. Using state-of-the-art transfer learning capabilities, this model can be retrained on additional labeled documents and tailored to specific usecases or expanded for additional Latin, Cyrillic or Greek language support.

The dataset used may have the same fields, a subset of the fields, or have additional fields. To benefit from the intelligence already contained in the pre-trained model, you need to use fields with the same names as in the out-of-the-box model itself.

These ML Packages are:

Invoices: The fields extracted out-of-the-box can be found here.
InvoicesAustralia Preview: The fields extracted out-of-the-box can be found here.
InvoicesIndia Preview: The fields extracted out-of-the-box can be found here.
InvoicesJapan Preview: The fields extracted out-of-the-box can be found here.
Receipts: The fields extracted out-of-the-box can be found here.
PurchaseOrders Preview: The fields extracted out-of-the-box can be found here.
UtilityBills Preview: The fields extracted out-of-the-box can be found here.

These models are deep learning architectures built by UiPath. A GPU can be used both at serving time and training time but is not mandatory. A GPU delivers>10x improvement in speed for Training in particular.

Other Out-of-the-box ML Packages

These are non-retrainable Packages that are required for non-ML components of the Document Understanding suite.

These ML Packages are:

FormExtractor: Deploy as Public Skill and paste the URL into the Form Extractor activity.
IntelligentFormExtractor: Deploy as Public Skill and paste the URL into the Intelligent Form Extractor activity. Make sure to first deploy the HandwritingRecognition ML Skill and configure that as the OCR for this package.
IntelligentKeywordClassifier: Deploy as Public Skill and paste the URL into the Intelligent Keyword Classifier activity.
HandwritingRecognition: Deploy as Public Skill and use as OCR when creating the IntelligentFormExtractor package.

On this page

UiPathDocumentOCR
DocumentUnderstanding
Out-of-the-box Pre-trained ML Packages
Other Out-of-the-box ML Packages

Was this page helpful?

PREVIOUSTraining and Evaluation Pipelines

NEXTSupported Languages

Support and Services

Get The Help You Need

UiPath Academy

Learning RPA - Automation Courses

UiPath Forum

UiPath Community Forum

Trust and Security

Cookies Policy