Subscribe

UiPath Document Understanding

UiPath Document Understanding

Configure Data Manager

You must first create a working folder for holding your ML data. This is referenced in all commands documented below.

📘

Note

Run the configuration steps below before launching Data Manager.
If later on, you need to change the configuration (like the OCR engine, or a user password), you need to stop Data Manager using the Docker stop command, run the configuration commands, and then launch Data Manager again.
See here the Docker cheat sheet.

Adding users (only when running the standalone docker container)

An admin user with the admin username and admin password is created by default.

To create new users, stop the Data Manager container if it is running, use the following command, and then start the Data Manager container again:

docker run --rm -it -p <port_number>:80 -v "<path_to_working_folder>:/app/data" aiflprodweacr.azurecr.io/datamanager:latest --license-agreement accept --user <username> --passw <password>

Each user can also modify their password from the Settings -> Password view accessible through the button at the top right of the screen.

Enabling SSL Encryption (https)

This is not necessary when running Data Manager on your own machine or on a secure office network.

However, if you plan to run Data Manager on a remote server open to the Internet, then we strongly suggest you enable SSL encryption.

In order to do this you need to obtain the DNS name of the remote server and to generate a https certificate (.crt file) and key (.key file) for that domain name, and place them in a folder called certs on the remote server.

Then you need to launch the Data Manager using the following command:

docker run -d -p <port_number>:80 -v "<path_to_working_folder>:/app/data" -v "<path_to_certs_folder>:/certs" aiflprodweacr.azurecr.io/datamanager:latest --license-agreement accept --https-certificate /certs/<cert_filename.crt> --https-private-key /certs/<key_filename.key>

In this command, <cert_filename.crt> refers to the name of the .crt file, and <key_filename.key> refers to the name of the .key file which you have placed in the certs folder.

Using a predefined schema

In order to use the Retraining capability in AI Center, you need to use a set of fields based on the fields already extracted by the out-of-the-box pre-trained models offered by UiPath (Invoice and Receipts extraction). This list of fields is called a schema.

To make it easier to get started we are providing the schemas of the out-of-the-box models. These are zip files that you can import into Data Manager just like you would import a dataset, by clicking on the Import button at the top of the screen, and then select the zip file from the dialog. The Data Manager will detect that it is a new schema and will import it directly.

The schemas for the pretrained ML models provided by UiPath are available at the following links:
Invoices:
https://github.com/UiPath/Infrastructure/raw/main/ML/AiFabric/schema/invoices/schema.zip
Receipts:
https://github.com/UiPath/Infrastructure/raw/main/ML/AiFabric/schema/receipts/schema.zip
Purchase Orders:
https://github.com/UiPath/Infrastructure/raw/main/ML/AiFabric/schema/puchase_orders/schema.zip
Utility Bills:
https://github.com/UiPath/Infrastructure/raw/main/ML/AiFabric/schema/utility_bills/schema.zip
Invoices-Australia:
https://github.com/UiPath/Infrastructure/raw/main/ML/AiFabric/schema/invoices-australia/schema.zip
Invoices-India:
https://github.com/UiPath/Infrastructure/raw/main/ML/AiFabric/schema/invoices-india/schema.zip
Invoices-Japan
https://github.com/UiPath/Infrastructure/raw/main/ML/AiFabric/schema/invoices-japan/schema.zip

🚧

Warning

Invoices-Japan ML Model only supports Google Cloud Vision OCR.

Updated 24 days ago


Configure Data Manager


Suggested Edits are limited on API Reference Pages

You can only suggest edits to Markdown body content, but not to the API spec.