UiPath Document Understanding

Machine Learning Extractor

What is Machine Learning Extractor

The Machine Learning Extractor is a data extraction tool that uses machine learning models to identify and report the data targeted for extraction.

This activity is the companion to the UiPath Document Understanding models: it is the means of consuming those models within your workflows.

The ML approach is strongly recommended for structured or semi-structured documents whose layouts vary greatly from one document provider to another. The extractor uses a trained machine learning model that learns and can then infer values for the targeted fields, even from documents and layouts it has never seen before. In other words, if your documents do not follow a fixed text or layout pattern, the Machine Learning Extractor may be a good option for your use case.

The Machine Learning Extractor can be used in multiple ways:

  • with one of UiPath's public Document Understanding endpoints, if you wish to use generic models targeting certain document types; or
  • with custom trained machine learning models starting from the UiPath Document Understanding available models.

This extractor can be trained / re-trained. See the How to Train section for details.

🚧

Warning:

Images with a resolution lower than 50 x 50 pixels cannot be processed and generate an error.
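
A pre-check along these lines can catch undersized images before they reach the extractor. This is a minimal Python sketch using only the standard library, not part of the activity itself. The helper names `png_dimensions` and `is_large_enough` are hypothetical, the reader handles PNG files only, and the sketch assumes an image is rejected when either side is below 50 pixels, which the warning does not spell out.

```python
import struct
from typing import Tuple

MIN_SIDE_PX = 50  # threshold taken from the warning above
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_dimensions(header: bytes) -> Tuple[int, int]:
    """Return (width, height) read from the first 24 bytes of a PNG file."""
    if len(header) < 24 or header[:8] != PNG_SIGNATURE or header[12:16] != b"IHDR":
        raise ValueError("not a PNG header")
    # In the IHDR chunk, width and height are big-endian 32-bit unsigned integers.
    width, height = struct.unpack(">II", header[16:24])
    return width, height


def is_large_enough(width: int, height: int, min_side: int = MIN_SIDE_PX) -> bool:
    # Assumption: an image fails the check if either side is below the minimum.
    return width >= min_side and height >= min_side
```

To check a file on disk, read its first 24 bytes in binary mode and pass them to `png_dimensions`, then pass the result to `is_large_enough`.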

Special Requirements

You need to use your Automation Cloud Document Understanding API Key if you use:

  • one of UiPath's public Document Understanding endpoints for data extraction, or
  • machine learning models hosted in AI Center in Automation Cloud, or
  • machine learning models hosted in AI Center on-prem, but licensed through Automation Cloud.

To use the Machine Learning Extractor with on-prem licensing, you need to host your Document Understanding models in your AI Center on-prem (air-gapped install) instance.
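
The requirements above reduce to a single rule: only an air-gapped on-prem install with on-prem licensing works without the Automation Cloud API Key. The following Python sketch encodes that reading. The `Deployment` enum and `needs_cloud_api_key` function are hypothetical names introduced for illustration and are not part of any UiPath API.

```python
from enum import Enum, auto


class Deployment(Enum):
    """Hypothetical labels for the hosting and licensing options listed above."""
    PUBLIC_ENDPOINT = auto()
    AI_CENTER_AUTOMATION_CLOUD = auto()
    AI_CENTER_ON_PREM_CLOUD_LICENSED = auto()
    AI_CENTER_ON_PREM_AIR_GAPPED = auto()


def needs_cloud_api_key(deployment: Deployment) -> bool:
    # Every option licensed through Automation Cloud requires the key;
    # only the air-gapped on-prem install with on-prem licensing does not.
    return deployment is not Deployment.AI_CENTER_ON_PREM_AIR_GAPPED
```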

How to Configure

Activity Configuration


If the endpoint you are using is licensed through Automation Cloud, you need to provide your Cloud Document Understanding API Key.

If you are using the Machine Learning Extractor with either a UiPath Document Understanding public endpoint, or with a public ML Skill in AI Center, then you need to configure the Endpoint argument of the activity with the corresponding URL.

If you are using the Machine Learning Extractor with a deployed ML Skill, then you need to configure the ML Skill argument of the activity with the correct selection from your AI Center hosted ML skills list.

If you try to set both options, an error is displayed, either in the Configuration Wizard or directly in the workflow.
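
The either/or rule for the Endpoint and ML Skill arguments can be illustrated with a small validation function. This is a Python sketch of the logic only, not how the activity is implemented. `resolve_model_target` is a hypothetical name, and treating the case where neither argument is set as an error is an assumption, since the text above only describes the case where both are set.

```python
from typing import Optional, Tuple


def resolve_model_target(endpoint: Optional[str], ml_skill: Optional[str]) -> Tuple[str, str]:
    """Return ("endpoint", url) or ("ml_skill", name); reject ambiguous input."""
    if endpoint and ml_skill:
        # Mirrors the documented error when both options are set.
        raise ValueError("Set either Endpoint or ML Skill, not both.")
    if endpoint:
        return ("endpoint", endpoint)
    if ml_skill:
        return ("ml_skill", ml_skill)
    # Assumption: the source does not describe this case; the sketch rejects it.
    raise ValueError("Set one of Endpoint or ML Skill.")
```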

Learn More

To learn more about Machine Learning Extractor, please visit this page.


