
Document Understanding User Guide

Last updated Dec 12, 2024

Machine Learning Extractor

What is Machine Learning Extractor

The Machine Learning Extractor is a data extraction tool that uses machine learning models to identify and report the data targeted for extraction.

This activity is the companion of UiPath® Document Understanding™ models, as the means to consume such models within your workflows.

The ML approach is strongly recommended for structured or semi-structured documents whose layouts vary greatly between document providers. The extractor uses a trained machine learning model that learns and can then infer values for the targeted fields, even from documents and layouts it has never seen before. In other words, if your documents do not follow a fixed text or layout pattern, the Machine Learning Extractor may be a good option for your use case.

The Machine Learning Extractor can be used in either of two ways:

  • with one of UiPath's public Document Understanding endpoints, if you wish to use generic models targeting certain document types; or
  • with custom-trained machine learning models that start from the available UiPath Document Understanding models.

This extractor can be trained and retrained. See the Machine Learning Extractor Trainer section for details.

Important: Images with a resolution lower than 50 x 50 pixels cannot be processed and generate an error.
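The resolution limit above can be checked before documents reach the extractor. The following is a minimal Python sketch, not part of the UiPath activity: the function names and the `(name, width, height)` input shape are hypothetical, and it assumes that "lower than 50 x 50" means either side is shorter than 50 pixels.

```python
from typing import Iterable, List, Tuple

# Threshold taken from the note above; treating it as a per-side minimum
# is an assumption of this sketch.
MIN_SIDE_PX = 50


def is_processable(width_px: int, height_px: int, min_side: int = MIN_SIDE_PX) -> bool:
    """Return True when both sides are at least `min_side` pixels."""
    return width_px >= min_side and height_px >= min_side


def split_by_resolution(
    pages: Iterable[Tuple[str, int, int]]
) -> Tuple[List[str], List[str]]:
    """Split (name, width, height) entries into accepted and rejected names."""
    accepted: List[str] = []
    rejected: List[str] = []
    for name, width_px, height_px in pages:
        target = accepted if is_processable(width_px, height_px) else rejected
        target.append(name)
    return accepted, rejected
```

Filtering up front lets a workflow route undersized images to manual handling instead of failing during extraction.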

Special requirements

If you use any of the following, you need to use your Automation Cloud Document Understanding API Key:

  • one of UiPath's public Document Understanding endpoints for data extraction, or
  • machine learning models hosted in AI Center in Automation Cloud, or
  • machine learning models hosted in AI Center on-prem, but licensed through Automation Cloud.

To use the Machine Learning Extractor with on-prem licensing, you need to host your Document Understanding models in your AI Center on-prem (air-gapped install) instance.

How to configure

Activity configuration

If the endpoint you are using is licensed through Automation Cloud, you need to provide your Cloud Document Understanding API Key.

If you are using the Machine Learning Extractor with either a UiPath Document Understanding public endpoint or a public ML Skill in AI Center, then you need to configure the Endpoint argument of the activity with the corresponding URL.

If you are using the Machine Learning Extractor with a deployed ML Skill, then you need to configure the ML Skill argument of the activity with the correct selection from your AI Center hosted ML skills list.

If you try to set both options, an error is displayed, either in the Configuration Wizard or directly in the workflow.
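The configuration rules above can be summarized as a small validation routine. This is an illustrative Python sketch only: the activity itself is configured in UiPath Studio, and `ExtractorSettings`, its field names, and `validate` are hypothetical names invented here. Reporting an error when neither option is set is also an assumption of the sketch; the text above only describes the error for setting both.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ExtractorSettings:
    """Hypothetical stand-in for the activity's arguments."""
    endpoint: Optional[str] = None   # URL of a public endpoint or public ML Skill
    ml_skill: Optional[str] = None   # name of a deployed ML Skill in AI Center
    api_key: Optional[str] = None    # Automation Cloud Document Understanding API Key
    licensed_through_automation_cloud: bool = False


def validate(settings: ExtractorSettings) -> List[str]:
    """Return a list of configuration problems; an empty list means valid."""
    problems: List[str] = []
    has_endpoint = bool(settings.endpoint)
    has_skill = bool(settings.ml_skill)

    if has_endpoint and has_skill:
        # Mirrors the documented error for setting both options.
        problems.append("Set either Endpoint or ML Skill, not both.")
    elif not has_endpoint and not has_skill:
        # Assumption of this sketch: one of the two must be provided.
        problems.append("Set one of Endpoint or ML Skill.")

    if settings.licensed_through_automation_cloud and not settings.api_key:
        problems.append("An Automation Cloud Document Understanding API Key is required.")

    return problems
```

For example, `validate(ExtractorSettings(endpoint="https://example.invalid/extract", ml_skill="invoices"))` returns a single problem describing the Endpoint and ML Skill conflict (the URL and skill name are placeholders).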



