What is Machine Learning Extractor
The Machine Learning Extractor is a data extraction tool that uses machine learning models to identify and report the values of the fields targeted for extraction.
This activity is the companion to the UiPath Document Understanding models: it is the means of consuming those models within your workflows.
The ML approach is strongly recommended for structured or semi-structured documents whose layouts vary greatly from one document provider to another. The extractor uses a trained machine learning model that learns and can then infer values for the targeted fields, even from documents and layouts it has never seen before. In other words, if your documents do not follow a consistent text or layout pattern, the Machine Learning Extractor may be a good option for your use case.
The Machine Learning Extractor can be used in multiple ways:
- with one of UiPath's public Document Understanding endpoints, if you wish to use generic models targeting certain document types; or
- with custom-trained machine learning models that start from the available UiPath Document Understanding models.
This extractor can be trained / re-trained. See the How to Train section for details.
Images with a resolution lower than 50 x 50 pixels cannot be processed and generate an error.
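A workflow could screen out undersized images before they reach the extractor. The Python sketch below only illustrates that pre-check and is not part of the activity. The function name `is_processable` is hypothetical, and the sketch assumes that "lower than 50 x 50" means that either dimension is under 50 pixels.

```python
# Minimum side length taken from the limit stated above.
# Applying it to each dimension separately is an assumption.
MIN_SIDE_PX = 50


def is_processable(width_px: int, height_px: int) -> bool:
    """Return True if an image of this size meets the 50 x 50 pixel minimum."""
    return width_px >= MIN_SIDE_PX and height_px >= MIN_SIDE_PX
```

For example, `is_processable(49, 200)` is false under this interpretation because one side falls below the minimum.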
If you use:
- one of UiPath's public Document Understanding endpoints for data extraction, or
- machine learning models hosted in AI Center in Automation Cloud, or
- machine learning models hosted in AI Center on-prem, but licensed through Automation Cloud,
you need to use your Automation Cloud Document Understanding API Key.
To use the Machine Learning Extractor with on-prem licensing, you need to host your Document Understanding models in your AI Center on-prem (air-gapped install) instance.
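The licensing rules above can be read as a small decision table. The Python sketch below is only an illustration of that reading: the `Hosting` enum and the `needs_cloud_api_key` function are hypothetical names, not part of any UiPath API. It also assumes that the air-gapped, on-prem licensed setup does not use the Automation Cloud key, which the text implies but does not state outright.

```python
from enum import Enum


class Hosting(Enum):
    """Hypothetical labels for the hosting options described above."""
    PUBLIC_ENDPOINT = "UiPath public Document Understanding endpoint"
    AI_CENTER_CLOUD = "AI Center in Automation Cloud"
    AI_CENTER_ONPREM_CLOUD_LICENSED = "AI Center on-prem, licensed through Automation Cloud"
    AI_CENTER_ONPREM_AIRGAPPED = "AI Center on-prem (air-gapped), on-prem licensing"


def needs_cloud_api_key(hosting: Hosting) -> bool:
    """Return True when the Automation Cloud Document Understanding API Key is required.

    Assumption: only the air-gapped, on-prem licensed setup works without it.
    """
    return hosting is not Hosting.AI_CENTER_ONPREM_AIRGAPPED
```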
How to Configure
If the endpoint you are using is licensed through Automation Cloud, you need to provide your Cloud Document Understanding API Key.
If you are using the Machine Learning Extractor with either a UiPath Document Understanding public endpoint, or with a public ML Skill in AI Center, then you need to configure the Endpoint argument of the activity with the corresponding URL.
If you are using the Machine Learning Extractor with a deployed ML Skill, then you need to configure the ML Skill argument of the activity with the correct selection from your AI Center hosted ML skills list.
If you try to set both options, an error is displayed, either in the Configuration Wizard or directly in the workflow.
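The rule that Endpoint and ML Skill are mutually exclusive can be sketched as a simple validation step. The Python below is a minimal illustration, not the activity's actual implementation: the function name `validate_extractor_config`, its parameters, and the error messages are hypothetical. Treating the case where neither option is set as an error is also an assumption, since the text only describes the error raised when both are set.

```python
from typing import Optional


def validate_extractor_config(endpoint: Optional[str], ml_skill: Optional[str]) -> str:
    """Return which option is in use, or raise ValueError for an invalid pair."""
    # Treat None, empty, and whitespace-only values as "not set".
    has_endpoint = bool(endpoint and endpoint.strip())
    has_skill = bool(ml_skill and ml_skill.strip())

    if has_endpoint and has_skill:
        # Described in the text: setting both options is an error.
        raise ValueError("Set either Endpoint or ML Skill, not both.")
    if not has_endpoint and not has_skill:
        # Assumption: an extractor with neither option set cannot run.
        raise ValueError("Set one of Endpoint or ML Skill.")
    return "endpoint" if has_endpoint else "ml_skill"
```

Calling `validate_extractor_config("https://example.invalid/du", None)` with a placeholder URL returns `"endpoint"`, while passing both a URL and a skill name raises `ValueError`.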
To learn more about Machine Learning Extractor, please visit this page.