Document Understanding User Guide

DELIVERY:

Last updated Apr 4, 2025

OCR Services

About OCR Services

OCR services are used for the following purposes:

At data labeling time, when importing documents into Document Manager. The engines available for this step are UiPath Document OCR, Google Cloud Vision OCR, and Microsoft Read OCR.
At run time when calling models from RPA workflows. The engines available for this step are all the engines integrated with the UiPath RPA platform including the above, plus Abbyy Finereader, Microsoft OCR (legacy), Microsoft Project Oxford OCR, and Tesseract.

In production, we recommend calling the OCR using the Digitize Document activity in your workflow and passing the Document Object Model as input to the activity calling the ML model. For this purpose, you need to use the Machine Learning Extractor activity (Official feed).

As a quick convenience for testing purposes, you can also configure the OCR directly in AI Center (Settings window), but this is not recommended for production deployments.

Configuring OCR Service in Document Manager and AI Center Document Understanding ML Packages

The table below shows how to configure the six supported OCR engine types in both Document Manager and AI Center.

Important: The ocr.method argument corresponds to the OCR Engine dropdown in the ML Package creation view in AI Center.

OCR Engine	OCR Method	OCR Key	OCR URL
UiPath	UiPath Document OCR	UiPath Automation Cloud Document Understanding API Key Enterprise Plan	`http://<IP_addr>:<port_number>`
Google	Google Cloud Vision OCR	GCP Console API Key	`https://vision.googleapis.com/v1/images:annotate`
Microsoft Read 2.0 On-Prem	Microsoft Read OCR	None	`http://<IP_addr>:<port_number>/vision/v2.0/read/core/Analyze`
Microsoft Read 2.0 Azure	Microsoft Read OCR	API Key for your resource from Azure Portal	`<Azure_resource_Endpoint>/vision/v2.0/read/core/asyncBatchAnalyze`
Microsoft Read 3.2 On-Prem	Microsoft Read OCR	None	`http://<IP_addr>:<port_number>/vision/v3.2/read/analyze`
Microsoft Read 3.2 Azure	Microsoft Read OCR	API Key for your resource from Azure Portal	`<Azure_resource_Endpoint>/vision/v3.2/read/analyze`

On this page

About OCR Services
Configuring OCR Service in Document Manager and AI Center Document Understanding ML Packages

Was this page helpful?

PREVIOUSCheckboxes and Signatures

NEXTInstall and Use

Support and Services

Get The Help You Need

UiPath Academy

Learning RPA - Automation Courses

UiPath Forum

UiPath Community Forum

Trust and Security

Cookies Policy