Document Understanding User Guide

DELIVERY:

Last updated May 8, 2025

OCR services

OCR services are used for the following purposes:

At data labeling time, when importing documents into Document Manager. The engines available for this step are UiPath Document OCR, Google Cloud Vision OCR, and Microsoft Read OCR.
At run time when calling models from RPA workflows. Visit OCR Engines to check the engines available for this step.

The table below shows how to configure the six supported OCR engine types in both Document Manager and AI Center.

Important: The ocr.method argument corresponds to the OCR Engine dropdown in the ML Package creation view in AI Center.

OCR Engine	OCR Method	OCR Key	OCR URL
UiPath	UiPath Document OCR (Cloud and On-Premises) UiPathDocumentOCR_CPU(On-Premises) OCR for Chinese, Japanese, Korean (Cloud and On-Premises)	Automation Suite API Key* Enterprise Plan	The ML Skill URL generated after deploying UiPath Document OCR.
Google	Google Cloud Vision OCR	GCP Console API Key	`https://vision.googleapis.com/v1/images:annotate`
Microsoft Read 2.0 On-Prem	Microsoft Read OCR	None	`http://<IP_addr>:<port_number>/vision/v2.0/read/core/Analyze`
Microsoft Read 2.0 Azure	Microsoft Read OCR	API Key for your resource from Azure Portal	`<Azure_resource_Endpoint>/vision/v2.0/read/core/asyncBatchAnalyze`
Microsoft Read 3.2 On-Prem	Microsoft Read OCR	None	`http://<IP_addr>:<port_number>/vision/v3.2/read/analyze`
Microsoft Read 3.2 Azure	Microsoft Read OCR	API Key for your resource from Azure Portal	`<Azure_resource_Endpoint>/vision/v3.2/read/analyze`

On this page

PREVIOUSThe Auto-Fine-tuning Loop (Public Preview)

NEXTOverview

Get The Help You Need

Learning RPA - Automation Courses

UiPath Community Forum