document-understanding
latest
false
UiPath logo, featuring letters U and I in white

Document Understanding User Guide

Automation CloudAutomation Cloud Public SectorAutomation SuiteStandalone
Last updated Dec 12, 2024

ML packages with OCR capabilities

Optimize your results and ease your work when using Document UnderstandingTM by incorporating into your workflow one of the ML packages that have OCR capabilities.

UiPathDocumentOCR (on-premises and cloud)

This is a non-retrainable model which can be used with the UiPath Document OCR engine activity as part of the Digitize Document activity. To be used, the ML Skill must first be made public so that a URL can be copy-pasted into the UiPath® Document OCR engine activity.

You can run UiPathDocumentOCR on GPU or CPU, accuracy being the same on both cases, predictions on GPU being faster than the one on CPU.

UiPathDocumentOCR requires access to the Document Understanding metering server at https://du.uipath.com/metering if the ML skill is running on an AI Center on-premises regular deployment. No internet access is needed on AI Center on-premises air-gapped deployments.

UiPathDocumentOCR_CPU (on-premises only)

This ML Package can be deployed the same way as the UiPathDocumentOCR ML Package, with the following differences:

  • it is optimized to run on CPU, so you should see a 3-4x speedup when running in workflow, and 5-10x speedup when using it to import documents into Document Manager
  • accuracy is slightly lower than the UiPathDocumentOCR ML Package, and it is similar to the UiPath.DocumentUnderstanding.OCR.LocalServer Studio package
  • due to being faster, the CPU is also recommended when documents are large (over 20 pages per doc) in the absence of a GPU, which is ideal.

UiPath Extended Languages OCR

UiPath Extended Languages OCR is capable of processing documents in over 200 languages, especially in Chinese, Korean, Vietnamese, Thai, major Indian languages, and languages that use the Cyrilic or Greek alphabets.

You can use the URL of this endpoint into the UiPath Extended Languages OCR activity, or directly in a Document Understanding project at configuration time.

OCR for Chinese, Japanese, Korean (on-premises and cloud)

Available as an endpoint, CPU only, in Document Understanding framework. You can use the URL of this endpoint into the OCR for Chinese, Japanese and Korean activity, or directly in a Document Manager session, at configuration time.

Note: The UiPath Chinese, Japanese, Korean OCR will be deprecated starting with January 2025. We recommend using the UiPath Extended Languages OCR instead. Check the deprecation timeline for more information about upcoming deprecations and removals.

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2024 UiPath. All rights reserved.