- Overview
- Getting started
- Activities
- Insights dashboards
- Document Understanding Process
- Quickstart tutorials
- Framework components
- ML packages
- Overview
- Document Understanding - ML package
- DocumentClassifier - ML package
- ML packages with OCR capabilities
- 1040 - ML package
- 1040 Schedule C - ML package
- 1040 Schedule D - ML package
- 1040 Schedule E - ML package
- 1040x - ML package
- 3949a - ML package
- 4506T - ML package
- 709 - ML package
- 941x - ML package
- 9465 - ML package
- ACORD131 - ML package
- ACORD140 - ML package
- ACORD25 - ML package
- Bank Statements - ML package
- Bills Of Lading - ML package
- Certificate of Incorporation - ML package
- Certificate of Origin - ML package
- Checks - ML package
- Children Product Certificate - ML package
- CMS 1500 - ML package
- EU Declaration of Conformity - ML package
- Financial Statements - ML package
- FM1003 - ML package
- I9 - ML package
- ID Cards - ML package
- Invoices - ML package
- Invoices Australia - ML package
- Invoices China - ML package
- Invoices Hebrew - ML package
- Invoices India - ML package
- Invoices Japan - ML package
- Invoices Shipping - ML package
- Packing Lists - ML package
- Payslips - ML package
- Passports - ML package
- Purchase Orders - ML package
- Receipts - ML Package
- Remittance Advices - ML package
- UB04 - ML package
- Utility Bills - ML package
- Vehicle Titles - ML package
- W2 - ML package
- W9 - ML package
- Other Out-of-the-box ML Packages
- Public endpoints
- Traffic limitations
- OCR Configuration
- Pipelines
- OCR services
- Supported languages
- Deep Learning
- Licensing
Document Understanding User Guide
ML packages with OCR capabilities
Optimize your results and ease your work when using Document UnderstandingTM by incorporating into your workflow one of the ML packages that have OCR capabilities.
This is a non-retrainable model which can be used with the UiPath Document OCR engine activity as part of the Digitize Document activity. To be used, the ML Skill must first be made public so that a URL can be copy-pasted into the UiPath® Document OCR engine activity.
You can run UiPathDocumentOCR on GPU or CPU, accuracy being the same on both cases, predictions on GPU being faster than the one on CPU.
UiPathDocumentOCR requires access to the Document Understanding metering server at https://du.uipath.com/metering if the ML skill is running on an AI Center on-premises regular deployment. No internet access is needed on AI Center on-premises air-gapped deployments.
This ML Package can be deployed the same way as the UiPathDocumentOCR ML Package, with the following differences:
- it is optimized to run on CPU, so you should see a 3-4x speedup when running in workflow, and 5-10x speedup when using it to import documents into Document Manager
- accuracy is slightly lower than the UiPathDocumentOCR ML Package, and it is similar to the UiPath.DocumentUnderstanding.OCR.LocalServer Studio package
- due to being faster, the CPU is also recommended when documents are large (over 20 pages per doc) in the absence of a GPU, which is ideal.
UiPath Extended Languages OCR is capable of processing documents in over 200 languages, especially in Chinese, Korean, Vietnamese, Thai, major Indian languages, and languages that use the Cyrilic or Greek alphabets.
You can use the URL of this endpoint into the UiPath Extended Languages OCR activity, or directly in a Document Understanding project at configuration time.
Available as an endpoint, CPU only, in Document Understanding framework. You can use the URL of this endpoint into the OCR for Chinese, Japanese and Korean activity, or directly in a Document Manager session, at configuration time.