document-understanding
latest
false
UiPath logo, featuring letters U and I in white

Document Understanding User Guide

Automation CloudAutomation Cloud Public SectorAutomation SuiteStandalone
Last updated Dec 12, 2024

ML models and capabilities

All of our models can be trained to understand any language recognized by an OCR. This also applies to languages not immediately recognized by the model. For instance, if you're working with purchase orders in Romanian, you'll need to specifically train the model with Romanian language purchase orders. Check the OCR page for a list of languages supported by our OCR engines.

Important: Exceptions are Chinese, Japanese, Korean and right-to-left languages. To process documents in these languages, you can only train them on the out-of-the-box model package specifically designed for that language (like Invoices China, Invoices Hebrew, or Invoices Japan). Alternatively, you can use the generic Document Understanding package.
The following table lists the languages supported out-of-the-box by our models.
ML model or capabilityLanguages supported out of the box (pre-trained)
Form ExtractorThe same languages supported by the OCR engine
Intelligent Keyword ClassifierThe same languages supported by the OCR engine
Machine Learning ClassifierThe same languages supported by the model used
Machine Learning ExtractorThe same languages supported by the model used
Generative ClassifierThe same languages supported by the OCR engine
Generative ExtractorThe same languages supported by the OCR engine
709 - Preview English
941x - Preview English
1040English
1040 Schedule C - Preview English
1040 Schedule D - Preview English
1040 Schedule E - Preview English
1040x - Preview English
3949a - Preview English
9465 - Preview English
4506TEnglish
Acord125English
Acord126English
Acord131English
Acord140English
Acord25English
Bank StatementsEnglish
Bills of Lading
  • English
  • German
Certificate of Incorporation/ Good StandingEnglish
Certificate of OriginEnglish
ChecksEnglish
Children Product CertificateEnglish
CMS1500English
EU Declaration of ConformityEnglish
Financial StatementsEnglish
FM1003 - Preview English
I9English
IDCardsTrained to support ID cards and driver's licenses from:
  • Aadhaar
  • Australia
  • Austria
  • Belgium
  • Canada
  • Croatia
  • Cyprus
  • Finland
  • France
  • Germany
  • Hong Kong
  • Hungary
  • India
  • Italy
  • Netherlands
  • PAN cards
  • Poland
  • Romania
  • Saudi Arabian
  • Spain
  • Switzerland
  • United Kingdom
  • United States of America (all 50 states, including Washington D.C.)
Invoices
  • English
  • French
  • German
  • Portuguese
  • Romanian
  • Spanish
InvoicesAustralia
Note: Deprecated: model has been merged into the Invoices model. See the Invoices model page for details.
English
Invoices China
  • Chinese - Simplified
  • Chinese - Traditional
Invoices Hebrew - Preview Hebrew
Invoices IndiaEnglish
Invoices JapanJapanese
Invoices ShippingEnglish
Packing ListsEnglish
PassportsAll nationalities
Pay slipsEnglish
Purchase Orders
  • English
  • German
Receipts
  • English
  • Finnish
  • French
  • German
  • Norwegian
  • Romanian
  • Spanish
Japanese
RemittanceAdvicesEnglish
UB04 - ML package - Preview English
Utility BillsEnglish
Vehicle TitlesEnglish
W2English
W9
  • English
  • Spanish

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2024 UiPath. All rights reserved.