Document Understanding
latest
false
- Overview
- Getting Started
- Activities
- Insights Dashboards
- Document Understanding Process
- Quickstart Tutorials
- Framework Components
- ML Packages
- Overview
- Document Understanding - ML Package
- DocumentClassifier - ML Package
- ML Packages With OCR Capabilities
- 1040 - ML Package
- 1040 Schedule C - ML Package
- 1040 Schedule D - ML Package
- 1040 Schedule E - ML Package
- 4506T - ML Package
- 990 - ML Package - Preview
- ACORD125 - ML Package
- ACORD126 - ML Package
- ACORD131 - ML Package
- ACORD140 - ML Package
- ACORD25 - ML Package
- Bank Statements - ML Package
- BillsOfLading - ML Package
- Certificate of Incorporation - ML Package
- Certificate of Origin - ML Package
- Checks - ML Package
- Children Product Certificate - ML Package
- CMS 1500 - ML Package
- EU Declaration of Conformity - ML Package
- Financial Statements - ML Package
- FM1003 - ML Package
- I9 - ML Package
- ID Cards - ML Package
- Invoices - ML Package
- Invoices Australia - ML package
- Invoices China - ML package
- Invoices India - ML package
- Invoices Japan - ML package
- Invoices Shipping - ML Package
- Packing Lists - ML Package
- Payslips - ML Package
- Passports - ML Package
- Purchase Orders - ML Package
- Receipts - ML Package
- RemittanceAdvices - ML Package
- UB04 - ML Package
- Utility Bills - ML Package
- Vehicle Titles - ML Package
- W2 - ML Package
- W9 - ML Package
- Other Out-of-the-box ML Packages
- Public Endpoints
- Traffic limitations
- OCR Configuration
- Pipelines
- OCR Services
- Deep Learning
- Licensing
Supported Languages
Document Understanding User Guide
Last updated Apr 26, 2024
Supported Languages
- For OCR Language Support see, OCR.
- For Ml Packages Language Support see ML Packages.
- For On-Prem Endpoints Language Support, see On-Prem Endpoints.
Observations
- To train a model on Japanese documents, use either the DocumentUnderstanding package or the InvoicesJapan package.
- To train a model on Chinese documents, use either the DocumentUnderstanding package or the InvoicesChina package.
- To train a model on Latin script documents, use any package except for InvoicesJapan or InvoicesChina.
- For the supported languages, retraining may be required to get the expected accuracy if the documents are considerably different from the original model training dataset.
- For the supported languages which are not pre-trained by the model, you can train a model with your own data in AI Center, assuming the OCR engine supports it as well.