- Getting Started
- Framework Components
- ML Packages
- Pipelines
- Data Manager
- OCR Services
- Document Understanding deployed in Automation Suite
- Document Understanding deployed in AI Center standalone
- Deep Learning
- Licensing
- References
- UiPath.Abbyy.Activities
- UiPath.AbbyyEmbedded.Activities
- UiPath.DocumentUnderstanding.ML.Activities
- UiPath.DocumentUnderstanding.OCR.LocalServer.Activities
- UiPath.IntelligentOCR.Activities
- UiPath.OCR.Activities
- UiPath.OCR.Contracts
- UiPath.DocumentProcessing.Contracts
- UiPath.OmniPage.Activities
- UiPath.PDF.Activities
Document Understanding User Guide
Language Support
The supported languages for different Document Understanding components can be found in the table below.
Components |
Supported Languages |
---|---|
The left-to-right languages supported by the OCR engine of choice:
Right-to-left languages are not supported even if the OCR engine supports them. | |
Same as above, except for HandwritingRecognition which supports only English. | |
Please refer to ML Packages Supported Languages. For the supported languages, retraining may be required to get the expected accuracy if the documents are considerably different from the original model training dataset. For the languages not supported in this list, you can experiment with the approach of creating a custom model to extract any left-to-right language, assuming the OCR engine supports it as well. Automatic reformatting of dates in a standard
yyyy-mm-dd format for Asian languages is currently supported only for Japanese. For documents in other Asian languages, you can extract
the dates as String content type and format it in the RPA workflow.
|