Document Understanding
latest
false
- Overview
- Getting Started
- Activities
- Insights Dashboards
- Document Understanding Process
- Quickstart Tutorials
- Framework Components
- ML Packages
- Overview
- Document Understanding - ML Package
- DocumentClassifier - ML Package
- ML Packages With OCR Capabilities
- 1040 - ML Package
- 1040 Schedule C - ML Package
- 1040 Schedule D - ML Package
- 1040 Schedule E - ML Package
- 4506T - ML Package
- 990 - ML Package - Preview
- ACORD125 - ML Package
- ACORD126 - ML Package
- ACORD131 - ML Package
- ACORD140 - ML Package
- ACORD25 - ML Package
- Bank Statements - ML Package
- BillsOfLading - ML Package
- Certificate of Incorporation - ML Package
- Certificate of Origin - ML Package
- Checks - ML Package
- Children Product Certificate - ML Package
- CMS 1500 - ML Package
- EU Declaration of Conformity - ML Package
- Financial Statements - ML Package
- FM1003 - ML Package
- I9 - ML Package
- ID Cards - ML Package
- Invoices - ML Package
- Invoices Australia - ML package
- Invoices China - ML package
- Invoices India - ML package
- Invoices Japan - ML package
- Invoices Shipping - ML Package
- Packing Lists - ML Package
- Payslips - ML Package
- Passports - ML Package
- Purchase Orders - ML Package
- Receipts - ML Package
- RemittanceAdvices - ML Package
- UB04 - ML Package
- Utility Bills - ML Package
- Vehicle Titles - ML Package
- W2 - ML Package
- W9 - ML Package
- Other Out-of-the-box ML Packages
- Public Endpoints
- Traffic limitations
- OCR Configuration
- Pipelines
- OCR Services
- Deep Learning
- Licensing
Document Understanding User Guide
Last updated Apr 26, 2024
Artifacts
For an Evaluation Pipeline, the Outputs pane also includes an artifacts / eval_metrics folder which contains two files:
evaluation_default.xlsx
is an Excel spreadsheet with three different sheets:- The first sheet presents a summary of the overall scores and the scores per batch, for each field, Regular, Column, and Classification fields. A percentage of the perfectly extracted documents is also provided for both per batch and overall documents.
- The second sheet presents a side by side, color coded comparison of Regular Fields, for increasing document accuracy. The most inaccurate documents are presented at the top to facilitate diagnosis and troubleshooting.
- The third sheet presents a side by side color, coded comparison of the Column Fields.
- All scores presented in the Excel file represent accuracy scores.
evaluation_metrics_default.txt
contains the F1 scores of the predicted fields.