document-understanding
latest
false
- 概述
- 入门指南
- Activities (活动)
- Insights 仪表板
- Document Understanding 流程
- 快速入门教程
- 框架组件
- 模型详细信息
- 概述
- Document Understanding - ML 包
- DocumentClassifier - ML 包
- 具有 OCR 功能的 ML 包
- 1040 - ML 包
- 1040 附表 C - ML 包
- 1040 附表 D - ML 包
- 1040 附表 E - ML 包
- 1040x - ML 包
- 3949a - ML 包
- 4506T - ML 包
- 709 - ML 包
- 941x - ML 包
- 9465 - ML 包
- ACORD125 - ML 包
- ACORD126 - ML 包
- ACORD131 - ML 包
- ACORD140 - ML 包
- ACORD25 - ML 包
- 银行对账单 - ML 包
- 提单 - ML 包
- 公司注册证书 - ML 包
- 原产地证书 - ML 包
- 检查 - ML 包
- 儿童产品证书 - ML 包
- CMS1500 - ML 包
- 欧盟符合性声明 - ML 包
- 财务报表 (Financial statements) - ML 包
- FM1003 - ML 包
- I9 - ML 包
- ID Cards - ML 包
- Invoices - ML 包
- InvoicesAustralia - ML 包
- 中国发票 - ML 包
- 希伯来语发票 - ML 包
- 印度发票 - ML 包
- 日本发票 - ML 包
- 装运发票 - ML 包
- 装箱单 - ML 包
- 工资单 - ML 包
- 护照 - ML 包
- 采购订单 - ML 包
- 收据 - ML 包
- 汇款通知书 - ML 包
- UB04 - ML 包
- 水电费账单 - ML 包
- 车辆所有权证明 - ML 包
- W2 - ML 包
- W9 - ML 包
- 其他开箱即用的 ML 包
- 公共端点
- 流量限制
- OCR 配置
- 管道
- OCR 服务
- 支持的语言
- 深度学习
- 数据与安全性
- 许可和计费逻辑
重要 :
新发布内容的本地化可能需要 1-2 周的时间才能完成。

Document Understanding classic user guide
上次更新日期 2026年4月23日
“智能 OCR”活动
With Intelligent OCR activities you can process documents in a comprehensive manner, allowing you to not only digitize, extract, classify, and validate documents, but also train your extractor and classifiers on your specific data, so they can be faster and more accurate. The steps involved in creating Document UnderstandingTM processes using Intelligent OCR activities are:
- Create the Taxonomy: Define document types and convert them into a Document Object Model variable using the Load Taxonomy activity.
- Digitize documents: Prepare documents so robots can process them using an OCR engine, by storing their text inside a String variable, and basic information about them inside a Document Object Model file.
- Classify documents: Prepare documents using certain classifiers, so robots can identify what types of files they're processing.
- Validate the classification of documents: Verify and validate that the documents have been correctly classified.
- Train your classifiers: Configure your classifiers based on input received while validating the classification
- Extract data from documents: Identify and extract specific information from your documents using various extractors to send it for validation.
- Validate the extractions documents: Verify and validate the documents you processed, classified, and extracted, using the input of your team members within Action Center.
- Train your extractors: Configure your extractors based on input received while validating the extraction.
- Consume exported data: Once you validate the extracted data, you can use it as it is or export it as a DataSet variable using the Export Extraction Results activity.
在开始之前
在开始使用 IntelligentOCR.Activities 之前,请检查以下特征:
- 高可配置性,这也涉及较高的学习曲线。
- 存在多个对象和活动,旨在满足灵活性。
- 由于以下复杂性,可重用性降低:
- 您需要在工作流中进行多种配置。
- 您需要将显式参数从一个活动重复传递到另一个活动,例如:
- 分类
- 文档对象模型
- 文本
- 分类结果
- 提取结果