Document Understanding activities
Document Understanding Project Extractor
UiPath.IntelligentOCR.Activities.DataExtraction.DuAppExtractor
Description
Extracts data from documents using a specific modern project and version. Visit Document Understanding for Modern Experience key concepts to learn more about modern projects and document types.
You can use this activity only with the Data Extraction Scope activity.
Project compatibility
Windows - Legacy | Windows
Configuration
Designer panel
- Project - Select the desired modern project from the dropdown menu. The available options are:
  - Generative Predefined - Modern project type that uses pre-trained generative models.
  - Predefined Non-Latin Languages - Modern project type that uses pre-trained models for non-Latin document processing scenarios.
  - The modern projects available in the organization and tenant that Studio is connected to.
Note:
If your tenant has more than 500 projects, UiPath Studio and Studio Web display only the first 500 in the Document Understanding Project Extractor activity. Projects beyond the first 500 are not listed and therefore cannot be used.
- Version - Select the deployed version of the desired project. If you select a version, you cannot select a tag. This field is disabled if you select the Predefined project type.
- Tag - Select a tag that links directly to a specific version of your chosen project. For example, if you select the Staging tag, the activity uses the project version assigned to this tag during the document extraction process. If you select a tag, you cannot select a version. This field is disabled if you select the Predefined project type.
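The Version and Tag rules above can be sketched as a small check. This is only an illustration of the documented behavior, not part of the activity: the `ExtractorSelection` class, the `validate_selection` function, and the sample project name `Invoices` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ExtractorSelection:
    """Hypothetical container mirroring the Project, Version, and Tag fields."""
    project: str
    version: Optional[str] = None
    tag: Optional[str] = None


def validate_selection(sel: ExtractorSelection) -> None:
    """Raise ValueError if the selection breaks the rules described above."""
    if sel.project == "Predefined":
        # Version and Tag are disabled for the Predefined project type.
        if sel.version is not None or sel.tag is not None:
            raise ValueError("Version and Tag do not apply to the Predefined project type")
        return
    # Version and Tag are mutually exclusive: a tag already points to a version.
    if sel.version is not None and sel.tag is not None:
        raise ValueError("Select either a Version or a Tag, not both")


# Accepted: the Staging tag resolves to whichever version is assigned to it.
validate_selection(ExtractorSelection("Invoices", tag="Staging"))
```

Passing both a version and a tag for the same project raises `ValueError`, which matches the designer disabling one field once the other is set.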
Properties panel
Common
- DisplayName - The display name of the activity.
Authentication
The Authentication properties of this activity allow you to execute it via on-premises robots. Before configuring these properties, ensure you have fulfilled the prerequisites mentioned on the Configuring Authentication page. Once these steps are completed, you can proceed to fill in the Authentication properties of the activity.
- Runtime Credentials Asset - Use this field when you need to access Document Understanding modern project resources while the robot is connected to a local Orchestrator, or from a different tenant. You can provide a Credential Asset for authentication in one of the following ways:
  - From the dropdown list, select the desired Credential Asset from the Orchestrator to which the UiPath® Robot is connected.
  - If you stored the external application credentials used to access the project in an Orchestrator Credential Asset, manually enter the path to that asset. The path should be in the format: <OrchestratorFolderName>/<AssetName>.
- Runtime Tenant Url - Use this field together with the Runtime Credentials Asset field. Enter the URL of the tenant that the robot connects to in order to run the extraction. The URL should be in the following format:
https://<baseURL>/<OrganizationName>/<TenantName>.
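As a minimal sketch of the two formats above, the following helpers build an asset path and split a tenant URL into its parts. The function names and the sample values (`Finance`, `DuCredentials`, `cloud.example.com`, `AcmeOrg`, `DefaultTenant`) are assumptions for illustration and do not come from UiPath.

```python
from typing import Tuple
from urllib.parse import urlparse


def asset_path(folder_name: str, asset_name: str) -> str:
    """Build a path in the form <OrchestratorFolderName>/<AssetName>."""
    folder = folder_name.strip("/")
    if not folder or not asset_name or "/" in asset_name:
        raise ValueError("Expected a folder name and an asset name without '/'")
    return f"{folder}/{asset_name}"


def parse_tenant_url(url: str) -> Tuple[str, str, str]:
    """Split https://<baseURL>/<OrganizationName>/<TenantName> into its parts."""
    parsed = urlparse(url)
    # Ignore empty segments so that a trailing slash is tolerated.
    segments = [s for s in parsed.path.split("/") if s]
    if parsed.scheme != "https" or not parsed.netloc or len(segments) != 2:
        raise ValueError(
            "Expected https://<baseURL>/<OrganizationName>/<TenantName>"
        )
    return parsed.netloc, segments[0], segments[1]


print(asset_path("Finance", "DuCredentials"))
print(parse_tenant_url("https://cloud.example.com/AcmeOrg/DefaultTenant"))
```

A check like this can catch a missing organization or tenant segment before the value is typed into the Runtime Tenant Url field.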
Input
- Project - Select the desired modern project from the dropdown menu. The available options are:
  - Predefined - A default option that exposes the public UiPath® extraction model. Select this to use the out-of-the-box extraction capabilities of UiPath®.
  - Predefined Non-Latin Languages - Modern project type that uses pre-trained models for non-Latin document processing scenarios.
- Tag - Select a tag that links directly to a specific version of your chosen project. For example, if you select the Staging tag, the activity uses the project version assigned to this tag during the document extraction process. If you select a tag, you cannot select a version. This field is disabled if you select the Predefined project type.
- Timeout (milliseconds) - Specifies the maximum amount of time, in milliseconds, to wait for the activity to run before an error is thrown. The default value is 30000 milliseconds (30 seconds).
- Version - Select the deployed version of the desired project. This field is disabled if you select the Predefined project type.
Misc
- Private - If selected, the values of variables and arguments are no longer logged at Verbose level.
Ensure that the Project Name is supplied with the exact same letter casing as it was originally defined in the Document Understanding project.
You can use variables in the Project and Version fields.
Output
- ReferenceDocumentId - Document ID used within the UiPath Document Understanding system. This field only supports string values.
Configuring the Document Understanding Project Extractor
To map your taxonomy fields to a specific extractor, perform the following steps in the Configure Extractors Wizard:
If you wish to use the same project resources at runtime, ensure the Authentication properties of the activity match those in the Get Capabilities wizard.
- Select Get or refresh extractor capabilities.
- Configure the Design time credentials that allow you to map the taxonomy fields of a modern project from a specific tenant or organization. Before configuring these properties, ensure you have fulfilled the prerequisites mentioned on the Configuring Authentication page. Once these steps are completed, input your external application credentials into the wizard.
  - App Id: Enter the App ID you generated from the external application in the organization you're trying to access.
  - App Secret: Enter the App Secret generated from the same external application.
  - Tenant Url: Provide the URL of the specific tenant whose resources you wish to use. The format of the URL should be: https://<baseURL>/<OrganizationName>/<TenantName>.
- Select Get Projects to populate the Project dropdown list with projects from the organization and tenant where you created the external application.
- For Project, select your desired modern project from the dropdown list.
- For Version, select a version for your chosen project. After you select a version, you cannot select a project tag.
- Optionally, for Tag, select a tag associated with a version of your chosen project. If you select a tag, its corresponding version is used automatically, so you cannot select a version while using a tag.
- Select Get Capabilities.
Note:
If you use variables for the Project, Version, and Tag fields, the Get Capabilities wizard also asks you to select an existing project and version that the robot can access, for configuration purposes.
Figure 1. The Get Capabilities wizard overview
