
Document Understanding User Guide

Last updated: December 18, 2024

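ML Packages offline installation

Getting started

Depending on the model you want to use, you need to meet the following requirements:

  • For models version 2022.10 and later:
    • Download the required Document Understanding™ bundle. Download links are provided for all available bundles. A DU bundle contains information about all models included in a specific version. For example, dusemistructured-2023.10.0.tar.gz contains information about all out-of-the-box pre-trained ML Packages included in the 2023.10.0 version.
  • For models version 2022.4 and earlier (python37, duv3, python37, duv4):
    • All ML Packages are provided as .zip files and are uploaded directly to AI Center as custom packages. To download the models, contact your account manager, CSM, or the Support team to obtain the download link for each package.
    • Download the required Document Understanding bundle. Download links are provided for all available bundles.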

Installing the offline bundle

The offline installation requires the downloaded DU bundle to be named du-ondemand.tar.gz on the command line. For example, if you downloaded a DU bundle named dusemistructured-2023.10.0.tar.gz, rename it to du-ondemand.tar.gz for the installation.
  1. On a Windows machine, download the bundle directly from the bundle link and rename the file to du-ondemand.tar.gz.
  2. On a Linux machine, download the required bundle on a machine with internet access by running the following command:

    wget -O ~/<bundle-name.tar.gz> 'bundle-link'

    The following example shows how to download the required bundle on Linux:

    wget -O ~/du-ondemand.tar.gz 'https://download.uipath.com/automation-suite/2023.10.0/dusemistructured-2023.10.0.tar.gz'
  3. Copy the bundle to the /uipath/tmp folder on the primary machine of the cluster (where the installation is performed):
    scp ~/<bundle-name.tar.gz> <username>@<node dns>:/uipath/tmp/
  4. Connect to the primary machine and load the bundle:

    ./configureUiPathAS.sh registry upload --optional-offline-bundle "/uipath/tmp/du-ondemand.tar.gz" --offline-tmp-folder "/uipath/tmp"
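
The Linux download and copy steps can be sketched as a small script. This is a minimal sketch under stated assumptions, not an official UiPath tool: the `run` and `stage_bundle` helpers and the `DRY_RUN` switch are hypothetical names, and the SSH user and node name are placeholders supplied by the caller. In its default dry-run mode it only prints the `wget` and `scp` commands, so the sequence can be reviewed before anything is executed.

```shell
#!/bin/sh
# Sketch only: helper names and DRY_RUN are assumptions, not part of the product.

# Print the command in dry-run mode (the default); execute it otherwise.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Download a DU bundle under the required name and copy it to the primary node.
#   $1 = bundle download link
#   $2 = SSH user on the primary node (placeholder)
#   $3 = DNS name of the primary node (placeholder)
stage_bundle() {
  bundle_file="$HOME/du-ondemand.tar.gz"
  run wget -O "$bundle_file" "$1"
  run scp "$bundle_file" "$2@$3:/uipath/tmp/"
}
```

Calling `stage_bundle '<bundle-link>' <username> <node dns>` prints the two commands; setting `DRY_RUN=0` before the call runs them instead. Saving the download directly as du-ondemand.tar.gz avoids a separate rename step.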

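Uploading the models to AI Center

After downloading and installing the models, upload them to AI Center by following the documented upload steps.

Form Extractor and Intelligent Keyword Classifier

Use the following public URLs to access the Form Extractor and the Intelligent Keyword Classifier:

  • <FQDN>/du_/svc/formextractor
  • <FQDN>/du_/svc/intelligentkeywords
Note: When using the public URLs, replace the <FQDN> placeholder with your actual environment information. For example, when used in a workflow, <FQDN>/du_/svc/formextractor becomes https://servicefabricserver.domain.com/du_/svc/formextractor.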
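
The placeholder substitution can be illustrated with a short helper. This is a minimal sketch: `service_url` is a hypothetical name, and the only inputs are the value that replaces <FQDN> and the service name taken from the list of public URLs.

```shell
#!/bin/sh
# Sketch only: service_url is a hypothetical helper name.

# Build a public service URL from the environment's base URL and a service name.
#   $1 = value that replaces <FQDN>, for example https://servicefabricserver.domain.com
#   $2 = service name: formextractor or intelligentkeywords
service_url() {
  base=${1%/}   # drop one trailing slash so the path separator is not doubled
  printf '%s/du_/svc/%s\n' "$base" "$2"
}
```

For example, `service_url 'https://servicefabricserver.domain.com' formextractor` prints the Form Extractor URL from the note, ready to paste into a workflow.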

Uploading the Document Understanding™ bundles to an external Docker registry

Follow these steps to upload a Document Understanding bundle to an external Docker registry:

  • Pull the desired DU image from the UiPath® registry hosted on registry.uipath.com.
  • Rename the image host to match the name of your Docker registry.
  • Push the image to the external Docker registry.

Pulling the desired Document Understanding image

Pull the images from the UiPath® registry by running the following command:
docker pull <uipath_registry_server>/<image_name>

The following example shows how to pull the image for the UiPath Document OCR bundle from the registry named registry.uipath.com:

docker pull registry.uipath.com/aicenter/du-doc-ocr:v23.10.0

Renaming the image host

Rename the image host by running the following command:

docker tag <uipath_registry_server>/<image_name> <your_registry_server>/<image_name>

The following example shows how to retag the image for the UiPath Document OCR bundle from the registry named registry.uipath.com to the registry named registry.mycompany.com:

docker tag registry.uipath.com/aicenter/du-doc-ocr:v23.10.0 registry.mycompany.com/aicenter/du-doc-ocr:v23.10.0

Pushing the image to the external Docker registry

Push the image to the external Docker registry by running the following command:

docker push <your_registry_server>/<image_name>

The following example shows how to push the image for the UiPath Document OCR bundle to the external Docker registry:

docker push registry.mycompany.com/aicenter/du-doc-ocr:v23.10.0
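
The pull, rename, and push sequence can be generated for any image with a small helper. This is a minimal sketch: `mirror_commands` is a hypothetical name, and it only prints the three `docker` commands rather than running them, so the output can be checked first.

```shell
#!/bin/sh
# Sketch only: mirror_commands is a hypothetical helper that prints, but does
# not run, the docker commands for one image.

#   $1 = source image, including registry host and tag
#   $2 = host name of the external registry
mirror_commands() {
  src=$1
  image=${src#*/}        # strip the source registry host, keep path and tag
  dst="$2/$image"
  printf 'docker pull %s\n' "$src"
  printf 'docker tag %s %s\n' "$src" "$dst"
  printf 'docker push %s\n' "$dst"
}
```

Running `mirror_commands registry.uipath.com/aicenter/du-doc-ocr:v23.10.0 registry.mycompany.com` prints the three example commands for the Document OCR image; piping the output to `sh` would execute them on a machine that has Docker and access to both registries.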

Images for each Document Understanding bundle

2023.10.7

Document Understanding bundle | Image
OCR for Chinese, Japanese, Korean | du/du-cjk-ocr-proxy:v23.10-12.01-rc01

2023.10.6

Document Understanding bundle | Image
UiPath Document OCR | aicenter/du-doc-ocr:v23.10-09.25-rc08
UiPathDocumentOCR_CPU | aicenter/du-doc-ocr-cpu:v23.10-09.25-rc08
OCR for Chinese, Japanese, Korean | du/du-cjk-ocr-proxy:v23.10-10.15-rc03
Document Classifier | aicenter/du-ml-document-type-text-classifier:v23.10-09.25-rc08
Out-of-the-box Pre-trained ML Packages | aicenter/du-semistructured:v23.10-09.25-rc08

2023.10.5

Document Understanding bundle | Image
UiPath Document OCR | aicenter/du-doc-ocr:v23.10-rc28
UiPathDocumentOCR_CPU | aicenter/du-doc-ocr-cpu:v23.10-rc28
OCR for Chinese, Japanese, Korean | du/du-cjk-ocr-proxy:v23.10-rc58
Document Classifier | aicenter/du-ml-document-type-text-classifier:v23.10-rc28
Out-of-the-box Pre-trained ML Packages | aicenter/du-semistructured:v23.10-rc28

2023.10.4

Document Understanding bundle | Image
UiPath Document OCR | aicenter/du-doc-ocr:v23.10.4-rc125
UiPathDocumentOCR_CPU | aicenter/du-doc-ocr-cpu:v23.10.4-rc125
OCR for Chinese, Japanese, Korean | du/du-cjk-ocr-proxy:v23.10.4-rc45
Document Classifier | aicenter/du-ml-document-type-text-classifier:v23.10.4-rc125
Out-of-the-box Pre-trained ML Packages | aicenter/du-semistructured:v23.10.4-rc125

2023.10.3

Document Understanding bundle | Image
UiPath Document OCR | aicenter/du-doc-ocr:v23.10.3-rc04
OCR for Chinese, Japanese, Korean | du/du-cjk-ocr-proxy:v23.10.3-rc20
Document Classifier | aicenter/du-ml-document-type-text-classifier:v23.10.3-rc04
Out-of-the-box Pre-trained ML Packages | aicenter/du-semistructured:v23.10.3-rc04

2023.10.2

Document Understanding bundle | Image
UiPath Document OCR | aicenter/du-doc-ocr:v23.10.2-rc22
UiPathDocumentOCR_CPU | aicenter/du-doc-ocr-cpu:v23.10.2-rc22
OCR for Chinese, Japanese, Korean | du/du-cjk-ocr-proxy:v23.10.2-rc13
Document Classifier | aicenter/du-ml-document-type-text-classifier:v23.10.2-rc42
Out-of-the-box Pre-trained ML Packages | aicenter/du-semistructured:v23.10.2-rc22

2023.10.1

Document Understanding bundle | Image
OCR for Chinese, Japanese, Korean | du/du-cjk-ocr-proxy:v23.10.1-rc19

2023.10.0

Document Understanding bundle | Image
UiPath Document OCR | aicenter/du-doc-ocr:v23.10.0-rc68
UiPathDocumentOCR_CPU | aicenter/du-doc-ocr-cpu:v23.10.0-rc68
OCR for Chinese, Japanese, Korean | du/du-cjk-ocr-proxy:v23.10.0-rc82
Out-of-the-box Pre-trained ML Packages | aicenter/du-semistructured:v23.10.0-rc68
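
The image names in these tables omit the registry host. The sketch below shows one way to expand a release's image list into full references for use with `docker pull`. It is a minimal sketch: `full_image_refs` and `images_2023_10_0` are hypothetical helper names, and it assumes that every listed image, including the one under the `du/` path, is served from registry.uipath.com.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical. Assumes every image in the
# tables is served from registry.uipath.com.

# Read image names (one per line) on stdin and print full references.
#   $1 = registry host, defaults to registry.uipath.com
full_image_refs() {
  host=${1:-registry.uipath.com}
  while IFS= read -r image; do
    # Skip blank lines so the input list can be spaced out for readability.
    [ -n "$image" ] && printf '%s/%s\n' "$host" "$image"
  done
  return 0
}

# Images listed for 2023.10.0.
images_2023_10_0() {
  cat <<'EOF'
aicenter/du-doc-ocr:v23.10.0-rc68
aicenter/du-doc-ocr-cpu:v23.10.0-rc68
du/du-cjk-ocr-proxy:v23.10.0-rc82
aicenter/du-semistructured:v23.10.0-rc68
EOF
}
```

Running `images_2023_10_0 | full_image_refs` prints one full reference per image for the 2023.10.0 release. Passing a different host as the first argument produces the matching references on an external registry.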
