- Visão geral
- Processo do Document Understanding
- Tutoriais de início rápido
- Componentes do framework
- Pacotes de ML
- Visão geral
- Document Understanding - Pacote de ML
- DocumentClassifier - Pacote de ML
- Pacotes de ML com recursos de OCR
- 1040 - Pacote de ML
- 1040 Schedule C - Pacote de ML
- 1040 Schedule D - Pacote de ML
- 1040 Schedule E - Pacote de ML
- 4506T - Pacote de ML
- 990 - Pacote de ML - Prévia
- ACORD125 - Pacote de ML
- ACORD126 - Pacote de ML
- ACORD131 - Pacote de ML
- ACORD140 - Pacote de ML
- ACORD25 - Pacote de ML
- Extratos bancários - Pacote de ML
- ConhecimentoDeEmbarque - Pacote de ML
- Certificado de incorporação - Pacote de ML
- Certificado de origem - Pacote de ML
- Cheques - Pacote de ML
- Certificado de produtos filhos - Pacote de ML
- CMS1500 — Pacote de ML
- Declaração de Conformidade da UE - Pacote de ML
- Demonstrações financeiras - Pacote de ML
- FM1003 - Pacote de ML
- I9 - Pacote de ML
- Cartões de identificação - Pacote de ML
- Faturas - Pacote de ML
- FaturasAustrália - Pacote de ML
- FaturasChina - Pacote de ML
- FaturasÍndia - Pacote de ML
- FaturasJapão - Pacote de ML
- Envio de faturas - Pacote de ML
- Romaneio de carga - Pacote de ML
- Passaportes - Pacote de ML
- Contracheques — Pacote de ML
- Ordens de compra - Pacote de ML
- Recibos – Pacote de ML
- AvisosDePagamento - Pacote de ML
- UB04 - Pacote de ML
- Contas de serviços - Pacote de ML
- Títulos de veículos - Pacote de ML
- W2 - Pacote de ML
- W9 - Pacote de ML
- Outros pacotes de ML prontos para uso
- Endpoints públicos
- Requisitos de Hardware
- Pipelines
- Document Manager
- Serviços de OCR
- Aprendizagem profunda
- Document Understanding implantado no Automation Suite
- Instalar e usar
- Experiência de primeira execução
- Implantar o UiPathDocumentOCR
- Implante um Pacote de ML pré-configurado
- Pacotes offline 2023.10.12+patch1
- Pacotes offline 2023.10.12
- Pacotes offline 2023.10.11
- Pacotes offline 2023.10.10
- Pacotes offline 2023.10.9
- Pacotes offline 2023.10.8
- Pacotes offline 2023.10.7+patch1
- Pacotes offline 2023.10.7
- Pacotes offline 2023.10.6
- Pacotes offline 2023.10.5
- Pacotes offline 2023.10.4
- Pacotes offline 2023.10.3
- Pacotes offline 2023.10.2
- Pacotes offline 2023.10.1
- Pacotes offline 2023.10.0
- Use o Document Manager
- Usar o Framework
- Document Understanding implantado no AI Center autônomo
- Licenciamento
- Atividades
- UiPath.Abbyy.Activities
- UiPath.AbbyyEmbedded.Activities
- UiPath.DocumentProcessing.Contracts
- UiPath.DocumentUnderstanding.ML.Activities
- UiPath.DocumentUnderstanding.OCR.LocalServer.Activities
- UiPath.IntelligentOCR.Activities
- UiPath.OCR.Activities
- UiPath.OCR.Contracts
- UiPath.OmniPage.Activities
- UiPath.PDF.Activities

Document Understanding user guide
Introdução
The UiPath® Document UnderstandingTM framework facilitates the processing of incoming files, from file digitization to extracted data validation, all in an open, extensible, and versatile environment.
O Document Understanding foi projetado para ajudar você a combinar diferentes abordagens para extrair informações de vários tipos de documentos. O principal objetivo é tornar o processo de extração de dados o mais fácil possível: criar um único fluxo de trabalho que extrairá dados de vários documentos.
Antes de usar a estrutura do Document Understanding, é recomendável entender os seguintes componentes do framework Document Understanding:
- Taxonomia What documents need to be processed and what data is required from them? Used to define the document types and the pieces of information targeted for data extraction (fields) for each document type, and formalizes this information into a dedicated Taxonomy structure. This metadata information is managed through the Taxonomy Manager.
- Digitalização What does this file contain? Used to obtain the textual content and the structure of the incoming document, turning a file into machine-readable content so it can be further processed downstream.
- Classificação de Documento What types of documents from the taxonomy are found in this file? Used to automatically determine what document types are found within a digitized file.
- Validação da classificação de documentos Is the predicted classification correct? This is how I can review and correct it. Used for assisting in the human validation and correction of the automatic classification and document splitting results.
- Treinamento de classificação Did the human review the data? This is how the robot can learn from it. Used to pass the human validated information back to the classifiers, to use it to improve their future predictions.
- Extração de Dados What data can be found in this particular document? Used to capture the information required for the identified document type, within the given input document and classification page range.
- Validação de extração de dados Is the extracted information correct? This is how I can review and correct it. Used for assisting in the human validation and correction of the automatically extracted data results.
- Treinamento em extração de dados Did the human review the data? This is how the robot can learn from it. Used to pass the human validated extracted data back to the extractors, to use it to improve their extraction predictions.
- Data Consumption Used to export the validated data in order to consume it.
- Metering & Charging Logic Used to explain the consumption of units per page for each available service.
O diagrama a seguir apresenta os componentes da Estrutura do Document Understanding e como eles se relacionam entre si:

The Document Understanding framework is found in the UiPath.IntelligentOCR.Activities package. Once the UiPath.IntelligentOCR.Activities package is installed, the Taxonomy Manager wizard appears in the top ribbon of the UiPath Studio. This same package contains all the core document understanding framework activities.
The scope activities (Classify Document Scope, Data Extraction Scope, Train Classifiers Scope, Train Extractors Scope) that are part of the Document Understanding framework allow you to use any document classification and data extraction algorithms that fit your use case and then train these algorithms.
The Document Understanding framework can be used not only with the out-of-the-box classifiers and extractors but also with any custom-built ones. These can be created using the abstract classes from the UiPath.DocumentProcessing.Contracts package and can be implemented as classification or data extraction activities. Custom-built OCR engines can also be created using the abstract classes from the UiPath.OCR.Contracts package.
Recursos
Dedicated Document Understanding courses can be found in the UiPath RPA Academy.
O Fórum Community da UiPath é o local para obter suporte da nossa comunidade sempre crescente de usuários.