- Overview
- About the Document Understanding activities
- Document Processing Contracts
- Release notes
- About the Document Processing Contracts
- Box Class
- IPersistedActivity interface
- PrettyBoxConverter Class
- IClassifierActivity Interface
- IClassifierCapabilitiesProvider Interface
- ClassifierDocumentType Class
- ClassifierResult Class
- ClassifierCodeActivity Class
- ClassifierNativeActivity Class
- ClassifierAsyncCodeActivity Class
- ClassifierDocumentTypeCapability Class
- ExtractorAsyncCodeActivity Class
- ExtractorCodeActivity Class
- ExtractorDocumentType Class
- ExtractorDocumentTypeCapabilities Class
- ExtractorFieldCapability Class
- ExtractorNativeActivity Class
- ExtractorResult Class
- ICapabilitiesProvider Interface
- IExtractorActivity Interface
- ExtractorPayload Class
- DocumentActionPriority Enum
- DocumentActionData Class
- DocumentActionStatus Enum
- DocumentActionType Enum
- DocumentClassificationActionData Class
- DocumentValidationActionData Class
- UserData Class
- Document Class
- DocumentSplittingResult Class
- DomExtensions Class
- Page Class
- PageSection Class
- Polygon Class
- PolygonConverter Class
- Metadata Class
- WordGroup Class
- Word Class
- ProcessingSource Enum
- ResultsTableCell Class
- ResultsTableValue Class
- ResultsTableColumnInfo Class
- ResultsTable Class
- Rotation Enum
- SectionType Enum
- WordGroupType Enum
- IDocumentTextProjection Interface
- ClassificationResult Class
- ExtractionResult Class
- ResultsDocument Class
- ResultsDocumentBounds Class
- ResultsDataPoint Class
- ResultsValue Class
- ResultsContentReference Class
- ResultsValueTokens Class
- ResultsDerivedField Class
- ResultsDataSource Enum
- ResultConstants Class
- SimpleFieldValue Class
- TableFieldValue Class
- DocumentGroup Class
- DocumentTaxonomy Class
- DocumentType Class
- Field Class
- FieldType Enum
- LanguageInfo Class
- MetadataEntry Class
- TextType Enum
- TypeField Class
- ITrackingActivity Interface
- ITrainableActivity Interface
- ITrainableClassifierActivity Interface
- ITrainableExtractorActivity Interface
- TrainableClassifierAsyncCodeActivity Class
- TrainableClassifierCodeActivity Class
- TrainableClassifierNativeActivity Class
- TrainableExtractorAsyncCodeActivity Class
- TrainableExtractorCodeActivity Class
- TrainableExtractorNativeActivity Class
- Document Understanding Digitizer
- Document Understanding ML
- Document Understanding OCR Local Server
- Document Understanding
- Release notes
- About the Document Understanding activity package
- Project compatibility
- Set PDF Password
- Merge PDFs
- Get PDF Page Count
- Extract PDF Text
- Extract PDF Images
- Extract PDF Page Range
- Extract Document Data
- Create Validation Task and Wait
- Wait for Validation Task and Resume
- Create Validation Task
- Classify Document
- Create Classification Validation Task
- Create Classification Validation Task and Wait
- Wait for Classification Validation Task and Resume
- Intelligent OCR
- Release notes
- About the IntelligentOCR activity package
- Project compatibility
- Configuring Authentication
- Load Taxonomy
- Digitize Document
- Classify Document Scope
- Keyword Based Classifier
- Document Understanding Project Classifier
- Intelligent Keyword Classifier
- Create Document Classification Action
- Wait For Document Classification Action And Resume
- Train Classifiers Scope
- Keyword Based Classifier Trainer
- Intelligent Keyword Classifier Trainer
- Data Extraction Scope
- Document Understanding Project Extractor
- RegEx Based Extractor
- Form Extractor
- Intelligent Form Extractor
- Present Validation Station
- Create Document Validation Action
- Wait For Document Validation Action And Resume
- Train Extractors Scope
- Export Extraction Results
- ML Services
- OCR
- OCR Contracts
- Release notes
- About the OCR Contracts
- Project compatibility
- IOCRActivity Interface
- OCRAsyncCodeActivity Class
- OCRCodeActivity Class
- OCRNativeActivity Class
- Character Class
- OCRResult Class
- Word Class
- FontStyles Enum
- OCRRotation Enum
- OCRCapabilities Class
- OCRScrapeBase Class
- OCRScrapeFactory Class
- ScrapeControlBase Class
- ScrapeEngineUsages Enum
- ScrapeEngineBase
- ScrapeEngineFactory Class
- ScrapeEngineProvider Class
- OmniPage
- PDF
- [Unlisted] Abbyy
- [Unlisted] Abbyy Embedded
About the Document Understanding activities
Document Understanding activities allow you to process, classify, and extract documents, for the purpose of automating interpretation for them.
Document Understanding activities are a key component of document processing. It makes full document processing possible by allowing the system to interpret the meaning of a document and extract information from it. You can use document classification and data extraction methods to extract and interpret information from multiple documents with different structures.
At a high level, documents go through the following five fundamental steps:
- Defining document types and data to be extracted;
- Providing the text and its location;
- Classifying the documents from the specified list;
- Extracting the information;
- Confirming the extracted data by a human.
Then, the extracted data can be exported and consumed the way you want to.
The Document Understanding activities include the following packages:
- Document Processing Contracts: Contains data structures and interfaces to be used for integrating with the Document Understanding framework.
- Document Understanding Digitizer: A library that contains digitization related functions.
- Document Understanding ML: Contains activities that integrate machine learning models for Document Understanding workflows.
- Document Understanding OCR Local Server: Contains no activities, but once installed in a project, it enables you to use UiPath Document OCR running locally on the robot machine.
- Document Understanding Process: A template that provides logging, exception handling, retry mechanisms, and all the methods that should be used in a Document Understanding workflow, out of the box.
- Document Understanding: Allows you to process documents.
- Intelligent OCR: Contains the infrastructure for enabling document processing workflows.
- ML Services: Allows you to retrieve and call all ML Skills available on your tenant, as well as upload files or create labeling tasks.
- OCR: Contains the UiPath Screen OCR and UiPath Document OCR activities which use UiPath's in-house OCR engines.
- OCR Contracts: Contains the contracts needed to develop new OCR engines.
- OmniPage: Contains the OmniPage OCR activity which extracts data from an indicated UI (User Interface) element or image using the OmniPage OCR engine.
- PDF: Contains activities designed to extract data from PDF and XPS files.
Visit Activity or API-based limits to check the current specifications and parameters for optimal utilization of Document Understanding modern projects and Document Understanding activities.
Activity package | Windows - Legacy Studio projects | Windows (Modern) Studio projects | Cross-platform Studio projects |
---|---|---|---|
Document Understanding ML | |||
Document Understanding OCR Local Server | |||
Document Understanding Activities | |||
Intelligent OCR | |||
ML Services | |||
OCR | |||
OCR Contracts | |||
Omnipage | |||