Document Understanding Activities

Last updated Dec 5, 2024

About the PDF activity package

The UiPath.PDF.Activities pack contains activities designed to extract data from PDF and XPS files and store it into string variables. The data can be extracted from the entire document or from a range of pages specified under the Range property found in each of the activities.

Note: Starting with version 3.3.0, this activity package is validated for use in C# projects.

Note:

If an error mentioning the Docotic.Pdf library is encountered at runtime, then you should upgrade the UiPath.PDF.Activities package to version v3.1.0 or higher.

If you use in your project either one of the following packages: UiPath.DocumentUnderstanding.ML.Activities version 1.7.0 or UiPath.IntelligentOCR.Activities version 4.13.0 or UiPath.PDF.Activities version 3.4.0, then you need to update the rest of the packages to the mentioned versions.

In the case of scanned documents, data extraction can also be achieved by using OCR-based activities, Read PDF With OCR and Read XPS With OCR. You can choose any of the available in the UiPath® ecosystem, apart from the screen OCR related engines, and simply drop the engine in the body of the activity.

Note: If you want to use the UiPath.PDF.Activities package in the same project with the UiPath.IntelligentOCR.Activities package, you need to use either version 2.x of both, or versions 3.x of both. UiPath.IntelligentOCR.Activities version 3.0 and higher is incompatible with a UiPath.PDF.Activities version lower than 3.0, and a UiPath.PDF.Activities version 3.0 or higher is incompatible with an UiPath.IntelligentOCR.Activities version lower than 3.0.

Was this page helpful?

PREVIOUSRelease notes

NEXTProject compatibility

Support and Services

Get The Help You Need

UiPath Academy

Learning RPA - Automation Courses

UiPath Forum

UiPath Community Forum

Trust and Security

Cookies Policy