UiPath Activities Guide

About the OmniPageOCR Activities Pack

The UiPath.OmniPageOCR.Activities package contains the OmniPage OCR activity, which is powered by the Nuance OmniPage OCR Engine.

This pack incorporates technical solutions that provide all the means of analyzing and processing document flows. The main scope of the OmniPage OCR is of achieving full digitization, classification and data extraction capabilities.

It can be used as an alternative to the other OCR engines. You can use it with any of the available activities from the UI Automation, Computer Vision packages and also with the Digitize Document activity from the Intelligent OCR package.

Within the OmniPage OCR package you can choose to use the Basic or the Extended language package. The difference between the packages is that the Extended package needs to be installed separately and it incorporates a wide range of languages, including Asian, Arabic, Thai, Hebrew and Vietnamese.

Release Notes

Important!

When searching for the OmniPage OCR pack in the Manage Packages option, you can see three packages: UiPath.OmniPage.Activities which contains the activity itself, UiPath.OmniPage.Bundle which is a dependency of the first pack and it is automatically installed with it, and the UiPath.OmniPage.Bundle.Extended pack which can be manually installed afterwords and it adds extra languages.

If you want access to other languages, then please install the Extended package by following the next steps:

  1. Click on the Manage Packages option.
  2. Search for the OmniPage OCR package.
  3. You can see the following activities available: UiPath.OmniPage.Activities which is the standalone package, the UiPath.OmniPage.Bundle package that is already included in the first package, as a dependency, and the UiPath.OmniPage.Bundle.Extended package that includes all the extended languages.
  4. Click on the UiPath.OmniPage.Bundle.Extended package.
  5. Select the desired version and then click on the Save button.

Example of using the OmniPage OCR with an Extended Language

This is how the automation process can be built:

  1. Open Studio and create a new Process named by default Main.

Note:

Add your files to the project directory in order to be able to run the entire process from the same place.

  1. Drag a Sequence container in the Workflow Designer.
    • Create the following variables:
Variable Name
Variable Type
Default Value

textFile

Image

-

extractedText

String

-

  1. Drag a Digitize Document activity inside the Sequence container.
    • In the Properties panel, add the path of the file you want to digitize, in the DocumentPath field. You can find a sample file in the downloadable example.
  2. Place an OmniPage OCR engine inside the Digitize Document activity.
    • In the Properties panel, add the value Image in the Image field.
    • Select the Extended option from the EnginePack drop-down list.
    • Select the check box for the ExtractWords option. This extracts the on-screen position of each detected word.
    • Add the value "qct" in the Language field. This represents the language code for Traditional Chinese.
    • Add the variable extractedText in the Text field for capturing and retaining all the text from the document.
  3. Drag a Write Line activity below the Digitize Document activity.
    • Add the variable extractedText in the Text field.
  4. Run the process. The used activities are analyzing the provided file and extract all the detected words written in the Traditional Chinese language.

Powered by OmniPage OCR.

                Nuance™ | OCR © | 2019 Nuance Communications. All rights reserved.

About the OmniPageOCR Activities Pack


Suggested Edits are limited on API Reference Pages

You can only suggest edits to Markdown body content, but not to the API spec.