Activities
latest
false
Banner background image
Document Understanding Activities
Last updated Apr 10, 2024

Using OmniPage With an Extended Language

This is how the automation process can be built:

  1. Open Studio and create a new Process named by default Main.

    Note: Add your files to the project directory in order to be able to run the entire process from the same place.
  2. Drag a Sequence container in the Workflow Designer.

    • Create the following variables:

      Variable Name

      Variable Type

      Default Value

      textFile

      Image

       

      extractedText

      String

       
  3. Drag a Digitize Document activity inside the Sequence container.

    • In the Properties panel, add the path of the file you want to digitize, in the DocumentPath field. You can find a sample file in the downloadable example.
  4. Place an OmniPage OCR engine inside the Digitize Document activity.

    • In the Properties panel, add the value Image in the Image field.
    • Select the Extended option from the EnginePack drop-down list.
    • Select the check box for the ExtractWords option. This extracts the on-screen position of each detected word.
    • Add the value "qct" in the Language field. This represents the language code for Traditional Chinese.
    • Add the variable extractedText in the Text field for capturing and retaining all the text from the document.
  5. Drag a Write Line activity below the Digitize Document activity.

    • Add the variable extractedText in the Text field.
  6. Run the process. The used activities are analyzing the provided file and extract all the detected words written in the Traditional Chinese language.

Download the example from here.

Powered by OmniPage OCR.



Nuance™ | OCR © | 2019 Nuance Communications. All rights reserved.

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2024 UiPath. All rights reserved.