UiPath Activities

The UiPath Activities Guide

UiPath Screen OCR


Extracts a string, along with associated information, from the text found in an image. The UiPath Screen OCR activity is optimized for screen images, and can be used in any UI Automation scenario that needs an OCR engine.



UiPath Screen OCR is available both as a Cloud service and as part of On-Prem Linux Computer Vision.


Supported Characters

The UiPath Screen OCR activity supports only the following characters:
! " # ¥ £ $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ? @ A B C D E F G H I J K L M N O P Q R S T U V W X Y Z [ \ ] _ a b c d e f g h i j k l m n o p q r s t u v w x y z { | } ~
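
Because the character set is closed, it can be useful to check in advance whether text you expect to read contains anything outside it. The following is a minimal Python sketch of such a check, not part of the activity itself. The function name `unsupported_characters` and the `ignore_whitespace` flag are hypothetical, and skipping whitespace is an assumption (spaces are not in the list, but they normally only separate words).

```python
# Characters copied from the supported list above.
SUPPORTED = set(
    "!\"#¥£$%&'()*+,-./0123456789:;<=>?@"
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ[\\]_"
    "abcdefghijklmnopqrstuvwxyz{|}~"
)


def unsupported_characters(text, ignore_whitespace=True):
    """Return the distinct characters in `text` that are not in SUPPORTED,
    in the order they first appear."""
    found = []
    for ch in text:
        # Assumption: whitespace only separates words, so it is skipped.
        if ignore_whitespace and ch.isspace():
            continue
        if ch not in SUPPORTED and ch not in found:
            found.append(ch)
    return found
```

For example, `unsupported_characters("Total: €42 ^")` returns `['€', '^']`, since neither the euro sign nor the caret appears in the list above.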



  • DisplayName - The display name of the activity.


  • Image - The image that you want to process. This field supports only Image variables.


  • ApiKey - The API key that grants you access to UiPath Screen OCR (not required during the Preview period).
  • Endpoint - The endpoint for UiPath Screen OCR. The default project settings value is
  • Timeout (milliseconds) - Specifies the amount of time (in milliseconds) to wait for a specified element to be found before an error is thrown. The default value is 100000 milliseconds (100 seconds).


  • Private - If selected, the values of variables and arguments are no longer logged at Verbose level.


  • Result - Provides the extracted words along with their on-screen position. This field supports only KeyValuePair<Rectangle,String> variables.
  • Text - Provides the extracted text. This field supports only String variables.
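
The Result output pairs each extracted word with its on-screen position, which makes it possible to rebuild lines of text or locate a word on screen. The sketch below illustrates the idea in Python rather than in a workflow. It assumes each rectangle is represented as a hypothetical `(x, y, width, height)` tuple in pixels; the function name `words_to_lines` and the `line_tolerance` parameter are also illustrative and not part of the activity.

```python
def words_to_lines(pairs, line_tolerance=5):
    """Group (rectangle, word) pairs into lines of text,
    ordered top to bottom and left to right.

    Each rectangle is a hypothetical (x, y, width, height) tuple in pixels.
    """
    # Sort by top edge first so words on the same row end up adjacent.
    ordered = sorted(pairs, key=lambda p: (p[0][1], p[0][0]))
    lines = []  # each entry: [reference_y, [(x, word), ...]]
    for (x, y, _w, _h), word in ordered:
        # A word joins the current line if its top edge is close enough
        # to the top edge of the first word on that line.
        if lines and abs(y - lines[-1][0]) <= line_tolerance:
            lines[-1][1].append((x, word))
        else:
            lines.append([y, [(x, word)]])
    # Within each line, order words by their left edge.
    return [" ".join(w for _x, w in sorted(row)) for _y, row in lines]


pairs = [
    ((120, 10, 40, 12), "Name"),
    ((10, 12, 100, 12), "Customer"),
    ((10, 40, 60, 12), "Total:"),
    ((80, 41, 30, 12), "42"),
]
print(words_to_lines(pairs))  # ['Customer Name', 'Total: 42']
```

The tolerance value is a judgment call: too small and words on a slightly skewed row split into separate lines, too large and adjacent rows merge.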

Updated 8 days ago
