# Digitization related activities

> Digitizes a document, extracting its Document Object Model (DOM) and text and storing them in their corresponding variable types. More details [here](https://docs.uipath.com/activities/other/latest/document-understanding/digitize-document).

## Framework components

### Digitize Document

Digitizes a document, extracting its Document Object Model (DOM) and text and storing them in their corresponding variable types. More details [here](https://docs.uipath.com/activities/other/latest/document-understanding/digitize-document).

## OCR engines

### UiPath Extended Languages OCR

Extracts a string and its information from an indicated UI element or image by using the OCR engine. Visit [UiPath Extended Languages OCR](https://docs.uipath.com/activities/other/latest/document-understanding/uipath-extended-languages-ocr) for more information.

### OCR for Chinese, Japanese, Korean

:::note
The UiPath Chinese, Japanese, Korean OCR will be deprecated starting with January 2025. We recommend using the **UiPath Extended Languages OCR** instead. Check the [deprecation timeline](https://docs.uipath.com/overview/other/latest/overview/deprecation-timeline#document-understanding) for more information about upcoming deprecations and removals.
:::

Extracts a string and its information from an indicated UI element or image by using the OCR engine. Visit [OCR for Chinese, Japanese, Korean](https://docs.uipath.com/activities/other/latest/document-understanding/cjk-ocr) for more information.

### UiPath Document OCR

Extracts a string and associated information about the textual content of document images. More details [here](https://docs.uipath.com/activities/other/latest/document-understanding/ui-path-document-ocr).

### OmniPage OCR

Extracts a string and its information from an indicated UI element or image using OmniPage OCR Engine. More details [here](https://docs.uipath.com/activities/other/latest/document-understanding/omnipage-ocr).

### Google Cloud Vision OCR

Extracts a string and its information from an indicated UI element or image using the Google Cloud OCR engine. More details [here](https://docs.uipath.com/activities/other/latest/ui-automation/google-cloud-ocr).

### Microsoft Azure Computer Vision OCR

Extracts a string and its information from an indicated UI element or image by using the Microsoft Azure Computer Vision OCR engine. More details [here](https://docs.uipath.com/activities/other/latest/ui-automation/microsoft-azure-computer-vision-ocr).

### Microsoft OCR

Extracts a string and its information from the provided image. Microsoft OCR activity uses the Windows 10 built-in OCR, if available, otherwise, it resumes to the default MODI OCR Engine. More details [here](https://docs.uipath.com/activities/other/latest/ui-automation/microsoft-ocr).

### Tesseract OCR

Extracts a string and its information from an indicated UI element or image using Tesseract OCR Engine. More details [here](https://docs.uipath.com/activities/other/latest/ui-automation/google-ocr).