Document Understanding User Guide

Last updated Nov 7, 2024

OCR

Printed text

Language (Language Code) | UiPath Document OCR
Adyghe (ADY) |
Afar (AA) |
Afrikaans (AFR) | available
Akan (AK) |
Albanian (SQI) | available
Algonquin (ALQ) |
Angika (Devanagari) (ANP) |
Arabic (ARA) | available (Preview)
Asturian (AST) | available
Asu (ASA) |
Avaric (AV) |
Awadhi-Hindi (Devanagari) (AWA) |
Aymara (AYM) |
Azerbaijani (Latin) (AZ) |
Bafia (KSF) |
Bagheli (BFY) |
Bambara (BM) |
Bashkir (BA) |
Basque (EU) | available
Belarusian (Cyrillic) (BE, BE-CYRL) |
Belarusian (Latin) (BE, BE-LATN) |
Bemba (BEM) |
Bena (BEZ) |
Bhojpuri-Hindi (Devanagari) (BHO) |
Bikol (BIK) |
Bislama (BI) | available
Bodo (Devanagari) (BRX) |
Bosnian (Latin) (BS) |
Brajbha (BRA) |
Breton (BR) |
Bulgarian (BG) |
Bundeli (BNS) |
Buryat (Cyrillic) (BUA) |
Catalan (CA) | available
Cebuano (CEB) | available
Chamling (RAB) |
Chamorro (CH) |
Chechen (CE) |
Chhattisgarhi (Devanagari) (HNE) |
Chiga (CGG) |
Chinese - Simplified (ZH-Hans) |
Chinese - Traditional (Hant) |
Choctaw (CHO) |
Chukot (CKT) |
Chuvash (CV) |
Cornish (KW) | available
Corsican (CO) |
Cree (CR) |
Creek (MUS) |
Crimean Tatar (Latin) (CRH) |
Croatian (HR) | available
Crow (CRO) |
Czech (CS) | available
Danish (DA) | available
Dargwa (DAR) |
Dari (PRS) |
Dhimal (Devanagari) (DHI) |
Dogri (Devanagari) (DOI) |
Duala (DUA) |
Dungan (DNG) |
Dutch (NL) | available
Efik (EFI) |
English (EN) | available
Erzya (Cyrillic) (MYV) |
Estonian (ET) | available
Faroese (FO) |
Fijian (FJ) | available
Filipino (FIL) | available
Finnish (FI) | available
Fon (FON) |
French (FR) | available
Friulian (FUR) | available
Ga (GAA) |
Gaelic - Irish (GA) | available
Gaelic - Scottish (GD) | available
Gagauz (Latin) (GAG) | available
Galician (GL) | available
Ganda (LG) |
Gayo (GAY) |
German (DE) | available
Gilbertese (GIL) | available
Gondi (Devanagari) (GON) |
Greek (EL) |
Greenlandic (KL) |
Guarani (GN) |
Gurung (Devanagari) |
Gusii (GUZ) |
Haitian Creole (HT) | available
Halbi (Devanagari) (HLB) |
Hani (HNI) | available
Haryanvi (BGC) |
Hawaiian (HAW) |
Hebrew (HE) | available
Herero (HZ) |
Hiligaynon (HIL) |
Hindi (HI) |
Hmong Daw (Latin) (MWW) | available
Ho (Devanagari) (HOC) |
Hungarian (HU) | available
Iban (IBA) |
Icelandic (IS) |
Igbo (IG) |
Iloko (ILO) |
Inari Sami (SMN) |
Indonesian (ID) | available
Ingush (INH) |
Interlingua (IA) | available
Inuktitut (Latin) (IU) |
Italian (IT) | available
Japanese (JA) |
Jaunsari (Devanagari) (JNS) |
Javanese (JV) | available
Jola-Fonyi (DYO) |
Kabardian (KBD) |
Kabuverdianu (KEA) |
Kachin (Latin) (KAC) | available
Kalenjin (KLN) |
Kalmyk (XAL) |
Kangri (Devanagari) (XNR) |
Kanuri (KR) |
Karachay-Balkar (KRC) |
Kara-Kalpak (Cyrillic) (KAA-CYR) |
Kara-Kalpak (Latin) (KAA) |
Kashubian (CSB) |
Kazakh (Cyrillic) (KK-CYR) |
Kazakh (Latin) (KK-LATN) |
Khakas (KJH) |
Khaling (KLR) |
Khasi (KHA) | available
K'iche' (QUC) |
Kikuyu (KI) |
Kildin Sami (SJD) |
Kinyarwanda (RW) |
Komi (KV) |
Kongo (KN) |
Korean (KO) |
Korku (KFQ) |
Koryak (KPY) |
Kosraean (KOS) |
Kpelle (KPE) |
Kuanyama (KJ) |
Kumyk (Cyrillic) (KUM) |
Kurdish (Arabic) (KU-ARAB) |
Kurdish (Latin) (KU-LATN) |
Kurukh (Devanagari) (KRU) |
Kyrgyz (Cyrillic) (KY) |
Lak (LBE) |
Lakota (LKT) |
Latin (LA) | available
Latvian (LV) | available
Lezghian (LEX) |
Lingala (LN) |
Lithuanian (LT) | available
Lower Sorbian (DSB) |
Lozi (LOZ) |
Lule Sami (SMJ) |
Luo (Kenya and Tanzania) (LUO) |
Luxembourgish (LB) | available
Luyia (LUY) |
Macedonian (MK) |
Machame (JMC) |
Madurese (MAD) |
Mahasu Pahari (Devanagari) (BFZ) |
Makhuwa-Meetto (MGH) |
Makonde (KDE) |
Malagasy (MG) |
Malay (Latin) (MS) | available
Maltese (MT) |
Malto (Devanagari) (KMJ) |
Mandinka (MNK) |
Manx (GV) |
Maori (MI) |
Mapudungun (ARN) |
Marathi (MR) |
Mari (Russia) (CHM) |
Masai (MAS) |
Mende (Sierra Leone) (MEN) |
Meru (MER) |
Meta' (MGO) |
Minangkabau (MIN) |
Mohawk (MOH) |
Mongolian (Cyrillic) (MN) |
Mongondow (MOG) |
Montenegrin (Cyrillic) (CNR-CYRL) |
Montenegrin (Latin) (CNR-LATN) |
Morisyen (MFE) |
Mundang (MUA) |
Nahuatl (NAH) |
Navajo (NV) |
Ndonga (NG) |
Neapolitan (NAP) | available
Nepali (NE) |
Ngomba (JGO) |
Niuean (NIU) |
Nogay (NOG) |
North Ndebele (ND) |
Northern Sami (Latin) (SME) |
Norwegian (NO) | available
Nyanja (NY) |
Nyankole (NYN) |
Nzima (NZI) |
Occitan (OC) | available
Ojibway (OJ) |
Oromo (OM) |
Ossetic (OS) |
Pampanga (PAM) |
Pangasinan (PAG) |
Papiamento (PAP) |
Pashto (PS) |
Pedi (NSO) |
Persian (FA) |
Polish (PL) | available
Portuguese (PT) | available
Punjabi (Arabic) (PA) |
Quechua (QU) |
Ripuarian (KSH) | available
Romanian (RO) | available
Romansh (RM) | available
Rundi (RN) |
Russian (RU) |
Rwa (RWK) |
Sadri (Devanagari) (SCK) |
Sakha (SAH) |
Samburu (SAQ) |
Samoan (Latin) (SM) |
Sango (SG) |
Sangu (Gabon) |
Sanskrit (Devanagari) (SA) |
Santali (Devanagari) (SAT) |
Scots (SCO) |
Sena (SEH) |
Serbian (Cyrillic) (SR-CYRL) |
Serbian (Latin) (SR, SR-LATN) | available
Shambala (KSB) |
Shona (SN) |
Siksika (BLA) |
Sirmauri (Devanagari) (SRX) |
Skolt Sami (SMS) |
Slovak (SK) | available
Slovenian (SL) | available
Soga (XOG) |
Somali (Arabic) (SO) |
Somali (Latin) (SO-LATN) |
Songhai (SON) |
South Ndebele (NR) |
Southern Altai (ALT) |
Southern Sami (SMA) |
Southern Sotho (ST) |
Spanish (ES) | available
Sundanese (SU) |
Swahili (Latin) (SW) | available
Swati (SS) |
Swedish (SV) | available
Tabassaran (TAB) |
Tachelhit (SHI) |
Tahitian (TY) |
Taita (DAV) |
Tajik (Cyrillic) (TG) |
Tamil (TA) |
Tatar (Cyrillic) (TT-CYRL) |
Tatar (Latin) (TT) |
Teso (TEO) |
Tetum (TET) | available
Thai (TH) |
Thangmi (THF) |
Tok Pisin (TPI) |
Tongan (TO) | available
Tsonga (TS) |
Tswana (TN) |
Turkish (TR) | available
Turkmen (Latin) (TK) |
Tuvan (TYV) |
Udmurt (UDM) |
Uighur (Cyrillic) (UG-CYRL) |
Ukrainian (UK) |
Upper Sorbian (HSB) |
Urdu (UR) |
Uyghur (Arabic) (UG) |
Uzbek (Arabic) (UZ-ARAB) |
Uzbek (Cyrillic) (UZ-CYRL) |
Uzbek (Latin) (UZ) | available
Vietnamese (VI) |
Volapük (VO) | available
Vunjo (VUN) |
Walser (WAE) |
Welsh (CY) | available
Western Frisian (FY) |
Wolof (WO) |
Xhosa (XH) |
Yucatec Maya (YUA) |
Zapotec (ZAP) |
Zarma (DJE) |
Zhuang (ZA) |
Zulu (ZU) | available
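
A workflow that selects an OCR engine per document may want to consult the table above before sending a file for processing. The following is a minimal sketch of such a lookup, not part of any UiPath API: the names `PRINTED_TEXT_SUPPORT`, `printed_text_status`, and `is_available` are hypothetical, and the dictionary holds only a handful of rows copied from the table, with an empty string standing in for an empty cell.

```python
# Hypothetical excerpt of the printed-text table: language code -> status.
# An empty string mirrors an empty cell in the UiPath Document OCR column.
PRINTED_TEXT_SUPPORT = {
    "AFR": "available",
    "ARA": "available (Preview)",
    "DE": "available",
    "EL": "",
    "EN": "available",
    "JA": "",
}


def printed_text_status(code: str) -> str:
    """Return the table status for a language code ("" if blank or not listed)."""
    return PRINTED_TEXT_SUPPORT.get(code.strip().upper(), "")


def is_available(code: str, allow_preview: bool = False) -> bool:
    """True when the status is "available"; preview entries count only on request."""
    status = printed_text_status(code)
    if status == "available":
        return True
    return allow_preview and status == "available (Preview)"
```

Treating preview entries as opt-in is a design choice of this sketch; a caller that accepts preview quality for Arabic would pass `allow_preview=True`.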

Handwritten text

Language (Language Code) | UiPathDocumentOCR_Handwriting
Chinese Simplified (ZH-HANS) |
English (EN) | available
French (FR) | available
German (DE) | available
Italian (IT) |
Japanese (JA) |
Korean (KO) |
Portuguese (PT) |
Spanish (ES) |

Supported characters

Alphabet | UiPath Document OCR
Arabic | 'ا','ب','ة','ت','ث','ج','ح','خ','د','ذ','ر','ز','س','ش','ص','ض','ط','ظ','ع','غ','ـ','ف','ق','ك','ل','م','ن','ه','و','ى','ي','ٓ','ٔ','ٕ','٠','١','٢','٣','٤','٥','٦','٧','٨','٩','٪','٫','٬','٭','ٱ','۔','ً','ٌ','ٍ','َ','ُ','ِ','ّ','ْ','ٰ','ۥ','ۦ','آ','،','؛','؟','ء','أ','ؤ','إ','ئ'
Hebrew | א ב ג ד ה ו ז ח ט י ך כ ל ם מ ן נ ס ע ף פ ץ צ ק ר ש ת ₪
Latin | A B C D E F G H I J K L M N O P Q R S T U V W X Y Z a b c d e f g h i j k l m n o p q r s t u v w x y z À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ñ Ò Ó Ô Õ Ö Ø Ù Ú Û Ü Ý ß à á â ã ä å æ ç è é ê ë ì í î ï ñ ò ó ô õ ö ø ù ú û ü ý Ā ā Ă ă Ą ą Ć ć Ċ ċ Č č Ď ď Đ đ Ē ē Ė ė Ę ę Ě ě Ğ ğ Ġ ġ Ħ ħ Ī ī Ĭ ĭ Į į İ ı Ĺ ĺ Ľ ľ Ł ł Ń ń Ň ň Ŋ ŋ Ō ō Ő ő Œ œ Ŕ ŕ Ř ř Ś ś Š š Ť ť Ŧ ŧ Ū ū Ŭ ŭ Ů ů Ų ų Ź ź Ż ż Ž ž Ə Ǵ ǵ Ș ș Ț ț ə μ
Other characters | ! " # $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ? @ [ \ ] ^ _ { | } ~ £ ¥ § © ® ° ¿ € ≤ ≥
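
To illustrate how the character table might be used, the sketch below reports which characters in a piece of expected text fall outside a supported set. It is an assumption-laden example rather than a UiPath feature: `SUPPORTED` and `unsupported_characters` are hypothetical names, and the set covers only a small subset of the Latin and "Other characters" rows. A real check would load every row of the table.

```python
import string

# Assumed subset of the table: basic Latin letters, digits, some punctuation,
# and a few accented letters and currency signs that appear in the rows above.
SUPPORTED = (
    set(string.ascii_letters + string.digits)
    | set("!\"#$%&'()*+,-./:;<=>?@")
    | set("ÀÉßñü€£")
)


def unsupported_characters(text: str) -> list[str]:
    """Return distinct non-whitespace characters missing from SUPPORTED, in first-seen order."""
    missing: list[str] = []
    for ch in text:
        if ch.isspace() or ch in SUPPORTED or ch in missing:
            continue
        missing.append(ch)
    return missing
```

For example, `unsupported_characters("Total: 42 €")` returns an empty list, while text in a script outside the assumed subset returns each offending character once.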

© 2005-2024 UiPath. All rights reserved.