Notes de publication de Document Understanding
2024.10.0
Date de publication : 11 novembre 2024
Document Understanding™ 2024.10 LTS Release
We are excited to announce that our latest OCR engine, UiPath Extended Languages OCR, is now in general availability. The new OCR is capable of digitizing documents in over 200 languages, bringing a significant improvement over its predecessor, especially in regards to Chinese, Japanese, and Korean. Additionally, it can process documents in Thai, Vietnamese, all major languages from India, and languages using the Cyrilic alphabet, and Greek.
We've made significant improvements to our document digitization process. Now, when using the UiPath Extended Languages OCR, the output will be regular word boxes instead of individual characters.
- Cette version apporte des améliorations de précision et de performance pour la reconnaissance de l’écriture manuscrite.
- La reconnaissance et la détection de Reconnaissance des caractères à l’encre magnétique (MIRC) ont été améliorées, ce qui permet une précision améliorée, en particulier pour les chèques.
- Previously, numbers were not recognized in some instances when a space was used as separator. Numbers are now recognized when space is used as separator.
- The confidence score for the UiPath Document Understanding OCR is improved, particularly when used on lower quality images. In workflows where confidence score is used to decide if documents need human validation in Action Center, this may result in an increased number of documents undergoing validation.
Nous avons résolu un problème où les zones d’annotation étaient renvoyées horizontalement, même si certains documents étaient légèrement de travers, ce qui provoquait un désalignement de l’annotation.