Document Understanding Release Notes

DELIVERY:

Last updated Feb 24, 2026

March 2022

General Release Notes - Document Understanding

29 March 2022

Improvements

Added more descriptive tooltips on Training,Validation, and Evaluation document types.

Bug fixes

Fixed a known issue that was causing the search or the download of a document which contained characters that require URL encoding (&, ,, +, #, ') in its file name to fail with invalid query.
Fixed a bug that caused the Predict functionality to fail on documents with very dense text.

7 March 2022

Improvements

Implemented inside document search which allows you to search for instances of text solely in your current document. This is particularly helpful for documents with many pages. The search bar can be found at the bottom left hand side of the screen and it can also be accessed using the shortcut Ctrl + Shift + F.

When using the Predict functionality, manually labeled data is deleted and the document is overwritten with the new values from the model.

The split.csv is no longer used when importing a dataset into another Document Manager session, or when running a Training Pipeline. The data from the file is now integrated in the JSON files from the latest folder in the dataset, more exactly in the subset field. So, if you manually modify the file or delete it completely from the dataset, it does not have an impact over the training of the model. Please know, however, that the file is still kept for document level export in the case of ML Packages version 21.10 or before.

Added the option to permanently delete individual files. This can be found in the drop-down next to the document name, alongside the download option.

On this page