document-understanding
latest
false
- Document Understanding Release Notes
- ML Packages and Public Endpoints Release Notes
Document Understanding Release Notes
Last updated Oct 31, 2024
March 2022
Improvements
Added more descriptive tooltips on Training,Validation, and Evaluation document types.
Bug fixes
- Fixed a known issue that was causing the search or the download of a document which contained characters that require URL
encoding (
&
,,
,+
,#
,'
) in its file name to fail with invalid query. - Fixed a bug that caused the Predict functionality to fail on documents with very dense text.
Improvements
Implemented inside document search which allows you to search for instances of text solely in your current document. This
is particularly helpful for documents with many pages. The search bar can be found at the bottom left hand side of the screen
and it can also be accessed using the shortcut
Ctrl
+ Shift
+ F
.
When using the Predict functionality, manually labeled data is deleted and the document is overwritten with the new values from the model.
The
split.csv
is no longer used when importing a dataset into another Document Manager session, or when running a Training Pipeline. The
data from the file is now integrated in the JSON files from the latest folder in the dataset, more exactly in the subset field. So, if you manually modify the file or delete it completely from the dataset, it does not have an impact over the
training of the model. Please know, however, that the file is still kept for document level export in the case of ML Packages
version 21.10 or before.
Added the option to permanently delete individual files. This can be found in the drop-down next to the document name, alongside the download option.