communications-mining
latest
false
UiPath logo, featuring letters U and I in white

Communications Mining User Guide

Last updated Nov 27, 2024

Training using Low confidence

User permissions required: 'View Sources' AND 'Review and annotate'.

The final key step in Explore is training using 'Low confidence' mode, which shows you messages that are not well covered by informative label predictions. These messages will have either no predictions or very low confidence predictions for labels that the platform understands to be informative.

'Informative labels' are those labels that the platform understands to be useful as standalone labels, by looking at how frequently they're assigned with other labels.

This is a very important step for improving the overall coverage of your model. If you see messages which should have existing labels predicted for them, this is a sign that you need to complete more training for those labels. If you see relevant messages for which no current label is applicable, you may want to create new labels to capture them.

You can assign labels to messages in this mode in the same way as any other Explore mode.

To access this mode, use the dropdown in the top left-hand corner of the Explore page:

Dropdown menu to access ‘Low confidence’

How much training should I do for this step?

This mode will present you with 20 messages at a time, and you should complete a reasonable amount of training in this mode, going through multiple pages of messages and applying the correct labels, to help increase the model's coverage (see here for a detailed explanation of coverage).

The total amount of training you need to complete in 'Low confidence' will depend on a few different factors:

  • How much training you completed in Shuffle and Teach- the more training you do in Shuffle and Teach, the more your training set should be a representative sample of the dataset as a whole, and the fewer relevant messages there should be in 'Low confidence'.
  • The purpose of the dataset - if the dataset is intended to be used for automation and requires very high coverage, then you should complete a larger proportion of training in 'Low confidence' to identify the various edge cases for each label.

At a minimum, you should aim to annotate five pages of messages in this mode. Later on in the Refine phase when you come to check your coverage, you may find that you need to complete more training in 'Low confidence' to improve your coverage further.

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2024 UiPath. All rights reserved.