- Getting started
- Balance
- Clusters
- Concept drift
- Coverage
- Datasets
- General fields (previously entities)
- Labels (predictions, confidence levels, hierarchy, etc.)
- Models
- Streams
- Model Rating
- Projects
- Precision
- Recall
- Reviewed and unreviewed messages
- Sources
- Taxonomies
- Training
- True and false positive and negative predictions
- Validation
- Messages
- Administration
- Manage sources and datasets
- Understanding the data structure and permissions
- Create a data source in the GUI
- Uploading a CSV file into a source
- Create a new dataset
- Multilingual sources and datasets
- Enabling sentiment on a dataset
- Amend a dataset's settings
- Delete messages via the UI
- Delete a dataset
- Export a dataset
- Using Exchange Integrations
- Preparing data for .CSV upload
- Model training and maintenance
- Understanding labels, general fields and metadata
- Label hierarchy and best practice
- Defining your taxonomy objectives
- Analytics vs. automation use cases
- Turning your objectives into labels
- Building your taxonomy structure
- Taxonomy design best practice
- Importing your taxonomy
- Overview of the model training process
- Generative Annotation (NEW)
- Dastaset status
- Model training and annotating best practice
- Training with label sentiment analysis enabled
- Train
- Introduction to Refine
- Precision and recall explained
- Precision and recall
- How does Validation work?
- Understanding and improving model performance
- Why might a label have low average precision?
- Training using Check label and Missed label
- Training using Teach label (Refine)
- Training using Search (Refine)
- Understanding and increasing coverage
- Improving Balance and using Rebalance
- When to stop training your model
- Using general fields
- Generative extraction
- Using analytics and monitoring
- Automations and Communications Mining
- Licensing information
- FAQs and more
Communications Mining User Guide
Overview of setting up your extraction fields
- At any point during the model training process, you can set up a new extraction, modify your schema, or add any additional fields to your existing schema in Explore.
- By setting up your extractions in Explore, you can:
- base your fields off data from your messages.
- add new fields to extractions as you see them.
- At any point during the model training process, you can set up a new extraction, modify your schema, or add any additional fields to your existing schema in Settings.
- If you know what fields you want to extract upfront, set up your extractions in bulk, in Settings.
- To set up your extractions, set up your fields that require a name and a field type. It is recommended to do this at the lowest child-level label.
- Be descriptive and concise. Choose field names that accurately describe the data they represent. Aim for a balance between brevity and clarity. Give your field an accurate and descriptive name, as it gives the model the necessary context on the role of the field.
- For example, for an address change, if you only want to extract a new address, it is helpful to have configured field names called: new street address, new town, new postcode, and new city.
- Avoid ambiguous field names. Ensure that field names are unambiguous and not easily confused with other fields or concepts in your project. For example, instead of using Value, use a more specific name like Sales Amount or Account Balance.
- You can have extraction fields with the same field type in, but not for multiple general fields. To address this for general fields, create another field type with the same settings to address this.
You need to create 2 different fields types (one for Date Before and Date After, and map them to the respective form definitions.
A Field Name is used to prompt the model. If your extractions are not performing as expected, adjust your Field Name to be more specific to your use case. Adjusting the field name may help with performance.
The field names below are just examples – how you name your fields is use case dependent, and depends on the context of what you are trying to extract.
Use case | Not recommended Field Names | Better performing Field Names |
---|---|---|
As part of an address change request, you want to extract the details of the new address to input into your system downstream. |
|
|
As part of a logistics shipping request, you want to identify the total tax breakdown (both the VAT amount, and the VAT rate) on each of your goods to input into SAP. |
|
|
As part of an invoice change request, you want to identify what the old invoice number was and what it needs to be changed to, to cancel the old invoice, and re-issue a new one. |
|
|
There are two different types of fields that help facilitate end-to-end automation:
- General fields
- Extraction fields.
It is important to understand the different types of fields available in Communications Mining, and when to use each one.
GENERAL FIELDS | EXTRACTION FIELDS |
---|---|
General fields are fields that you may want to extract, that can be found across multiple different topics/labels in a dataset.
| Extraction fields are the fields conditioned (and created) on a specific label. In other words, it is tied to a specific label that you want to automate.
|
The following table captures the key distinctions between general fields and extraction fields. Check the differences because two completely different models predict these field kinds.
Field type | Predicted | Reviewed at | Spanless* vs. Spanful* | Overlap spans? | Share field types between fields of same kind | Supported Data Types** |
General Fields | Automatically across dataset | A paragraph level | Only spanful | No | No (for now) |
|
Extraction Fields | Only on demand (currently) | A message level (in context of label) | Both spanful and spanless | Yes | Yes |
|
Check the Spanless fields in the Spanful vs. Spanless Fields page of this guide.
Check the Data types supported by each field kind in the Data Types page of this guide.
In this example, the platform is able to identify the extraction fields, relevant to facilitating the end-to-end automation of these two labels.
In this example, the platform isn’t confident enough that a certain label in the taxonomy applies to this message. The platform can still extract certain fields from the message itself. When you set up general fields, the platform can pick up these fields, irrespective of a label prediction.
You can set up or modify both your general fields or extraction fields through the Explore page by following the steps below.
- On a communication containing a label, where you want to define your extraction field in Explore, select Annotate Fields.
- If you set up an extraction field,
hover next to the label name in the Field annotations bar on the right, and
select Manage fields. If you set up a general field, hover next to General
fields and manage your fields there.
- Select New extraction field to add a new extraction field. You can add more than one field.
- Fill in the extraction Field name(s) and field type that you want to
extract. You can select an existing field type or create a new one if what you’re
trying to extract is not configured.
- Select Save in the bottom right to save the extraction fields.
Set up or modify both your general fields or extraction fields through the Settings page, by following the steps below.
To configure fields via Train as well, follow these steps:
- Go to Settings, then Taxonomy.
- To create an extraction field, go to the Labels and fields tab.
- On the specific label that you want to create an extraction field on, select the dropdown menu. Selecting the drop-down expands the list of all the fields on a given label.
- To add a new extraction field, select Extraction field at the bottom.
- Fill out the Field name, as well as the Extraction field type to configure your new extraction field.
- To create a new general field, go to the General fields tab. Select New field in the top right corner.
- Fill out the Field name, and General field type to configure your new General field(s).
When you set up your fields, you have to select the specific data type.
- Date
- Exact Text
- Inferred Text
- Monetary Quantity
- Number
The following table details when to use each type.
Field Types | ||||
Data Type | General Field | Extraction Field | Description | Examples |
String | X | X | Strings can include any characters (letters, numbers, etc.).
Strings can also have input values that are explicitly present (spanful) in the message or inferred (spanless). Check below for more details. |
|
Date* | X | X | Dates come in varying unstructured formats and use UiPath’s® pre-trained date field.
|
|
Number | X | X | Quantities come in varying unstructured formats and use UiPath’s® pre-trained quantity field to interpret numbers.
|
|
Monetary Quantity* | X | X | Similarly, monetary quantities also typically come in varying unstructured formats and use UiPath’s® pre-trained monetary
quantity model.
|
|
Regex | X | | If a specific field always needs to be extracted in a specific format, the rules can be configured with RegEx. For more details, check the official UiPath® documentation |
|
Template | X | | Check the official UiPath® documentation for a list of supported templates |
|
Many fields may need to be normalized into a structured data format for downstream processes.
Within the platform, monetary quantities and dates are general field types that are automatically normalized. For more details, check the official UiPath® documentation on field normalization.
What is a spanful field?
A spanful field is a data point that is explicitly stated in the text (e.g., a Trade ID, Policy Number).
What is a spanless field?
A spanless field is a data point that might not be explicitly stated in the text but needs to be extracted from the message (i.e., can be inferred from the message). In other words, the span of text you want to extract might not necessarily be present in the message.
When setting up general fields, specify if the input value must be present in the message, or if it can be inferred from the message (i.e. – needs to be extracted exactly as-is from the text), or not.
Some examples of fields that may need to be spanless:
- Values that need to be normalized (e.g., a date).
- Values that need to be concatenated across different areas in an email.
- Values that are not present anywhere in an email, but are implied through the nature of the email
- Values that span across multiple paragraphs, lines, or columns (i.e., do not appear in a continuous span).
A field type is the initial state of your new field. If you do not have a field type to use, follow these steps to set up a new field type. You can set up the new field type from the drop-down when creating a field, but also on the field type page itself if needed.
Put the broadest field type possible, then fine-tune it to be more specific.
- A - Give your field type a name.
Note: The field type name is NOT used by the model for context the same way that field names are.
- B - Define whether you are setting up a new field type for an extraction field, or a general field.
- C - When setting up your general fields or extraction fields, you have to select the specific data type for the field type.
Note: Depending on whether you set up a new field type or general field for an extraction, your data type that you can configure may vary. Additional configurations are also applicable, depending on the data type that you select.
You can set up a new field type either through the Explore page, or the Settings page, via the Train tab.
Once the data type has been configured on a field type, you cannot change it. Select the correct data type when creating a field type. If you don't select the correct data, you have to delete the field type and re-create it with the correct data type.
You can set up a new field type for both Extraction fields and General fields through the Settings page.
To set up a new field type in the Settings page, follow the steps below.
(1) Settings > (2) Taxonomy > (3) Field Types > (4) New Field type > (5) Set up your field type.
To set up your field types via the Explore page, follow the steps below.
(1) Explore > (2) Annotate Fields > (3) click the 3 dots next to either the general field or extraction field section. You can only create a new field type in its respective section > (4) Manage fields > (5) Select the field type drop down then New field type. Set up your field type.
- Explore page
- Settings page
- Train page
- General guidance
- Field name best practice
- General vs. extraction fields
- Extraction Fields Example
- General Fields Example
- Set up your fields via Explore
- Set up your fields via Settings
- Setting up field types
- Creating a new field type
- Creating a new field type via Settings
- Creating a new field type via Explore