document-understanding
2023.4
false
UiPath logo, featuring letters U and I in white

Document Understanding User Guide

Automation CloudAutomation Cloud Public SectorAutomation SuiteStandalone
Last updated Dec 18, 2024

RegEx Based Extractor

What is RegEx Based Extractor

The Regex Based Extractor is the perfect tool for simple use cases, in which, for certain fields, data is always found in a strict, predictable format and context. In other words, if you have a field for which you can define a Regular Expression that is consistently good when matched, then the Regex Based Extractor is a good choice.

The activity comes with a configuration wizard that assists you in defining the regular expressions for the fields you want to target for data extraction in this way.

The activity supports both simple fields as well as table field extraction.

It is recommended to look into other extraction methods, in case there is a high variability of the context and format of the expected values. In such cases, either a Form Extractor or a Machine Learning Extractor may be better suited.

This extractor does not have learning (training) capabilities and requires up-front configuration.

Special requirements

There are no special requirements for using the Regex Based Extractor.

How to configure

Activity configuration

The Regex Based Extractor has two major configurations to be considered:

  • the Configure Regular Expressions wizard - which allows you to define regular expressions for certain fields. This wizard also makes available the Regex Editor wizard, which assists you in building your regular expressions.
  • the UseVisualAlignment setting - which allows you to control whether the regular expressions configured for an extractor should be applied to the text output of the digitization component, or to a text version in which text lines are organized visually, and words are rearranged on lines based on their visual alignment.

Learn more

Learn more about Configure Regular Expression Wizard, by following this link.

  • What is RegEx Based Extractor
  • Special requirements
  • How to configure
  • Activity configuration
  • Learn more

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2024 UiPath. All rights reserved.