Process Mining - Data Loading

2021.10

Process Mining user guide

Data Loading

Introduction

Data Loading Into the Connector

Data loading refers to the time required to load new data into the Connector. When reading from a database, this time is largely determined by the number of columns.

Some types of data sources are faster to load than others. Broadly, the order from fastest to slowest is as follows.

ODBC: the speed also depends on the driver and the database.
Flat files: CSV files.
Excel: these files contain overhead for use in Excel, which makes them slower to read. If possible, use text files instead of Excel files. Text files are much faster.

A multi-file script is quite slow because it has to parse all the different flat files together, so avoid it if possible. Also avoid APIs for loading massive amounts of data.
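One way to avoid parsing many flat files at load time is to merge them into a single file beforehand, outside UiPath Process Mining. The following is a minimal sketch of that idea, assuming all input files are UTF-8 CSV files with an identical header row. The function name `merge_csv_files` and the file layout are hypothetical and not part of the product.

```python
import csv


def merge_csv_files(input_paths, output_path):
    """Concatenate CSV files that share the same header into one file.

    Returns the number of data rows written (the header is not counted).
    """
    rows_written = 0
    header = None
    with open(output_path, "w", newline="", encoding="utf-8") as out_file:
        writer = csv.writer(out_file)
        for path in input_paths:
            with open(path, newline="", encoding="utf-8") as in_file:
                reader = csv.reader(in_file)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # skip completely empty files
                if header is None:
                    # The first non-empty file defines the expected header.
                    header = file_header
                    writer.writerow(header)
                elif file_header != header:
                    raise ValueError(f"Header mismatch in {path}")
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

The merged file can then be loaded as a single flat file. Whether this is faster in practice depends on the number and size of the files.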

Data Loading Into the Application

Data can be loaded in the following ways:

when the application is started (live data);
as the result of a scheduled data run (cached data);
a combination of live and cached data (incremental load).

Live Data

In general, live data is a lot slower, especially if there is a lot of data. Live data also needs continuous access to the data, which can be a problem during production hours.

As a general guideline, it is recommended to keep live data below 100,000 events. Actual performance depends heavily on the data and on the data sources used.

It is possible to retrieve live data based on the value of a filter. If the filter is changed, the new data is requested. Performance must be seriously considered for these kinds of use cases.

Live tables are loaded when the user logs in and/or changes a filter control. Live tables often lead to performance problems. It is recommended to use cached tables whenever possible.

Cached Data

For cached data, the startup time of the application is independent of the number of columns. When data is pre-calculated and cached, it can be loaded directly from the cache when it is requested. Extracting data from source systems can be time-consuming. It is recommended to schedule the cache updates, for example outside production hours.

In addition to extracting the data, a cache update transforms the data to the UiPath Process Mining internal format and caches all calculations that do not depend on user input.

For calculations that depend on user input, the initial state is cached. When the user changes a control or filter that changes the calculation, the calculation is performed again. Keeping these recalculations to a minimum is very important in good application design.

Incremental Load

By default, UiPath Process Mining does not incrementally load data. Because mutations often take place on the items in the ERP systems, archiving the data is often not a desired approach. Therefore, all data is loaded from the system to ensure we have the latest changes in our data model.

In theory, incremental data loading can be set up by application developers. This requires sufficient information in the database to determine which data is new and which data needs to be queried. Performance needs to be considered carefully. We only recommend using incremental data loading when it is absolutely necessary.

A more suitable alternative is to run incremental loads from the source system into a data lake or data warehouse using specialized tools, and then query the data lake or data warehouse from UiPath Process Mining. This keeps the impact on the source system low and shares the gains of incremental loads with the entire organisation, rather than limiting them to UiPath Process Mining.
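The idea of an incremental load into a warehouse can be sketched with a watermark: only rows changed since the latest change already stored in the warehouse are copied. This is a minimal illustration using SQLite from the Python standard library, not a UiPath Process Mining feature. The table `events`, its columns, and the function `incremental_load` are hypothetical, and the sketch assumes the source records an ISO 8601 `changed_at` timestamp for every insert and update.

```python
import sqlite3


def incremental_load(source, warehouse):
    """Copy rows changed since the last run from source to warehouse.

    Returns the number of rows inserted or updated in the warehouse.
    """
    warehouse.execute(
        "CREATE TABLE IF NOT EXISTS events ("
        "event_id INTEGER PRIMARY KEY, activity TEXT, changed_at TEXT)"
    )
    # Watermark: the latest change timestamp already present in the warehouse.
    # ISO 8601 strings sort chronologically, so a text comparison is enough.
    (watermark,) = warehouse.execute(
        "SELECT COALESCE(MAX(changed_at), '') FROM events"
    ).fetchone()
    changed = source.execute(
        "SELECT event_id, activity, changed_at FROM events WHERE changed_at > ?",
        (watermark,),
    ).fetchall()
    # INSERT OR REPLACE also picks up mutations on rows loaded earlier.
    warehouse.executemany(
        "INSERT OR REPLACE INTO events (event_id, activity, changed_at) "
        "VALUES (?, ?, ?)",
        changed,
    )
    warehouse.commit()
    return len(changed)
```

Note that the strict comparison `changed_at > ?` misses rows that share the watermark timestamp but arrive later, and that deletions in the source are not detected. Both are examples of why the database needs sufficient information before an incremental approach is reliable.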

External Scripts

In UiPath Process Mining you can load data via scripts, using for example Python or R. The platform calls an external program to run the script, and the output of the script is then read in. UiPath Process Mining provides support for the interface between the platform and the script. UiPath Process Mining does not provide support for issues with the actual script, such as issues that cause a long runtime of the external tool.
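Because the runtime of the script itself is the responsibility of the script author, it can help to measure it inside the script. The following is a minimal sketch of a Python script that writes records as CSV to an output stream and reports its own runtime on standard error. It assumes that the script output is consumed as delimited text. The column names, the sample rows, and the function `write_events` are hypothetical and are not taken from the UiPath Process Mining interface.

```python
import csv
import sys
import time


def write_events(rows, out):
    """Write rows as CSV to the stream `out` and return the elapsed seconds."""
    started = time.perf_counter()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["case_id", "activity", "timestamp"])  # hypothetical columns
    writer.writerows(rows)
    return time.perf_counter() - started


if __name__ == "__main__":
    # Hypothetical sample data; a real script would compute or fetch its rows.
    sample = [
        ("1", "Create order", "2021-10-01T08:00:00"),
        ("1", "Approve order", "2021-10-01T09:00:00"),
    ]
    elapsed = write_events(sample, sys.stdout)
    # Timing goes to stderr so that it does not end up in the data output.
    print(f"wrote {len(sample)} rows in {elapsed:.3f}s", file=sys.stderr)
```

Logging the runtime separately from the data makes it easier to tell whether a slow load is caused by the script or by the step that reads its output.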

Solutions

Drivers

Always make sure that you have installed the latest versions of MSSQL ODBC drivers for Windows Server 2016.

Debug Module

Sometimes it is not possible to reduce the data to be read in, for example when the input data cannot be filtered yet. With a large input in your Connector, response times may be slow. To speed up development, you can add modules to your application.

You can use the module code to ensure that the data is actually read in by only one module, while the other modules do not load data and can be used to make changes to your data model. In this way, changes take effect without having to wait for the data to initialize.
