process-mining

2024.10

true

Process Mining

DELIVERY:

Last updated Apr 28, 2025

Transformations

Folder structure

The transformations of a process app consist of a dbt project. The following table describes the contents of a dbt project folder.

Folder/file	Contains
`dbt_packages\`	the `pm_utils` package and its macros.
`macros\`	optional folder for custom macros
`models\`	`.sql` files that define the transformations.
`models\schema\`	`.yml` files that define tests on the data.
`seed`	`.csv` files with configuration settings.
`dbt_project.yml`	the settings of the dbtproject.

Note:

The Event log and Custom process app templates have a simplified data transformations structure. Process apps created with these app templates do not have this folder structure.

dbt_project.yml

The dbt_project.yml file contains settings of the dbt project which defines your transformations. The vars section contains variables that are used in the transformations.

Date/time format

Each app template contains variables that determine the format for parsing date/time data. These variables have to be adjusted if the input data has a different date/time format than expected.

Data transformations

The data transformations are defined in .sql files in the models\ directory. The data transformations are organized in a standard set of sub directories.

Check out Structure of transformations for more information.

The .sql files are written in Jinja SQL, which allows you to insert Jinja statements inside plain SQL queries. When dbt runs all .sql files, each .sql file results in a new view or table in the database.

Typically, the .sql files have the following structure: Select * from {{ ref('Table_A') }} Table_A.

The following code shows an example SQL query.

select
    tableA."Field_1" as "Alias_1",
    tableA."Field_2",
    tableA."Field_3"
from {{ ref('tableA') }} as tableAselect
    tableA."Field_1" as "Alias_1",
    tableA."Field_2",
    tableA."Field_3"
from {{ ref('tableA') }} as tableA

Note:

In some cases, for process apps created with earlier versions of the app templates, the .sql files have the following structure:

With statements: One or more with statements to include the required sub tables.
- {{ ref(‘My_table) }} refers to table defined by another .sql file.
- {{ source(var("schema_sources"), 'My_table') }} refers to an input table.
Main query: The query that defines the new table.
Final query: Typically a query like Select * from table is used at the end. This makes it easy to make sub-selections while debugging.

For more tips on how to write transformations effectively, refer to Tips for writing SQL.

Adding source tables

To add a new source table to the dbt project, it must be listed in models\schema\sources.yml. This way, other models can refer to it by using {{ source(var("schema_sources"), 'My_table') }}. The following illustration shows an example.

Important: Each new source table must be listed in sources.yml.

For more information using on using source tables in queries, refe to Structure of transformations:1. Input. For more detailed information, refer to the official dbt documentation on Sources.

Data output

The data transformations must output the data model that is required by the corresponding app; each expected table and field must be present.

If you want to add new fields to you process app, you can add these fields in the transformations.

Macros

Macros make it easy to reuse common SQL constructions. For detailed information, refer to the official dbt documentation on Jinja macros.

pm_utils

The pm-utils package contains a set of macros that are typically used in Process Mining transformations. For more info about the pm_utils macros, check out ProcessMining-pm-utils.

The following illustration shows an example of Jinja code calling the pm_utils.optional() macro.

Seeds

Seeds are csv files that are used to add data tables to your transformations. For detailed information, refer to the official dbt documentation on jinja seeds.

In Process Mining, this is typically used to make it easy to configure mappings in your transformations.

After editing seed files, run the file by selecting Run file or Run all, to update the corresponding data table.

Activity configuration

activity_order is used as a tie breaker when two events are happening on the same timestamp.

Using SQL queries

You can use SQL queries in data transformations to set additional fields related to activities The following code shows an example SQL query to define the activity_order.

case
            when tableA."Activity" = 'ActivityA'
                then 1
            when tableA."Activity" = 'ActivityB'
                then 2
            when tableA."Activity" = 'ActivityC'
                then 3
            when tableA."Activity" = 'ActivityD'
                then 4
    end as "Activity_order"    case
            when tableA."Activity" = 'ActivityA'
                then 1
            when tableA."Activity" = 'ActivityB'
                then 2
            when tableA."Activity" = 'ActivityC'
                then 3
            when tableA."Activity" = 'ActivityD'
                then 4
    end as "Activity_order"

Note:

Most app templates have some predefined fields for activity configuration that you can adapt to your business needs. For process apps that do not have these predefined fields you can use the activity_configuration.csv seeds file.

Using the activity_configuration.csv seeds file

The activity_configuration.csv file can also be used to set additional fields related to activities. The following illustration shows an example activity_configuration.csv file.

Note:

The activity_configuration.csv cannot be used for Event log and Custom process app templates.

Tests

The models\schema\ folder contains a set of .yml files that define tests. These validate the structure and contents of the expected data. For detailed information, refer to the official dbt documentation on tests.

Note: When you edit transformations, make sure to update the tests accordingly. The tests can be removed if desired.

Dbt projects

Data transformations are used to transform input data into data suitable for Process Mining. The transformations in Process Mining are written as dbt projects.

This pages gives an introduction to dbt. For more detailed information, refer to the official dbt documentation.

pm-utils package

Process Mining app templates come with a dbt package called pm_utils. This pm-utils package contains utility functions and macros for Process Mining dbt projects. For more info about the pm_utils , refer to ProcessMining-pm-utils.

Updating the pm-utils version used for your app template

UiPath® constantly improves the pm-utils package by adding new functions.

When a new version of the pm-utils package is released, you are advised to update the version used in your transformations, to make sure that you make use of the latest functions and macros of the pm-utils package.

You find the version number of the latest version of the pm-utils package in the Releases panel of the ProcessMining-pm-utils.

Follow these steps to update the pm-utils version in your transformations.

Download the source code (zip) from the release of pm-utils.
Extract the zip file and rename to folder to pm_utils.
Export transformations from the inline Data transformations editor and extract the files.
Replace the pm_utils folder from the exported transformations with the new pm_utils folder.
Zip the contents of the transformations again and import them in the Data transformations editor.

On this page

Folder structure
dbt_project.yml
Data transformations
Adding source tables
Data output
Macros
pm_utils
Seeds
Activity configuration
Tests
Dbt projects
pm-utils package
Updating the pm-utils version used for your app template

Was this page helpful?

PREVIOUSSetting up a local test environment

NEXTCustom throughput time metrics

Support and Services

Get The Help You Need

UiPath Academy

Learning RPA - Automation Courses

UiPath Forum

UiPath Community Forum

Trust and Security

Cookies Policy