- Overview
- Adobe Sign
- Alteryx
- Amazon Comprehend
- Amazon Connect
- Amazon Rekognition
- Amazon Textract
- Box
- Release notes
- About the Box activity package
- Fix XAML After Package Update
- Project compatibility
- Setup
- Technical references
- Box Scope
- Commit Session
- Create File Version Session
- Create Upload Session
- Upload Part
- Delete Old Version
- Get Versions
- Promote Version
- Upload File Version
- Copy File
- Delete File
- Download File
- Get File Comments
- Get File Info
- Get File Info Details
- Get File Lock Info
- Lock File
- Move File
- Unlock File
- Upload File
- Copy Folder
- Create Folder
- Delete Folder
- Get Folder Info
- Get Folder Items
- Rename Folder
- Search For Content
- Execute Method
- Cognitive
- DocuSign
- Release notes
- About the DocuSign activity package
- Project compatibility
- Setup
- DocuSign Scope
- Invoke DocuSign Operation
- Create Envelope
- Create Envelope And Send
- Create Envelope From Template
- Create Envelope Recipients
- Get Document
- Get Documents
- Get Envelope
- Get Form Data
- Send Envelope
- Update Envelope
- List Attachments
- List Custom Fields
- List Documents
- List Envelope Status Changes
- List Recipients
- Create Bulk Send List
- Create Bulk Send Request
- Get Bulk Send List
- Get Bulk Send Lists
- Create Template Custom Fields
- Create Template Recipients
- Get Template
- List Templates
- Google Vision
- Jira
- Release notes
- About the Jira activity package
- Project compatibility
- Setup
- Jira Scope
- Execute Method
- Add Attachment
- Delete Attachment
- Download Attachment
- Get Attachments
- Add Comment
- Delete Comment
- Get Comments
- Create Component
- Delete Component
- Get Components
- Search Dashboards
- Create Filter
- Get Filters
- Create Issue Link
- Delete Issue Link
- Get Issue Link Types
- Get Issue Links
- Add Watcher
- Assign Issue
- Clone Issue
- Create Issue
- Delete Issue
- Get Issue
- Get Watchers
- Remove Watcher
- Search Issues Using JQL
- Transition Issue
- Update Issue
- Create Project
- Delete Project
- Get Project Types
- Get Projects
- Update Project
- Get Transitions
- Create User
- Delete User
- Find Users
- Marketo
- Microsoft Dynamics
- Microsoft Dynamics Finance and Operations
- Microsoft Translator
- Microsoft Vision
- Oracle Integration Cloud
- Oracle NetSuite
- Salesforce
- Release notes
- About the Salesforce activity package
- Project compatibility
- Setup
- Quickstart
- Salesforce Application Scope
- Create Bulk API Job
- Get Bulk API Job
- Get Bulk API Job Results
- Start or Abort Bulk API Job
- Assign File
- Download File
- Upload File
- Composite Request
- Delete Record
- Get List View Records
- Get Record
- Insert Record
- Update Record
- Upsert Record
- Execute Report
- Execute SOQL
- Format Column Names
- Format Labels And Values
- Lightning REST API Request
- Search
- SAP BAPI
- ServiceNow
- Slack
- Smartsheet
- Release notes
- About the Smartsheet activity package
- Project compatibility
- Setup
- Technical references
- Quickstart: Working With Rows
- Smartsheet Application Scope
- Attach File
- Attach URL
- Delete Attachment
- Get Attachment
- List Attachments
- Add Comment
- Delete Comment
- Get Comment
- List Discussions
- Copy Folder
- Create Folder
- Delete Folder
- Get Folder
- List Folders
- Add Group Members
- Get Group
- List Groups
- Remove Group Member
- Get Report
- List Reports
- Add Rows
- Copy Rows
- Create Discussion on Row
- Delete Rows
- Get Row
- List Rows
- Move Rows
- Update Rows
- Copy Sheet
- Create Sheet
- Create Sheet From Template
- Download Sheet
- Get Sheet
- List Sheets
- Update Sheet
- Add User
- Get User
- List Users
- Remove User
- Copy Workspace
- Create Workspace
- List Workspaces
- Import Sheet From CSV / XLSX
- Invoke Smartsheet Operation
- Search Activity
- Send Via Email
- Share Object
- SuccessFactors
- Tableau
- Twilio
- Workato
- Workday
Analyze Multipage Document
The Analyze Multipage Document activity uses the Amazon Textract StartDocumentAnalysis and GetDocumentAnalysis APIs to analyze a multi-page document stored in an S3 bucket (Bucket,DocumentName, and Version). If your document includes a table, you have the option to indicate if the first row contains column headers (DiscoverColumnHeaders) and/or ignore empty rows (IgnoreEmptyRows).
PageDetailCollection
object (Pages) that you can use as input variables in other activities outside of the Amazon Textract Activities Package.
The Analyze Multipage Document activity is essentially a combination of the Start Document Analysis, Get Document Analysis Status, and Get Document Analysis activities in a single activity.
PageDetail[]
object. In 2.0 this has been changed to a PageDetailCollection
to allow us to return the RawJson property for the method call, which was not possible with an array.
The following steps and message sequence diagram is an example of how the activity works from design time (i.e., the activity dependencies and input/output properties) to run time.
- Complete the Setup steps.
- Add the Amazon Scope activity to your project.
- Add the Analyze Single Page Document inside the Amazon Scope activity.
- Enter values for the S3 Storage input properties.
- Create and enter a
PageDetailCollection
variable for your Output property. -
Run the activity.
- Your input properties are sent to the AnalyzeDocument API.
-
The API returns the
PageDetail
value to your output property variable.
The values for the following properties are specified when adding this activity to your project in UiPath Studio.
DisplayName
The display name of the activity.
Attributes |
Details |
---|---|
Type |
|
Required |
Yes |
Default value |
Analyze Multipage Document |
Allowed values |
Enter a
String or
String variable.
|
Notes |
N/A |
Unlike the Get Document Analysis Status, which requires an external delay mechanism to poll the service for status changes, the Analyze Multipage Document includes the following, optional input properties to set an initial status check delay (InitialDelay) and status check interval (StatusCheckInterval).
InitialDelay
The amount of time to wait before the activity calls the Amazon Textract GetDocumentAnalysis API to retrieve the JobStatus value.
Attributes |
Details |
---|---|
Type |
Int32 (milliseconds)
|
Required |
No |
Default value |
15000 (not shown) |
Allowed values |
Enter a
Int32 or
Int32 variable.
|
Notes |
Enter your value in milliseconds (e.g., 30000 for 30 seconds); your value must be greater or equal to 15000. When analyzing a large document, it's recommended that you enter the estimated time it takes for the Amazon Textract service to complete its analysis. For example, if your document takes up to 2 minutes to analyze, you should enter 120000 as your value and use the StatusCheckInterval property to indicate how often you want to check for an updated status if the job doesn't complete within the 2-minute estimate. |
StatusCheckInterval
The amount of time to wait between calls to the Amazon Textract GetDocumentAnalysis API to retrieve the JobStatus value.
Attributes |
Details |
---|---|
Type |
Int32 (milliseconds)
|
Required |
No |
Default value |
10000 (not shown) |
Allowed values |
Enter a
Int32 or
Int32 variable.
|
Notes |
Enter your value in milliseconds (e.g., 15000 for 30 seconds); your value must be greater or equal to 10000. The objective of this property is to help manage the number of calls that your activity makes to the Amazon Textract API. |
AnalysisType
Specifies the types of analysis to perform. Use Tables to return information about the tables that are detected in the input document and Forms to return detected form data.
Attributes |
Details |
---|---|
Type |
enum |
Required |
No. |
Default value |
All |
Allowed values |
All, Tables, Forms |
Notes |
N/A |
DiscoverColumnHeaders
Indicates whether the tables in the document include column headers.
Attributes |
Details |
---|---|
Type |
Checkbox |
Required |
No |
Default value |
Not Selected |
Allowed values |
Selected or Not Selected |
Notes |
N/A |
IgnoreEmptyRows
Indicates whether empty rows in the document tables should be ignored when analyzing the document.
Attributes |
Details |
---|---|
Type |
Checkbox |
Required |
No |
Default value |
Not Selected |
Allowed values |
Selected or Not Selected |
Notes |
N/A |
Bucket
The name of the S3 bucket where the document is stored.
Attributes |
Details |
---|---|
Type |
|
Required |
Yes |
Default value |
Empty |
Allowed values |
Enter a
String or
String variable.
|
Notes |
The AWS Region for the S3 bucket that contains the document must match the Region that you selected in the Amazon Scope activity. For Amazon Textract to process a file in an S3 bucket, the user must have permission to access the S3 bucket; for more information, see step 6 in the Create IAM User section of the Setup guide. |
DocumentName
The case-sensitive name of the file in the specfied Bucket that you want to analyze.
Attributes |
Details |
---|---|
Type |
|
Required |
Yes |
Default value |
Empty |
Allowed values |
Enter a
String or
String variable.
|
Notes |
Supported document formats: PNG, JPEG, and PDF. |
Version
If the bucket has versioning enabled, you can specify the object version.
Attributes |
Details |
---|---|
Type |
|
Required |
No |
Default value |
Empty |
Allowed values |
Enter a
String or
String variable.
|
Notes |
N/A |
Private
If selected, the values of variables and arguments are no longer logged at Verbose level.
Attributes |
Details |
---|---|
Type |
Checkbox |
Required |
No |
Default value |
Not Selected |
Allowed values |
Selected or Not Selected |
Notes |
N/A |
Pages
The properties extracted from the specified document returned as an array.
Attributes |
Details |
---|---|
Type |
|
Required |
No (recommended if you plan to use the output data in subsequent activities) |
Default value |
Empty |
Allowed values |
Enter a
PageDetailCollection
variable
|
Notes |
Each object from the array represents the results
for one individual page. This is a change from previous versions
which returned a
PageDetail[] object.
See Page Detail for the description of the of the PageDetail object and its properties. |
The following image shows an example of the activity dependency relationship and input/output property values.
For step-by-step instructions and examples, see the Quickstart guides.