AI Center - Managing node scheduling

ai-center

2024.10

false

AI Center user guide

Before you begin
- Installing or upgrading AI Center
- Compatibility matrix
Getting started
Installing AI Center
Migration and upgrade
Projects
- About Projects
- Managing Projects
Datasets
- About Datasets
- Managing Datasets
Data Labeling
ML packages
Out of the box packages
Pipelines
ML Skills
- About ML Skills
- Managing ML Skills
ML Logs
- About ML Logs
Document UnderstandingTM in AI Center
- Document Manager
- OCR Services
AI Center API
- Overview
- API list
How to
- Managing node scheduling
- ML packages
  - Use Custom NER with continuous learning
Licensing
- AI Units
- Legal information
Basic Troubleshooting Guide
- AI Center troubleshooting

Managing node scheduling

Node scheduling configuration in AI Center for assigning ML replicas, skills, and pipelines to specific GPU or specialized hardware nodes.

Note:

Node scheduling is only applicable for agent nodes and should not be applied to server nodes.

Node scheduling is especially helpful to better handle hardware and make sure that ML replicas (pods), skills, or pipelines. You should mainly use them in two situations:

When you have a GPU node to make sure that only workload requiring GPU are scheduled on that node
To better isolate you ML workloads so they are not disturbing other applications. By default all nodes having the required resources are considered equal by kubernetes scheduler.

There are two methods in which you can direct the scheduling of ML replicas (pods), skills, or pipelines to specific nodes, and both are supported in UiPath® AI Center:

Assign Pods to Nodes using Node Affinity: this is useful when collocating ML pods with other pods on a node. Multiple labels can be applied to a node.
Taints and Tolerations: intended for a dedicated node, either repelling all pods (NoSchedule), or imposing a low scheduling preference (PreferNoSchedule) for pods that don't match the scheduling criteria. Although multiple taints can be applied to a node(s), node taints only support the “AND” Boolean logic.

Node affinity is a Pods property that attracts them to a set of nodes, either as a preference or a requirement. Taints on the other hand, allow a node to repel a set of pods.

The first method creates an affinity between replicas and nodes using node labels, while the second method applies anti-affinity by tainting the nodes.

The ML replicas' PodSpec templates are designed to support both methods and are customized based on GPU or non-GPU selection at deployment.

Node affinity

In the case where an agent node has been added to expand the resource pool and you want to influence the scheduling of ML pods on it, you can apply node affinity. Do this by adding a label to the node using the following command:

For CPU: kubectl label node <node_name> node.type=aic.ml.cpu
For GPU: kubectl label node <node_name> node.type=aic.ml.gpu

Node affinity does not ensure that the node is dedicated to serving ML workloads and does not prevent other workload pods from being scheduled to the same node where the labels are applied.

Taints

To dedicate a node, you need to use taints or a combination of node affinity and taints. To dedicate an agent node to serving either ML GPU or CPU pods, you can apply the following taints to the nodes:

For CPU: kubectl taint node <node_name> aic.ml/cpu=present:NoSchedule
For GPU: kubectl taint node <node_name> nvidia.com/gpu=present:NoSchedule

Combining node affinity and taints

To dedicate an agent node for serving ML GPU pods and influence scheduling for ML CPU pods to the same nodes, you can use a combination of node affinity and taints:

kubectl taint node <node_name> nvidia.com/gpu=present:PreferNoSchedule
kubectl label node <node_name> node.type=aic.ml.cpu
kubectl taint node <node_name> nvidia.com/gpu=present:PreferNoSchedule
kubectl label node <node_name> node.type=aic.ml.cpu

On this page

Node affinity
Taints
Combining node affinity and taints

Was this page helpful?

PREVIOUSAPI list

NEXTML packages

Node affinity​

Taints​

Combining node affinity and taints​

Was this page helpful?

Node affinity

Taints

Combining node affinity and taints