automation-suite
2023.4
false
Automation Suite on Linux Installation Guide
Last updated Sep 5, 2024

Starting and shutting down a node

This page explains the manual and automatic startup and shutdown behavior of Automation Suite.

Startup behavior

The rke2-service starts and is followed by node-drainer and node-uncordon. node-drainer does not do any action at startup, just returns confirmation that the service is up.
The node-uncordon only runs once and starts /opt/node-drain.sh nodestart, which uncordons the node. As part of the drain procedure that occurs at shutdown, this cordons the node, making it unschedulable. This state persists when the rke2 service starts. As such, the node must be uncordoned after rke2-service restarts.

Manual startup

The service starts automatically with Automation Suite. However, if rke2-service was manually stopped, you must start the service again by running the following commands:
  1. Start the Kubernetes process running on the server node:
    systemctl start rke2-serversystemctl start rke2-server
  2. Start the Kubernetes process running on the server node:
    systemctl start rke2-agentsystemctl start rke2-agent
  3. Once the rke2 service is started, uncordon the node to ensure Kubernetes can now schedule workloads on this node:
    systemctl restart node-uncordonsystemctl restart node-uncordon
  4. Once the node is started, you must drain the node:
    systemctl start node-drain.servicesystemctl start node-drain.service
    Important:

    Skipping step 4 could cause the Kubelet service to shut down in an unhealthy way if the system is restarted.

Shutdown behavior

During shutdown, systemd stops the services in the order they were started. Since the node-drain service has the After=rke2-server.service or After=rke2-agent.service directive, it executes its shutdown sequence before the rke2-service shutdown. This means that, in a properly configured system, simply gracefully shutting down the node is a safe operation.

Manual restart

If you plan to stop the rke2 service and reboot the machine, take the following steps:

  1. To ensure that the cluster is healthy while performing node maintenance activity, you must drain the workloads running on that node to other nodes. To drain the node, run the following command:
    systemctl stop node-drain.servicesystemctl stop node-drain.service
  2. Stop the Kubernetes process running on the server node:
    systemctl stop rke2-serversystemctl stop rke2-server
  3. Stop the Kubernetes process running on the agent node:
    systemctl stop rke2-agentsystemctl stop rke2-agent
  4. Kill the rke2 services and containerd and all child processes:

    rke2-killall.shrke2-killall.sh
    This should already be in the path, but it is located at /bin/rke2-killall.sh.

Files created during installation

  1. The following unit files are created during installation:
    • rke2-server.service (server only). Starts the rke2-server, which starts the server node.
    • rke2-agent.service (agent only). Starts the rke2-agent, which starts the agent node.
    • node-drain.service. Used at shutdown time. Executed before shutting down rke2-agent or rke2-server and performs a drain. Has a timeout of 300 seconds.
    • node-uncordon.service. Used at startup to uncordon a node.
    • var-lib-kubelet.mount. Autogenerated by fstab generator.
    • var-lib-rancher-rke2-server-db.mount. Autogenerated by fstab generator.
    • var-lib-rancher.mount. Autogenerated by fstab generator.
There are no strong dependencies between the unit files. However, node-drain and node-uncordon have the After=rke2-server.service or After=rke2-agent.service directive. This means that those services will start after the rke2-service.
  • Startup behavior
  • Shutdown behavior
  • Files created during installation

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2024 UiPath. All rights reserved.