Modifying a GPU worker group

Prerequisites:

CPU, GPU, RAM, Storage, and Instance quota sufficient for the desired worker group configuration change.
GPU quota must cover at least Min nodes + 1 to allow worker nodes to roll out the new configuration. If using Autoscale, the GPU quota must cover the maximum desired number of nodes.
One network subnet: the network used for Kubernetes nodes must have a Static IP Pool.

The steps are as follows:

Step 1: Access the FPT Cloud portal at console.fptcloud.com, navigate to the Kubernetes section, click the cluster you want to modify, go to Node Pools, and click the "Edit Workers" icon.

Step 2: In addition to the standard worker group configuration settings, configure the GPU options:

Select instance type: GPU
Select GPU type (A30, A100, H100, H200, etc.)
Select the GPU sharing configuration (None / Single / Mixed)
Select the GPU type configuration (CPU / RAM / GPU RAM)

warning

Changing the GPU sharing method will require all GPU-related workloads to be redeployed. Before making the change, scale the application down to 0 to avoid errors.
If the previous GPU sharing selection was None or None with Operator, you cannot change it to Single or Mixed.
If the previous GPU sharing selection was Single, you can only change it to the corresponding Single modes.

Step 3: Review the initialization information and click Save.

Step 4: Monitor the status of the worker group update in the Kubernetes cluster. Once the status is Succeeded (Running), proceed to deploy your applications.