GPU Service on Kubernetes
FPT Cloud provides Kubernetes with NVIDIA GPU support, featuring the following key capabilities:
- Flexible GPU configuration with selectable GPU type and GPU memory for each Worker Group.
- Automated GPU resource management and allocation in Kubernetes using NVIDIA Operator.
- GPU visualization and monitoring via NVIDIA DCGM.
- Automatic Container/Node scaling with Autoscaler when application GPU resource demand increases or decreases.
- GPU sharing support via Multi-Instance mechanism to optimize GPU resource usage and cost.
FPT Cloud uses the NVIDIA GPU Operator, which provides tools to automatically manage all software components required to use GPUs on Kubernetes. The GPU Operator allows users to use GPU resources in the same way as CPU resources within a Kubernetes cluster.
For detailed instructions on setting up and using GPU on Kubernetes, please refer to the following guides:
- Setup Kubernetes cluster for GPU usage
- How to use GPU in Kubernetes
- GPU sharing
- Setting up GPU telemetry
- Deploy application with GPU on Kubernetes