Skip to main content

Monitor Gpu Vms

GPU Virtual Machine provides metrics to help you monitor and troubleshoot your workloads. Monitoring metrics are collected to track the performance , availability , and resource usage of services, helping detect issues and optimize operations. Note: Metric data is retained for 30 days. There are 3 metric groups available:

  • Utilization metrics: Monitor CPU , memory , and GPU usage to assess system performance and resource efficiency.
  • Disk metrics: Track disk read/write speed and latency to detect storage issues or performance bottlenecks.
  • Network metrics: Measure the amount of data transmitted and received , and show how frequently those read/write actions occur.

For additional or customized metrics, please contact us to explore our advanced monitoring service.

Alt text