
Terms and definitions

| Term | Definition |
| --- | --- |
| Model | A model (or AI model) is a computer program designed to simulate human intelligence. |
| Model Serving | A Platform as a Service (PaaS) product for deploying and operating AI models on the Kubernetes platform provided by FPT Cloud. |
| Model Type | The type of model to deploy: with or without an image. |
| Image | A template for packaging an application as a container for deployment. |
| Image Source | The source type of the deployment image: Private or Public. |
| Image Registry | The address where the deployment image is hosted. |
| Image Tag | The tag of the deployment image. |
| Cluster | The Kubernetes cluster on which the model is deployed. |
| Instance | A unit of processing capacity. |
| Instance Replica | The number of identical instances providing processing capacity for a single model. |
| Resource Type Flavor | The type of processing resource (CPU/RAM/Disk/GPU). |
| Deployment Strategy | The Kubernetes deployment strategy (Recreate/Rolling Update). |
| Startup Command | The startup (initialization) command for the deployed model. |
| Environment Variable | An environment variable that must be configured for the deployed model. |
| Nodes Selector | The labels used to select the worker nodes on which the model is deployed. |
| Tags | Labels attached to the deployed model. |
| Services Type | The type of external connection for the deployed model on Kubernetes (Load Balancer/ClusterIP/Ingress). |
| Traffic Type | The type of data connection (Public/Private). |
| Port | The TCP port used for external connections and for traffic into the container. |
| GPU Sharing Mode | The method used to partition or share a GPU (MIG/MPS/Time-slice). |
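
To show how the terms above fit together, here is a minimal Python sketch that collects them into one deployment record and checks the enumerated options. It is an illustration only and not an FPT Cloud API: the `ModelDeployment` class, its field names, the default values, and the registry address are all assumptions. Only the option lists (Private/Public, Recreate/Rolling Update, and so on) come from the definitions above.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# Option lists copied from the glossary; everything else is hypothetical.
IMAGE_SOURCES = {"Private", "Public"}
DEPLOYMENT_STRATEGIES = {"Recreate", "Rolling Update"}
SERVICE_TYPES = {"Load Balancer", "ClusterIP", "Ingress"}
TRAFFIC_TYPES = {"Public", "Private"}
GPU_SHARING_MODES = {"MIG", "MPS", "Time-slice"}


@dataclass
class ModelDeployment:
    """Hypothetical record mirroring the glossary terms (not a real API)."""

    image_registry: str                      # Image Registry
    image_tag: str                           # Image Tag
    cluster: str                             # Cluster
    image_source: str = "Public"             # Image Source
    instance_replica: int = 1                # Instance Replica
    deployment_strategy: str = "Rolling Update"
    startup_command: List[str] = field(default_factory=list)
    environment_variables: Dict[str, str] = field(default_factory=dict)
    nodes_selector: Dict[str, str] = field(default_factory=dict)
    service_type: str = "ClusterIP"          # Services Type
    traffic_type: str = "Private"            # Traffic Type
    port: int = 8080                         # Port (TCP), assumed default
    gpu_sharing_mode: Optional[str] = None   # None means no GPU sharing

    def validate(self) -> List[str]:
        """Return a list of human-readable problems; empty if none found."""
        errors = []
        if self.image_source not in IMAGE_SOURCES:
            errors.append(f"unknown image source: {self.image_source}")
        if self.deployment_strategy not in DEPLOYMENT_STRATEGIES:
            errors.append(f"unknown strategy: {self.deployment_strategy}")
        if self.service_type not in SERVICE_TYPES:
            errors.append(f"unknown service type: {self.service_type}")
        if self.traffic_type not in TRAFFIC_TYPES:
            errors.append(f"unknown traffic type: {self.traffic_type}")
        if (self.gpu_sharing_mode is not None
                and self.gpu_sharing_mode not in GPU_SHARING_MODES):
            errors.append(f"unknown GPU sharing mode: {self.gpu_sharing_mode}")
        if self.instance_replica < 1:
            errors.append("instance replica must be at least 1")
        if not 1 <= self.port <= 65535:
            errors.append("port must be between 1 and 65535")
        return errors


# Example values are placeholders, not real FPT Cloud resources.
deployment = ModelDeployment(
    image_registry="registry.example.com/team/model",
    image_tag="1.0.0",
    cluster="demo-cluster",
    gpu_sharing_mode="MIG",
)
print(deployment.validate())  # → []
```

Returning a list of problems rather than raising on the first one lets a caller report every invalid field at once, which suits a form with many options.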