Skip to main content

Deployment configuration

Go to AI Platform --> Model Serving --> Deployment --> Deployment Name --> Deployment Settings

Alt text

InformationDescriptionPossible actions
Instance ReplicaNumber of Instances in DeploymentEdit increase/decrease instance
Resource FlavorPreset configuration of CPU/RAM/DISK/GPUEdit change Flavor
vCPUProcessor configuration (core)Edit increase/decrease vCPU
RAMProcessor configuration (core)Edit increase/decrease vCPU
GPUGraphics processor configuration (piece)Edit increase/decrease GPU
Storage TypeStorage capacity typeEdit change Storage type
Access ModeStorage access typeEdit change access type
Mount PathPath in Instance connecting to StorageEdit change path connecting to Storage
Volume CapacityStorage capacity (GB)Edit increase/decrease storage capacity
Deployment StrategyDeployment strategy (in K8S)Edit change deployment strategy
Startup CommandCommand to run when starting instanceEdit change launch command
ArgumentsInstance startup argumentsEdit change launch arguments
Environment VariableEnvironment variables assigned to the instanceEdit change environment variables
Nodes SelectorConfigure the label Worker nodes/Worker group to deploy toEdit change label
Application Health CheckConfigure the application health checkEdit to change the health check configuration
Readiness ProbeCheck if the instance is readyEdit to change the health check configuration
Liveness ProbeCheck if the instance is runningEdit to change the health check configuration
Startup ProbeCheck if the instance has startedEdit to change the health check configuration
ProtocolProtocol used for status checksEdit to change the health check configuration
PathPath used for status checksEdit to change the health check configuration
PortPort used for status checksEdit to change the health check configuration
Initial Delay SecondsInitial delay (in seconds) before starting health check actionsEdit to change the health check configuration
Period SecondsFrequency of probes in seconds. Default is 10 seconds. Minimum value is 1Edit to change the health check configuration
Timeout SecondsTimeout duration (in seconds) for probe actions. Default is 1 second. Minimum value is 1Edit to change the health check configuration
Success ThresholdNumber of consecutive successful probes after failure to consider it successful. Default is 1. Must be 1 for liveness. Minimum value is 1Edit to change the health check configuration
Failure ThresholdNumber of consecutive failed probes before an action is triggered. For liveness, the container restarts; for readiness, the instance is marked Unready. Default is 3. Minimum value is 1Edit to change the health check configuration
HTTP HeadersCreate HTTP headers in Key-Value format for health checksEdit to change the health check configuration
TagsAssign labels to the DeploymentEdit to add, edit, or delete labels assigned to the Deployment
Note :
TitleDescription
------
Change Deployment FlavorCurrently, changing the deployment resource from Flavor to Custom is not supported. This feature will be developed later.
Change ProcessWhen information in the Deployment Setting is updated, the Instance will be redeployed according to the new configuration.