Deploy LoRA inference

How to deploy a fine-tuned LoRA model?

You can deploy a fine-tuned LoRA model and call it through the API right away, without managing any infrastructure.

Steps

Go to the Deployment page from the navigation bar.
- Or click View deployment from the success pop-up after fine-tuning.

Click Deploy next to the LoRA model you want to deploy.
- The status changes to Deploying.
- Once deployment succeeds, the status shows Deployed.

How to manage deployed models?

On the Deployment page, you can:

Get API Key — retrieve the key to call your model.
View API request — open a pop-up with a sample JSON response.
Try in Playground — test the model directly in the UI.
Undeploy — stop the deployed model (confirmation required).
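
Once you have a key from Get API Key, a call to the deployed model might be assembled roughly as follows. This is only a sketch: the endpoint URL, the Bearer authorization scheme, and the `prompt` payload field are assumptions for illustration, not the platform's documented API. Copy the real URL and body from the View API request pop-up.

```python
import json
import urllib.request

# Hypothetical placeholder URL; replace it with the one shown in the
# "View API request" pop-up for your deployed model.
ENDPOINT_URL = "https://example.invalid/v1/generate"


def build_inference_request(
    api_key: str, prompt: str, url: str = ENDPOINT_URL
) -> urllib.request.Request:
    """Build (but do not send) a POST request for a deployed model."""
    if not api_key:
        raise ValueError("api_key is empty; retrieve it with Get API Key")

    # Assumed payload shape: a single "prompt" field.
    body = json.dumps({"prompt": prompt}).encode("utf-8")

    return urllib.request.Request(
        url,
        data=body,
        headers={
            # Assumed auth scheme: a Bearer token in the Authorization header.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

To send the request, pass the returned object to `urllib.request.urlopen`. Keeping the key out of source code (for example in an environment variable) avoids leaking it through version control.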

Status badges

Deploying — model is being deployed.
Deployed — model is ready for inference.
Stopped — model is undeployed.
Failed — deployment failed.
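
The four statuses above can be treated as a small state machine when automating a deployment. The sketch below waits for a model to leave Deploying. It assumes you have some way to read the status programmatically; this page only describes the UI badges, so `get_status` is a hypothetical callable you would supply yourself.

```python
import time
from typing import Callable


def wait_for_deployment(
    get_status: Callable[[], str],
    poll_seconds: float = 5.0,
    timeout_seconds: float = 600.0,
) -> str:
    """Poll until the status is Deployed; raise on Failed, Stopped, or timeout.

    get_status is a hypothetical caller-supplied function that returns one of
    the badge names: "Deploying", "Deployed", "Stopped", "Failed".
    """
    deadline = time.monotonic() + timeout_seconds
    while True:
        status = get_status()
        if status == "Deployed":
            return status  # ready for inference
        if status in ("Failed", "Stopped"):
            raise RuntimeError(f"deployment ended with status {status}")
        if status != "Deploying":
            raise ValueError(f"unexpected status: {status!r}")
        if time.monotonic() >= deadline:
            raise TimeoutError("model is still Deploying after the timeout")
        time.sleep(poll_seconds)
```

The timeout and polling interval are arbitrary defaults; adjust them to how long deployments take in practice.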
