Skip to main content

Select Base Model

Retrieve base models from the Model Hub in two ways:

  • Model Catalog: A repository of model sources from various providers such as DeepSeek, Gemma, Llama, and Qwen.
  • Private Model: A repository for user-owned models and fine-tuned models.

Alt text

The Model Catalog includes the following models:

Base modelModel familyModel typeModel sizeLearning stage
deepseek-ai/DeepSeek-R1-Distill-Llama-70BDeepSeekLLM70BInstruction-tuned
deepseek-ai/DeepSeek-R1-Distill-Llama-8BDeepSeekLLM8BInstruction-tuned
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5BDeepSeekLLM1.5BInstruction-tuned
deepseek-ai/DeepSeek-R1-Distill-Qwen-14BDeepSeekLLM14BInstruction-tuned
deepseek-ai/DeepSeek-R1-Distill-Qwen-32BDeepSeekLLM32BInstruction-tuned
deepseek-ai/DeepSeek-R1-Distill-Qwen-7BDeepSeekLLM7BInstruction-tuned
google/gemma-3-12b-itGemmaLLM2BInstruction-tuned
google/gemma-3-12b-ptGemmaLLM2BPre-trained
google/gemma-3-1b-itGemmaLLM1BInstruction-tuned
google/gemma-3-1b-ptGemmaLLM1BPre-trained
google/gemma-3-27b-itGemmaLLM27BInstruction-tuned
google/gemma-3-27b-ptGemmaLLM27BPre-trained
google/gemma-3-4b-itGemmaLLM4BInstruction-tuned
google/medgemma-27b-text-itGemmaLLM (Medical)27BInstruction-tuned
meta-llama/Llama-3.1-70BLlamaLLM70BPre-trained
meta-llama/Llama-3.1-70B-InstructLlamaLLM70BInstruction-tuned
meta-llama/Llama-3.1-8BLlamaLLM8BPre-trained
meta-llama/Llama-3.1-8B-InstructLlamaLLM8BInstruction-tuned
meta-llama/Llama-3.2-1BLlamaLLM1BPre-trained
meta-llama/Llama-3.2-1B-InstructLlamaLLM1BInstruction-tuned
meta-llama/Llama-3.2-3BLlamaLLM3BPre-trained
meta-llama/Llama-3.2-3B-InstructLlamaLLM3BInstruction-tuned
meta-llama/Llama-3.3-70B-InstructLlamaLLM70BInstruction-tuned
mistralai/Mixtral-8x7B-Instruct-v0.1MistralMoE LLM8x7BInstruction-tuned
mistralai/Mixtral-8x7B-v0.1MistralMoE LLM8x7BPre-trained
Qwen/Qwen2-0.5BQwenLLM0.5BPre-trained
Qwen/Qwen2-0.5B-InstructQwenLLM0.5BInstruction-tuned
Qwen/Qwen2-1.5BQwenLLM1.5BPre-trained
Qwen/Qwen2-1.5B-InstructQwenLLM1.5BInstruction-tuned
Qwen/Qwen2-72BQwenLLM72BPre-trained
Qwen/Qwen2-72B-InstructQwenLLM72BInstruction-tuned
Qwen/Qwen2-7BQwenLLM7BPre-trained
Qwen/Qwen2-7B-InstructQwenLLM7BInstruction-tuned
Qwen/Qwen2-VL-2BQwenVLM2BPre-trained
Qwen/Qwen2-VL-2B-InstructQwenVLM2BInstruction-tuned
Qwen/Qwen2-VL-72BQwenVLM72BPre-trained
Qwen/Qwen2-VL-72B-InstructQwenVLM72BInstruction-tuned
Qwen/Qwen2-VL-7BQwenVLM7BPre-trained
Qwen/Qwen2-VL-7B-InstructQwenVLM7BInstruction-tuned
Qwen/Qwen2.5-0.5BQwenLLM0.5BPre-trained
Qwen/Qwen2.5-0.5B-InstructQwenLLM0.5BInstruction-tuned
Qwen/Qwen2.5-1.5BQwenLLM1.5BPre-trained
Qwen/Qwen2.5-1.5B-InstructQwenLLM1.5BInstruction-tuned
Qwen/Qwen2.5-14BQwenLLM14BPre-trained
Qwen/Qwen2.5-14B-InstructQwenLLM14BInstruction-tuned
Qwen/Qwen2.5-32BQwenLLM32BPre-trained
Qwen/Qwen2.5-32B-InstructQwenLLM32BInstruction-tuned
Qwen/Qwen2.5-3BQwenLLM3BPre-trained
Qwen/Qwen2.5-3B-InstructQwenLLM3BInstruction-tuned
Qwen/Qwen2.5-72BQwenLLM72BPre-trained
Qwen/Qwen2.5-72B-InstructQwenLLM72BInstruction-tuned
Qwen/Qwen2.5-7BQwenLLM7BPre-trained
Qwen/Qwen2.5-7B-InstructQwenLLM7BInstruction-tuned
Qwen/Qwen2.5-VL-32B-InstructQwenVLM32BInstruction-tuned
Qwen/Qwen2.5-VL-3B-InstructQwenVLM3BInstruction-tuned
Qwen/Qwen2.5-VL-72B-InstructQwenVLM72BInstruction-tuned
Qwen/Qwen2.5-VL-7B-InstructQwenVLM7BInstruction-tuned
Qwen/Qwen3-0.6BQwenLLM0.6BPre-trained
Qwen/Qwen3-1.7BQwenLLM1.7BPre-trained
Qwen/Qwen3-14BQwenLLM14BPre-trained
Qwen/Qwen3-30B-A3BQwenLLM30BPre-trained
Qwen/Qwen3-32BQwenLLM32BPre-trained
Qwen/Qwen3-4BQwenLLM4BPre-trained
Qwen/Qwen3-8BQwenLLM8BPre-trained
The Private Model , if you want to upload your models, please contact us or follow the guide upload model through SDK