Skip to main content
Back to top
Ctrl
+
K
Xinference
Getting Started
Models
User Guide
Examples
API Reference
Development
GitHub
Slack
Twitter
Getting Started
Models
User Guide
Examples
API Reference
Development
GitHub
Slack
Twitter
Section Navigation
Model Abilities
Chat & Generate
Tools
Vision
Embeddings
Rerank
Images
Audio (Experimental)
Video (Experimental)
Builtin Models
Large language Models
aquila2
aquila2-chat
aquila2-chat-16k
baichuan-2
baichuan-2-chat
c4ai-command-r-v01
chatglm3
chatglm3-128k
chatglm3-32k
code-llama
code-llama-instruct
code-llama-python
codegeex4
codeqwen1.5
codeqwen1.5-chat
codeshell
codeshell-chat
codestral-v0.1
cogvlm2
cogvlm2-video-llama3-chat
csg-wukong-chat-v0.1
deepseek
deepseek-chat
deepseek-coder
deepseek-coder-instruct
deepseek-vl-chat
gemma-2-it
gemma-it
glm-4v
glm4-chat
glm4-chat-1m
gorilla-openfunctions-v1
gorilla-openfunctions-v2
gpt-2
internlm2-chat
internlm2.5-chat
internlm2.5-chat-1m
internvl-chat
internvl2
llama-2
llama-2-chat
llama-3
llama-3-instruct
llama-3.1
llama-3.1-instruct
minicpm-2b-dpo-bf16
minicpm-2b-dpo-fp16
minicpm-2b-dpo-fp32
minicpm-2b-sft-bf16
minicpm-2b-sft-fp32
MiniCPM-Llama3-V-2_5
MiniCPM-V-2.6
mistral-instruct-v0.1
mistral-instruct-v0.2
mistral-instruct-v0.3
mistral-large-instruct
mistral-nemo-instruct
mistral-v0.1
mixtral-8x22B-instruct-v0.1
mixtral-instruct-v0.1
mixtral-v0.1
OmniLMM
openhermes-2.5
opt
orion-chat
orion-chat-rag
phi-2
phi-3-mini-128k-instruct
phi-3-mini-4k-instruct
platypus2-70b-instruct
qwen-chat
qwen-vl-chat
qwen1.5-chat
qwen1.5-moe-chat
qwen2-instruct
qwen2-moe-instruct
seallm_v2
seallm_v2.5
Skywork
Skywork-Math
Starling-LM
telechat
tiny-llama
wizardcoder-python-v1.0
wizardmath-v1.0
xverse
xverse-chat
Yi
Yi-1.5
Yi-1.5-chat
Yi-1.5-chat-16k
Yi-200k
Yi-chat
yi-vl-chat
zephyr-7b-alpha
zephyr-7b-beta
Embedding Models
bce-embedding-base_v1
bge-base-en
bge-base-en-v1.5
bge-base-zh
bge-base-zh-v1.5
bge-large-en
bge-large-en-v1.5
bge-large-zh
bge-large-zh-noinstruct
bge-large-zh-v1.5
bge-m3
bge-small-en-v1.5
bge-small-zh
bge-small-zh-v1.5
e5-large-v2
gte-base
gte-large
gte-Qwen2
jina-embeddings-v2-base-en
jina-embeddings-v2-base-zh
jina-embeddings-v2-small-en
m3e-base
m3e-large
m3e-small
multilingual-e5-large
text2vec-base-chinese
text2vec-base-chinese-paraphrase
text2vec-base-chinese-sentence
text2vec-base-multilingual
text2vec-large-chinese
Image Models
FLUX.1-dev
FLUX.1-schnell
kolors
sd-turbo
sd3-medium
sdxl-turbo
stable-diffusion-2-inpainting
stable-diffusion-inpainting
stable-diffusion-v1.5
stable-diffusion-xl-base-1.0
stable-diffusion-xl-inpainting
Audio Models
Belle-distilwhisper-large-v2-zh
Belle-whisper-large-v2-zh
Belle-whisper-large-v3-zh
ChatTTS
CosyVoice-300M
CosyVoice-300M-Instruct
CosyVoice-300M-SFT
FishSpeech-1.2-SFT
SenseVoiceSmall
whisper-base
whisper-base.en
whisper-large-v3
whisper-medium
whisper-medium.en
whisper-small
whisper-small.en
whisper-tiny
whisper-tiny.en
Rerank Models
bce-reranker-base_v1
bge-reranker-base
bge-reranker-large
bge-reranker-v2-gemma
bge-reranker-v2-m3
bge-reranker-v2-minicpm-layerwise
jina-reranker-v2
Video Models
CogVideoX-2b
Custom Models
Download Sources
LoRA Integration
Model Memory Calculation
Models
Builtin Models
Video Models
Video Models
#
The following is a list of built-in video models in Xinference:
CogVideoX-2b
Show Source