Xinference

Model Abilities

  • Chat & Generate
    • Introduction
    • Quickstart
    • FAQ
  • Tools
    • Introduction
    • Quickstart
  • Vision
    • Introduction
    • Quickstart
  • Embeddings
    • Introduction
    • Quickstart
    • FAQ
  • Rerank
    • Introduction
    • Quickstart
  • Images
    • Introduction
    • Quickstart
  • Audio (Experimental)
    • Introduction
    • Quickstart
  • Video (Experimental)
    • Introduction
    • Quickstart

© Copyright 2023, Xorbits Inc.
