Welcome to Xinference!#

Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications.

Developing Real-world AI Applications with Xinference#

from xinference.client import Client

client = Client("http://localhost:9997")
model = client.get_model("MODEL_UID")

# Chat to LLM
model.chat(
   messages=[{"role": "system", "content": "You are a helpful assistant"}, {"role": "user", "content": "What is the largest animal?"}],
   generate_config={"max_tokens": 1024}
)

# Chat to VL model
model.chat(
   messages=[
     {
        "role": "user",
        "content": [
           {"type": "text", "text": "What’s in this image?"},
           {
              "type": "image_url",
              "image_url": {
                 "url": "http://i.epochtimes.com/assets/uploads/2020/07/shutterstock_675595789-600x400.jpg",
              },
           },
        ],
     }
  ],
  generate_config={"max_tokens": 1024}
)

Getting Started#

Install Xinference

Install Xinference on Linux, Windows, and macOS.

Installation
Try it out!

Start by running Xinference on a local machine.

Using Xinference
Explore models

Explore a wide range of models supported by Xinference.

Builtin Models
Register your own model

Register model weights and turn it into an API.

Custom Models

Explore the API#

Chat & Generate

Learn how to chat with LLMs in Xinference.

Chat & Generate
Tools

Learn how to connect LLM with external tools.

Tools
Embeddings

Learn how to create text embeddings in Xinference.

Embeddings
Rerank

Learn how to use rerank models in Xinference.

Rerank
Images

Learn how to generate images with Xinference.

Images
Vision

Learn how to process image with LLMs.

Vision
Audio

Learn how to turn audio into text or text into audio with Xinference.

Audio (Experimental)
Video

Learn how to generate video with Xinference.

Video (Experimental)

Getting Involved#