Welcome to Xinference!#

Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications.

Developing Real-world AI Applications with Xinference#

from xinference.client import Client

client = Client("http://localhost:9997")
model = client.get_model("MODEL_UID")

# Chat to LLM
model.chat(
   prompt="What is the largest animal?",
   system_prompt="You are a helpful assistant",
   generate_config={"max_tokens": 1024}
)

# Chat to VL model
model.chat(
   chat_history=[
     {
        "role": "user",
        "content": [
           {"type": "text", "text": "What’s in this image?"},
           {
              "type": "image_url",
              "image_url": {
                 "url": "http://i.epochtimes.com/assets/uploads/2020/07/shutterstock_675595789-600x400.jpg",
              },
           },
        ],
     }
  ],
  generate_config={"max_tokens": 1024}
)

from xinference.client import Client

client = Client("http://localhost:9997")
model = client.get_model("MODEL_UID")

model.create_embedding("What is the capital of China?")

from xinference.client import Client

client = Client("http://localhost:9997")
model = client.get_model("MODEL_UID")

model.text_to_image("An astronaut walking on the mars")

from xinference.client import Client

client = Client("http://localhost:9997")
model = client.get_model("MODEL_UID")

with open("speech.mp3", "rb") as audio_file:
    model.transcriptions(audio_file.read())

from xinference.client import Client

client = Client("http://localhost:9997")
model = client.get_model("MODEL_UID")

query = "A man is eating pasta."
corpus = [
  "A man is eating food.",
  "A man is eating a piece of bread.",
  "The girl is carrying a baby.",
  "A man is riding a horse.",
  "A woman is playing violin."
]
print(model.rerank(corpus, query))

Getting Started#

Install Xinference

Install Xinference on Linux, Windows, and macOS.

Try it out!

Start by running Xinference on a local machine.

Explore models

Explore a wide range of models supported by Xinference.

Explore the API#

Chat & Generate

Learn how to chat with LLMs in Xinference.

Tools

Learn how to connect LLM with external tools.

Embeddings

Learn how to create text embeddings in Xinference.

Rerank

Learn how to use rerank models in Xinference.

Images

Learn how to generate images with Xinference.

Vision

Learn how to process image with LLMs.

Audio

Learn how to turn audio into text or text into audio with Xinference.

Getting Involved#

Get Latest News

Read our blogs

Get Support

Find community on WeChat

Find community on Slack

Open an issue

Contribute to Xinference

Create a pull request