xinference.client.Client.describe_model#

Client.describe_model(model_uid: str)[source]#

Get model information via RESTful APIs.

Parameters:

model_uid (str) – The unique id that identify the model.

Returns:

A dictionary containing the following keys:

  • ”model_type”: str

    the type of the model determined by its function, e.g. “LLM” (Large Language Model)

  • ”model_name”: str

    the name of the specific LLM model family

  • ”model_lang”: List[str]

    the languages supported by the LLM model

  • ”model_ability”: List[str]

    the ability or capabilities of the LLM model

  • ”model_description”: str

    a detailed description of the LLM model

  • ”model_format”: str

    the format specification of the LLM model

  • ”model_size_in_billions”: int

    the size of the LLM model in billions

  • ”quantization”: str

    the quantization applied to the model

  • ”revision”: str

    the revision number of the LLM model specification

  • ”context_length”: int

    the maximum text length the LLM model can accommodate (include all input & output)

Return type:

dict

Raises:

RuntimeError – Report failure to get the wanted model with given model_uid. Provide details of failure through error message.