xinference.client.Client#
- class xinference.client.Client(base_url, api_key: str | None = None)[源代码]#
-
Methods
__init__(base_url[, api_key])describe_model(model_uid)Get model information via RESTful APIs.
get_model(model_uid)Launch the model based on the parameters on the server via RESTful APIs.
get_model_registration(model_type, model_name)Get the model with the model type and model name registered on the server.
launch_model(model_name[, model_type, ...])Launch the model based on the parameters on the server via RESTful APIs.
list_model_registrations(model_type)List models registered on the server.
Retrieve the model specifications from the Server.
login(username, password)query_engine_by_model_name(model_name)Get the engine parameters with the model name registered on the server.
register_model(model_type, model, persist)Register a custom model.
terminate_model(model_uid)Terminate the specific model running on the server.
unregister_model(model_type, model_name)Unregister a custom model.
vllm_models()