xinference.client.Client.list_cached_models#

Client.list_cached_models(model_name: str | None = None, worker_ip: str | None = None) List[Dict[Any, Any]][source]#

Get a list of cached models. :param model_name: The name of model. :type model_name: Optional[str] :param worker_ip: Specify the worker ip where the model is located in a distributed scenario. :type worker_ip: Optional[str]

Returns:

The collection of cached models on the server.

Return type:

List[Dict[Any, Any]]

Raises:

RuntimeError – Raised when the request fails, including the reason for the failure.