xinference.client.Client.list_cached_models#
- Client.list_cached_models(model_name: str | None = None, worker_ip: str | None = None) List[Dict[Any, Any]][source]#
Get a list of cached models. :param model_name: The name of model. :type model_name: Optional[str] :param worker_ip: Specify the worker ip where the model is located in a distributed scenario. :type worker_ip: Optional[str]
- Returns:
The collection of cached models on the server.
- Return type:
List[Dict[Any, Any]]
- Raises:
RuntimeError – Raised when the request fails, including the reason for the failure.