xinference.client.Client.list_cached_models

Client.list_cached_models(model_name: str | None = None, worker_ip: str | None = None) → List[Dict[Any, Any]]

Get a list of cached models.

Parameters:

model_name (Optional[str]) -- The name of the model.

worker_ip (Optional[str]) -- The IP address of the worker where the model is located, in a distributed deployment.

Returns:

The collection of cached models on the server.

Return type:

List[Dict[Any, Any]]

Raises:

RuntimeError -- Raised when the request fails. The error message includes the reason for the failure.
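
The following is a minimal sketch of how this method might be called while handling the documented RuntimeError. The helper name `cached_models_or_empty` is hypothetical and not part of xinference. It accepts any object that exposes `list_cached_models`, so it can be tried without a running server. The keys of the returned dictionaries are not specified on this page, so the sketch does not inspect them.

```python
from typing import Any, Dict, List, Optional


def cached_models_or_empty(
    client: Any,
    model_name: Optional[str] = None,
    worker_ip: Optional[str] = None,
) -> List[Dict[Any, Any]]:
    """Return the cached models, or an empty list if the request fails.

    `client` is assumed to be an xinference.client.Client instance, or any
    object that provides a compatible list_cached_models method.
    """
    try:
        return client.list_cached_models(model_name=model_name, worker_ip=worker_ip)
    except RuntimeError as exc:
        # The documentation states that the error carries the failure reason.
        print(f"Could not list cached models: {exc}")
        return []
```

With a running server, the helper could be called as `cached_models_or_empty(Client("http://localhost:9997"))` after `from xinference.client import Client`. The endpoint URL here is an assumption and should be replaced with the address of the actual deployment. Returning an empty list on failure is a design choice for this sketch only; callers that need to tell "no cached models" apart from "request failed" should let the exception propagate instead.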