xinference.client.Client.describe_model#
- Client.describe_model(model_uid: str)[sorgente]#
Get model information via RESTful APIs.
- Parametri:
model_uid (str) – The unique id that identify the model.
- Ritorna:
A dictionary containing the following keys:
- »model_type»: str
the type of the model determined by its function, e.g. «LLM» (Large Language Model)
- »model_name»: str
the name of the specific LLM model family
- »model_lang»: List[str]
the languages supported by the LLM model
- »model_ability»: List[str]
the ability or capabilities of the LLM model
- »model_description»: str
a detailed description of the LLM model
- »model_format»: str
the format specification of the LLM model
- »model_size_in_billions»: int
the size of the LLM model in billions
- »quantization»: str
the quantization applied to the model
- »revision»: str
the revision number of the LLM model specification
- »context_length»: int
the maximum text length the LLM model can accommodate (include all input & output)
- Tipo di ritorno:
dict
- Solleva:
RuntimeError – Report failure to get the wanted model with given model_uid. Provide details of failure through error message.