List Models

List Available Inference Models.

Return a list of all available models and their details.

HTTP Request

GET {API_URL}/serverless/models

Response Attributes

data dict

A dict containing a key-value pair of models: [list of models]. Each model in this list is represented as a dict, containing the following model information:

  • model_name string: The unique name for the model. Use this value when specifying the desired models in API calls.

  • model_alias string: An name for the model that is easy to interpret.

  • model_type string: The type of data output by the model.

  • context_length string: The length of the context required by the model.

  • max_completion_tokens string: The maximum number of tokens that will be generated for each response, if applicable.

  • description: A brief description of the model.

  • price: The price of the model.

  • huggingface_url: The URL of the model on hugging face, if applicable.

status string

Indicates the result of the request. success signifies success, while failed indicates an error.

message string

A description of the status of the request.

Example

Request

Response

Here's an example of a successful response.

Last updated