Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.gpuhub.com/llms.txt

Use this file to discover all available pages before exploring further.

MiniMax-M2.5 is available through the OpenAI-compatible chat completions API. It is suitable for chat, reasoning, writing, coding assistance, and agent-style application workflows.
Create or copy your API key from the console before calling the API. Use the model unit price displayed in the console as the source of truth.

Endpoint

CompatibilityBase URL
OpenAI-compatiblehttps://api.gpuhub.com/api/v1

curl

curl --location --request POST "https://api.gpuhub.com/api/v1/chat/completions" \
  --header "Authorization: Bearer $API_KEY" \
  --header "Content-Type: application/json" \
  --data-raw '{
    "model": "MiniMax-M2.5",
    "messages": [
      {
        "role": "user",
        "content": "Hello! Please draft a short product announcement."
      }
    ],
    "stream": true
  }'

Python

# Install the SDK first if needed:
# pip install openai

from openai import OpenAI

client = OpenAI(
    base_url="https://api.gpuhub.com/api/v1",
    api_key="YOUR_API_KEY",
)

stream = client.chat.completions.create(
    model="MiniMax-M2.5",
    messages=[
        {"role": "user", "content": "Hello! Please draft a short product announcement."}
    ],
    stream=True,
)

for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")

Use with Cherry Studio

To use this model in Cherry Studio, configure the matching provider type and add the model name shown on this page. See Cherry Studio for setup details.