🧘

ZenMux

Speed: ⚡ 37 tok/s avg
Models: 127
Price: 700 req/day free
Status: ✅ Online
Best For: Gemini, Qwen

💰 Plan & Pricing

✅ 700 Free Requests/Day
The free tier allows 700 requests per day with no credit card required, and it covers Gemini and Qwen models well.

🔑 API Key

sk-ss-...f72f

🌐 Endpoint

https://zenmux.ai/api/v1/chat/completions

📦 Key Models (127 total)

| Model | Speed | Category | Notes |
|---|---|---|---|
| google/gemini-2.5-flash | ⚡ 90 tok/s | Chat | 🥇 Best Gemini Flash |
| google/gemini-2.5-pro | ⚡ 41 tok/s | Chat/Reasoning | 🥇 Best Gemini Pro |
| llama-3.3-70b-versatile | ⚡ 52 tok/s | Chat | 2nd best for Llama |
| qwen/qwen3-32b | ⚡ 38 tok/s | Chat | Good Qwen coverage |
| llama-4-maverick | ⚡ 42 tok/s | Chat | Llama 4 Maverick |
| deepseek-r1 | ⚡ 35 tok/s | Reasoning | DeepSeek R1 |
| claude-sonnet-4 | ⚡ 30 tok/s | Chat | Claude Sonnet 4 |
| qwen3-235b | ⚡ 36 tok/s | Chat/Coding | Qwen3 235B |
| minimax-m2.7 | ⚡ 27 tok/s | Chat | MiniMax M2.7 |
| zenmux/auto | ⚡ varies | Auto | ⚡ Auto-selects best model |

+ 117 more models with strong Gemini and Qwen coverage; auto routing available
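
As a rough illustration of how the model table might drive model selection, the sketch below hard-codes a few rows (speeds and categories copied from the table) and returns the fastest listed model for a category, falling back to the auto router when nothing matches. The `MODELS` dictionary and the `fastest_model` helper are hypothetical names for this example, not part of any ZenMux SDK.

```python
# Speeds (tok/s) and categories copied from the Key Models table.
# MODELS and fastest_model are illustrative names, not a ZenMux API.
MODELS = {
    "google/gemini-2.5-flash": (90, {"Chat"}),
    "google/gemini-2.5-pro": (41, {"Chat", "Reasoning"}),
    "llama-3.3-70b-versatile": (52, {"Chat"}),
    "qwen/qwen3-32b": (38, {"Chat"}),
    "deepseek-r1": (35, {"Reasoning"}),
    "qwen3-235b": (36, {"Chat", "Coding"}),
}


def fastest_model(category: str, fallback: str = "zenmux/auto") -> str:
    """Return the fastest listed model for a category, else the auto router."""
    candidates = [
        (speed, name)
        for name, (speed, categories) in MODELS.items()
        if category in categories
    ]
    if not candidates:
        return fallback
    # max() compares tuples by speed first, so the fastest model wins.
    return max(candidates)[1]


print(fastest_model("Reasoning"))  # google/gemini-2.5-pro
print(fastest_model("Vision"))     # zenmux/auto
```

The listed speeds are snapshots from this page, so a real selector would probably want to refresh them rather than rely on constants.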

💻 cURL Example

curl -X POST https://zenmux.ai/api/v1/chat/completions \
  -H "Authorization: Bearer sk-ss-...f72f" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemini-2.5-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

🐍 Python Example

from openai import OpenAI

client = OpenAI(
    api_key="sk-ss-...f72f",
    base_url="https://zenmux.ai/api/v1"
)

response = client.chat.completions.create(
    model="google/gemini-2.5-flash",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
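
The examples above embed the key directly in the source. A possible variation, sketched here with only the standard library, reads the key from an environment variable and builds the same POST request as the cURL example. The variable name `ZENMUX_API_KEY` and the helpers `build_request` and `send` are assumptions made for this sketch; the response shape is assumed to match the one used in the Python example.

```python
import json
import os
import urllib.request

ENDPOINT = "https://zenmux.ai/api/v1/chat/completions"  # from the Endpoint section


def build_request(prompt: str, model: str = "google/gemini-2.5-flash") -> urllib.request.Request:
    """Build (but do not send) a chat completion request."""
    # ZENMUX_API_KEY is a hypothetical variable name chosen for this example.
    api_key = os.environ["ZENMUX_API_KEY"]
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the first reply (needs network and a valid key)."""
    with urllib.request.urlopen(request) as response:
        reply = json.load(response)
    return reply["choices"][0]["message"]["content"]
```

Calling `send(build_request("Hello!"))` would perform the actual request. Keeping construction and sending separate makes the payload easy to inspect without spending one of the daily requests.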

⚠️ Pitfalls & Notes

💡 Best Gemini Coverage: ZenMux is rated best for Gemini 2.5 Flash, serving it at 90 tok/s. A good fit for Gemini-heavy workloads.
⚠️ Rate Limit: The free tier is limited to 700 requests per day, which suits moderate but not heavy usage.
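
One way to stay under the daily limit is to count requests on the client before sending them. The sketch below is a minimal in-memory counter; the `DailyQuota` class is a hypothetical helper, and it assumes the quota resets at the start of the local calendar day, which this page does not confirm.

```python
import datetime
from typing import Optional


class DailyQuota:
    """Client-side counter for a per-day request budget (700 on the free tier)."""

    def __init__(self, limit: int = 700):
        self.limit = limit
        self.day = datetime.date.today()
        self.used = 0

    def try_acquire(self, today: Optional[datetime.date] = None) -> bool:
        """Record one request; return False once the day's budget is spent."""
        today = today or datetime.date.today()
        if today != self.day:
            # Assumption: the quota resets when the calendar day changes.
            self.day = today
            self.used = 0
        if self.used >= self.limit:
            return False
        self.used += 1
        return True

    @property
    def remaining(self) -> int:
        """Requests still available for the current day."""
        return self.limit - self.used


quota = DailyQuota(limit=3)
print([quota.try_acquire() for _ in range(4)])  # [True, True, True, False]
```

The counter lives in memory only, so it resets when the process restarts; a longer-running setup would need to persist `day` and `used` somewhere.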

🏷️ Categories

Chat Vision Coding Reasoning Video