๐ท๏ธ Models by Category
Find the best models across all providers for each use case โ click a tab to filter
๐ฌ Chat Models
General-purpose conversational AI โ all providers offer chat models
| Model | Provider | Speed | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| llama-3.3-70b-versatile | Groq | โก 212 tok/s | 128K | Free | Best free fast chat |
| openai/gpt-oss-120b | Groq | โก 215 tok/s | 128K | Free | GPT-OSS on LPU |
| groq/compound | Groq | โก 289 tok/s | 128K | Free | Agentic compound model |
| llama-3.1-8b-instant | Groq | โก 300+ tok/s | 128K | Free | Ultra-fast small model |
| qwen/qwen3-32b | Groq | โก 295 tok/s | 128K | Free | Qwen3 on LPU |
| deepseek-r1-distill-llama-70b | Groq | โก 180 tok/s | 128K | Free | Reasoning distill |
| mixtral-8x7b-32768 | Groq | โก 190 tok/s | 32K | Free | MoE architecture |
| gemma2-9b-it | Groq | โก 250 tok/s | 8K | Free | Google Gemma2 |
| step-3-chat | StepFun | โก 93 tok/s | โ | $9/mo | Strong Chinese/English |
| step-3-mini | StepFun | โก Fast | โ | $9/mo | Small/fast variant |
| glm-5 | OpenCode | โก 52 tok/s | โ | $10/mo | GLM-5 champion |
| glm-5.1 | OpenCode | โก 52 tok/s | โ | $10/mo | Improved GLM |
| deepseek-v4-flash | OpenCode | โก 58 tok/s | โ | $10/mo | Fast DeepSeek |
| minimax-m2.5 | OpenCode | โ | โ | $10/mo | MiniMax model |
| minimax-m2.7 | OpenCode | โ | โ | $10/mo | MiniMax v2.7 |
| kimi-k2.5 | OpenCode | โ | โ | $10/mo | Kimi latest |
| qwen3-235b | OpenCode | โ | โ | $10/mo | Qwen3 reasoning |
| gpt-oss:120b | Ollama | โก 71 tok/s | โ | Unlimited | Large open model |
| glm-5.1 | Ollama | โก 63 tok/s | โ | Unlimited | GLM-5.1 |
| deepseek-v4-pro | Ollama | โก 50 tok/s | โ | Unlimited | DeepSeek reasoner |
| kimi-k2.5 | Ollama | โก 58 tok/s | โ | Unlimited | Kimi latest |
| glm-4.7 | Ollama | โก 51 tok/s | โ | Unlimited | GLM-4.7 |
| deepseek-v4-flash | Ollama | โก 44 tok/s | โ | Unlimited | Flash variant |
| nemotron-3-super | Ollama | โก 102 tok/s | โ | Unlimited | ๐ Exclusive |
| minimax-m2.1 | Ollama | โก 83 tok/s | โ | Unlimited | ๐ Exclusive |
| qwen3-next:80b | Ollama | โก 82 tok/s | โ | Unlimited | ๐ Exclusive |
| qwen3.5:397b | Ollama | โก 72 tok/s | โ | Unlimited | ๐ Exclusive |
| kimi-k2.6 | Ollama | ๐ข 18 tok/s | โ | Unlimited | Kimi v2.6 |
| claude-opus-4-6 | Venice | โ | 200K | Uncensored | Uncensored Claude Opus |
| claude-sonnet-4-5 | Venice | โ | 200K | Uncensored | Uncensored Claude Sonnet |
| kimi-k2.5 | Venice | โก 86 tok/s | โ | Uncensored | Uncensored Kimi |
| kimi-k2.6 | Venice | โก 93 tok/s | โ | Uncensored | Uncensored Kimi v2.6 |
| grok-4 | Venice | โก 102 tok/s | โ | Uncensored | ๐ Exclusive uncensored |
| deepseek-v4-pro | Venice | โก 50 tok/s | โ | Uncensored | Uncensored DeepSeek |
| anthropic/claude-3.5-sonnet | OpenRouter | โ | 200K | Free (:free) | Free Claude tier |
| meta-llama/llama-3.1-8b | OpenRouter | โ | 128K | Free (:free) | Free Llama tier |
| google/gemini-2.5-pro | OpenRouter | โก 67 tok/s | 1M | Paid | Best Gemini model |
| google/gemini-2.5-flash | OpenRouter | โก 74 tok/s | 1M | Free/Paid | Fast Gemini |
| +320 more models | OpenRouter | โ | โ | Mixed | 356 total models |
| gemini-2.5-pro | ZenMux | โก 41 tok/s | 1M | Free 700/d | Best Gemini coverage |
| gemini-2.5-flash | ZenMux | โก 90 tok/s | 1M | Free 700/d | Fast Gemini |
| Qwen/Qwen3-32B-TEE | Chutes | โก 192 tok/s | โ | $20/mo | ๐ TEE privacy |
| zai-org/GLM-5.1-TEE | Chutes | โ | โ | $20/mo | ๐ TEE privacy |
| moonshotai/Kimi-K2.5-TEE | Chutes | โ | โ | $20/mo | ๐ TEE privacy |
| MiniMaxAI/MiniMax-M2.5-TEE | Chutes | โ | โ | $20/mo | ๐ TEE privacy |
| zai-org/GLM-5-TEE | Chutes | โ | โ | $20/mo | ๐ TEE privacy |
| unsloth/Mistral-Nemo-2407-TEE | Chutes | โ | โ | $20/mo | ๐ TEE privacy |
| zai-org/GLM-5-Turbo | Chutes | โ | โ | $20/mo | Fast GLM variant |
| claude-opus-4-6 | ORBIT | โก 29 tok/s | 200K | Free (2B/mo) | Claude-only, free |
| claude-sonnet-4-5 | ORBIT | โก 25 tok/s | 200K | Free (2B/mo) | Latest Sonnet |
| claude-3.5-sonnet | ORBIT | โก 28 tok/s | 200K | Free (2B/mo) | Stable Claude |
| claude-3-opus | ORBIT | ๐ข 22 tok/s | 200K | Free (2B/mo) | Legacy Opus |
| glm-4.7 | BytePlus | โก 71 tok/s | โ | $9/mo Pro | โ Verified & Working |
| glm-5.1 | BytePlus | โก 30 tok/s | โ | $9/mo Pro | โ Verified & Working |
| kimi-k2.5 | BytePlus | โก 26 tok/s | โ | $9/mo Pro | โ Verified & Working |
| gpt-oss-120b | BytePlus | ๐ข 15 tok/s | โ | $9/mo Pro | Large open model |
| ark-code-latest | BytePlus | ๐ข 15 tok/s | โ | $9/mo Pro | Auto-code agent |
| Doubao-Pro-32K | BytePlus | โ | 32K | $9/mo Pro | General API |
| Doubao-Pro-128K | BytePlus | โ | 128K | $9/mo Pro | General API |
| Doubao-Lite-32K | BytePlus | โ | 32K | $9/mo Pro | General API |
| Doubao-Lite-128K | BytePlus | โ | 128K | $9/mo Pro | General API |
| DeepSeek-V3.2 | BytePlus | โ | โ | $9/mo Pro | General API |
๐ป Coding Models
Models optimized for code generation, completion, and debugging
| Model | Provider | Speed | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| dola-seed-2.0-code | BytePlus | โก 80 tok/s | โ | $9/mo Pro | ๐ฅ Best coding model โ |
| dola-seed-2.0-lite | BytePlus | โก 84 tok/s | โ | $9/mo Pro | Fastest coder โ |
| bytedance-seed-code | BytePlus | โก 63 tok/s | โ | $9/mo Pro | Strong code completion โ |
| ark-code-latest | BytePlus | ๐ข 15 tok/s | โ | $9/mo Pro | Auto-code agent |
| qwen2.5-coder-32b | Groq | โก 212 tok/s | 128K | Free | ๐ฅ Best free coding |
| Qwen/Qwen2.5-Coder-32B-TEE | Chutes | โ | โ | $20/mo | ๐ TEE privacy |
| qwen3-coder-next | Ollama | โก 52 tok/s | โ | Unlimited | ๐ Exclusive |
| devstral-2:123b | Ollama | ๐ข 36 tok/s | โ | Unlimited | ๐ Exclusive code model |
| deepseek-v4-pro | OpenCode | โก 52 tok/s | โ | $10/mo | Strong at code tasks |
| codestral-latest | OpenRouter | โ | โ | Paid | Mistral code model |
๐ง Reasoning Models
Thinking/reasoning models that show chain-of-thought
| Model | Provider | Speed | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| dola-seed-2.0-pro | BytePlus | โก 50 tok/s | โ | $9/mo Pro | ๐ฅ Best reasoning โ |
| deepseek-v4-pro | OpenCode | โก 52 tok/s | โ | $10/mo | DeepSeek reasoning |
| deepseek-r1-distill-llama-70b | Groq | โก 180 tok/s | 128K | Free | Fast free reasoning |
| step-3-reasoning | StepFun | โก 93 tok/s | โ | $9/mo | StepFun reasoning |
| Qwen/Qwen3-235B-Thinking | Chutes | โก 40 tok/s | โ | $20/mo | TEE thinking mode |
| deepseek-ai/DeepSeek-V3.2-TEE | Chutes | โ | โ | $20/mo | ๐ TEE DeepSeek reason |
| Qwen/Qwen3-32B-TEE | Chutes | โก 192 tok/s | โ | $20/mo | ๐ TEE privacy |
| deepseek-v3.2 | Ollama | ๐ข 23 tok/s | โ | Unlimited | DeepSeek V3.2 reasoning |
| claude-opus-4-6 | ORBIT | โก 29 tok/s | 200K | Free (2B/mo) | Claude reasoning |
| qwen3-235b | OpenCode | โ | โ | $10/mo | Qwen3 reasoning |
| DeepSeek-V3.2 | BytePlus | โ | โ | $9/mo Pro | DeepSeek reasoning |
| o3 | Venice | โ | โ | Uncensored | Uncensored reasoning |
๐๏ธ Vision / OCR Models
Multimodal models that can process images and text
| Model | Provider | Speed | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| step-3-vl | StepFun | โก 93 tok/s | โ | $9/mo | Chinese/English OCR |
| glm-5-vision | OpenCode | โก 52 tok/s | โ | $10/mo | GLM vision |
| minimax-vl-01 | OpenCode | โ | โ | $10/mo | MiniMax vision |
| gemini-3-flash-preview | Ollama | โก 47 tok/s | โ | Unlimited | ๐ Exclusive |
| llama-3.2-90b-vision | Groq | โก Fast | โ | Free | Llama vision at speed |
| gemini-2.5-pro | ZenMux | โก 41 tok/s | 1M | Free 700/d | Best Gemini for vision |
| gemini-2.5-flash | ZenMux | โก 90 tok/s | 1M | Free 700/d | Fast Gemini vision |
| google/gemini-2.5-pro | OpenRouter | โก 67 tok/s | 1M | Paid | Gemini Pro vision |
| google/gemini-2.5-flash | OpenRouter | โก 74 tok/s | 1M | Free/Paid | Gemini Flash vision |
| moonshotai/Kimi-K2.5-TEE | Chutes | โ | โ | $20/mo | ๐ TEE, Vision+Chat |
| Qwen/Qwen3.5-397B-TEE | Chutes | โ | โ | $20/mo | ๐ TEE, Vision+Chat |
| Qwen/Qwen3.6-27B-TEE | Chutes | โ | โ | $20/mo | ๐ TEE, Vision |
| moonshotai/Kimi-K2.6-TEE | Chutes | โ | โ | $20/mo | ๐ TEE, Vision+Video |
| google/gemma-4-31B-turbo-TEE | Chutes | โ | โ | $20/mo | ๐ TEE, Gemma vision |
๐ค Speech-to-Text (STT) Models
Transcribe audio to text
| Model | Provider | Speed | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| whisper-large-v3 | Groq | โก Ultra-fast | โ | Free | ๐ฅ Best free STT, on LPU |
| whisper-large-v3-turbo | Groq | โก Ultra-fast | โ | Free | Faster Whisper variant |
| step-asr | StepFun | โ | โ | $9/mo | StepFun ASR engine |
| step-asr | Infermatic | โ | โ | $20/mo | Via Infermatic API |
๐ Text-to-Speech (TTS) Models
Generate speech from text
| Model | Provider | Speed | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| step-tts | StepFun | โ | โ | $9/mo | StepFun TTS engine |
| kokoro-82m | Infermatic | โ | โ | $20/mo | Kokoro TTS engine |
| Kokoro-82M | Chutes | โ | โ | $20/mo | Kokoro TTS on Chutes |
| Seed Speech | BytePlus | โ | โ | $9/mo Pro | BytePlus TTS โ |
๐ผ๏ธ Image Generation Models
Diffusion and generative image models
| Model | Provider | Type | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| FLUX.1-schnell | Chutes | Diffusion | โ | $20/mo | Fast FLUX generation |
| JuggernautXL-Ragnarok | Chutes | Diffusion | โ | $20/mo | Cinematic style |
| DreamShaper-XL | Chutes | Diffusion | โ | $20/mo | Artistic style |
| hunyuan-image-3 | Chutes | Diffusion | โ | $20/mo | Photorealistic |
| flux-1 | Venice | Diffusion | โ | Uncensored | Uncensored image gen |
| Various image models | ArliAI | Diffusion | โ | From $10/mo | Image gen API |
| SDXL / Stable models | Featherless | Diffusion | โ | Scale ร3 | Massive catalog, slow |
| Seedream-5.0-lite | BytePlus | Diffusion | 4K | $9/mo Pro | 4K image gen โ |
๐ฌ Video Generation
Models capable of generating or understanding video content
| Model | Provider | Speed | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| Seedance-2.0 | BytePlus | โ | โ | $9/mo Pro | Video generation โ |
| Seedance-2.0-fast | BytePlus | โ | โ | $9/mo Pro | Fast video gen variant โ |
| OmniHuman | BytePlus | โ | โ | $9/mo Pro | Digital human โ |
| Video gen models | OpenRouter | โ | โ | Paid | Various video models |
| Gemini video understanding | ZenMux | โ | โ | Free/Paid | Gemini video capability |
๐ Embedding Models
Text embedding models for semantic search and similarity
| Model | Provider | Dimensions | Context | Price Tier | Notes |
|---|---|---|---|---|---|
| Qwen3-Embedding-8B-TEE | Chutes | โ | โ | $20/mo | ๐ TEE embeddings |
| multilingual-e5-base | Infermatic | 768 | โ | $20/mo | Multilingual embeddings |