๐Ÿ’ฌ Chat Models

General-purpose conversational AI โ€” all providers offer chat models

ModelProviderSpeedContextPrice TierNotes
llama-3.3-70b-versatileGroqโšก 212 tok/s128KFreeBest free fast chat
openai/gpt-oss-120bGroqโšก 215 tok/s128KFreeGPT-OSS on LPU
groq/compoundGroqโšก 289 tok/s128KFreeAgentic compound model
llama-3.1-8b-instantGroqโšก 300+ tok/s128KFreeUltra-fast small model
qwen/qwen3-32bGroqโšก 295 tok/s128KFreeQwen3 on LPU
deepseek-r1-distill-llama-70bGroqโšก 180 tok/s128KFreeReasoning distill
mixtral-8x7b-32768Groqโšก 190 tok/s32KFreeMoE architecture
gemma2-9b-itGroqโšก 250 tok/s8KFreeGoogle Gemma2
step-3-chatStepFunโšก 93 tok/sโ€”$9/moStrong Chinese/English
step-3-miniStepFunโšก Fastโ€”$9/moSmall/fast variant
glm-5OpenCodeโšก 52 tok/sโ€”$10/moGLM-5 champion
glm-5.1OpenCodeโšก 52 tok/sโ€”$10/moImproved GLM
deepseek-v4-flashOpenCodeโšก 58 tok/sโ€”$10/moFast DeepSeek
minimax-m2.5OpenCodeโ€”โ€”$10/moMiniMax model
minimax-m2.7OpenCodeโ€”โ€”$10/moMiniMax v2.7
kimi-k2.5OpenCodeโ€”โ€”$10/moKimi latest
qwen3-235bOpenCodeโ€”โ€”$10/moQwen3 reasoning
gpt-oss:120bOllamaโšก 71 tok/sโ€”UnlimitedLarge open model
glm-5.1Ollamaโšก 63 tok/sโ€”UnlimitedGLM-5.1
deepseek-v4-proOllamaโšก 50 tok/sโ€”UnlimitedDeepSeek reasoner
kimi-k2.5Ollamaโšก 58 tok/sโ€”UnlimitedKimi latest
glm-4.7Ollamaโšก 51 tok/sโ€”UnlimitedGLM-4.7
deepseek-v4-flashOllamaโšก 44 tok/sโ€”UnlimitedFlash variant
nemotron-3-superOllamaโšก 102 tok/sโ€”Unlimited๐Ÿ†• Exclusive
minimax-m2.1Ollamaโšก 83 tok/sโ€”Unlimited๐Ÿ†• Exclusive
qwen3-next:80bOllamaโšก 82 tok/sโ€”Unlimited๐Ÿ†• Exclusive
qwen3.5:397bOllamaโšก 72 tok/sโ€”Unlimited๐Ÿ†• Exclusive
kimi-k2.6Ollama๐Ÿข 18 tok/sโ€”UnlimitedKimi v2.6
claude-opus-4-6Veniceโ€”200KUncensoredUncensored Claude Opus
claude-sonnet-4-5Veniceโ€”200KUncensoredUncensored Claude Sonnet
kimi-k2.5Veniceโšก 86 tok/sโ€”UncensoredUncensored Kimi
kimi-k2.6Veniceโšก 93 tok/sโ€”UncensoredUncensored Kimi v2.6
grok-4Veniceโšก 102 tok/sโ€”Uncensored๐Ÿ†• Exclusive uncensored
deepseek-v4-proVeniceโšก 50 tok/sโ€”UncensoredUncensored DeepSeek
anthropic/claude-3.5-sonnetOpenRouterโ€”200KFree (:free)Free Claude tier
meta-llama/llama-3.1-8bOpenRouterโ€”128KFree (:free)Free Llama tier
google/gemini-2.5-proOpenRouterโšก 67 tok/s1MPaidBest Gemini model
google/gemini-2.5-flashOpenRouterโšก 74 tok/s1MFree/PaidFast Gemini
+320 more modelsOpenRouterโ€”โ€”Mixed356 total models
gemini-2.5-proZenMuxโšก 41 tok/s1MFree 700/dBest Gemini coverage
gemini-2.5-flashZenMuxโšก 90 tok/s1MFree 700/dFast Gemini
Qwen/Qwen3-32B-TEEChutesโšก 192 tok/sโ€”$20/mo๐Ÿ”’ TEE privacy
zai-org/GLM-5.1-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE privacy
moonshotai/Kimi-K2.5-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE privacy
MiniMaxAI/MiniMax-M2.5-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE privacy
zai-org/GLM-5-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE privacy
unsloth/Mistral-Nemo-2407-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE privacy
zai-org/GLM-5-TurboChutesโ€”โ€”$20/moFast GLM variant
claude-opus-4-6ORBITโšก 29 tok/s200KFree (2B/mo)Claude-only, free
claude-sonnet-4-5ORBITโšก 25 tok/s200KFree (2B/mo)Latest Sonnet
claude-3.5-sonnetORBITโšก 28 tok/s200KFree (2B/mo)Stable Claude
claude-3-opusORBIT๐Ÿข 22 tok/s200KFree (2B/mo)Legacy Opus
glm-4.7BytePlusโšก 71 tok/sโ€”$9/mo Proโœ… Verified & Working
glm-5.1BytePlusโšก 30 tok/sโ€”$9/mo Proโœ… Verified & Working
kimi-k2.5BytePlusโšก 26 tok/sโ€”$9/mo Proโœ… Verified & Working
gpt-oss-120bBytePlus๐Ÿข 15 tok/sโ€”$9/mo ProLarge open model
ark-code-latestBytePlus๐Ÿข 15 tok/sโ€”$9/mo ProAuto-code agent
Doubao-Pro-32KBytePlusโ€”32K$9/mo ProGeneral API
Doubao-Pro-128KBytePlusโ€”128K$9/mo ProGeneral API
Doubao-Lite-32KBytePlusโ€”32K$9/mo ProGeneral API
Doubao-Lite-128KBytePlusโ€”128K$9/mo ProGeneral API
DeepSeek-V3.2BytePlusโ€”โ€”$9/mo ProGeneral API

๐Ÿ’ป Coding Models

Models optimized for code generation, completion, and debugging

ModelProviderSpeedContextPrice TierNotes
dola-seed-2.0-codeBytePlusโšก 80 tok/sโ€”$9/mo Pro๐Ÿฅ‡ Best coding model โœ…
dola-seed-2.0-liteBytePlusโšก 84 tok/sโ€”$9/mo ProFastest coder โœ…
bytedance-seed-codeBytePlusโšก 63 tok/sโ€”$9/mo ProStrong code completion โœ…
ark-code-latestBytePlus๐Ÿข 15 tok/sโ€”$9/mo ProAuto-code agent
qwen2.5-coder-32bGroqโšก 212 tok/s128KFree๐Ÿฅ‡ Best free coding
Qwen/Qwen2.5-Coder-32B-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE privacy
qwen3-coder-nextOllamaโšก 52 tok/sโ€”Unlimited๐Ÿ†• Exclusive
devstral-2:123bOllama๐Ÿข 36 tok/sโ€”Unlimited๐Ÿ†• Exclusive code model
deepseek-v4-proOpenCodeโšก 52 tok/sโ€”$10/moStrong at code tasks
codestral-latestOpenRouterโ€”โ€”PaidMistral code model

๐Ÿง  Reasoning Models

Thinking/reasoning models that show chain-of-thought

ModelProviderSpeedContextPrice TierNotes
dola-seed-2.0-proBytePlusโšก 50 tok/sโ€”$9/mo Pro๐Ÿฅ‡ Best reasoning โœ…
deepseek-v4-proOpenCodeโšก 52 tok/sโ€”$10/moDeepSeek reasoning
deepseek-r1-distill-llama-70bGroqโšก 180 tok/s128KFreeFast free reasoning
step-3-reasoningStepFunโšก 93 tok/sโ€”$9/moStepFun reasoning
Qwen/Qwen3-235B-ThinkingChutesโšก 40 tok/sโ€”$20/moTEE thinking mode
deepseek-ai/DeepSeek-V3.2-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE DeepSeek reason
Qwen/Qwen3-32B-TEEChutesโšก 192 tok/sโ€”$20/mo๐Ÿ”’ TEE privacy
deepseek-v3.2Ollama๐Ÿข 23 tok/sโ€”UnlimitedDeepSeek V3.2 reasoning
claude-opus-4-6ORBITโšก 29 tok/s200KFree (2B/mo)Claude reasoning
qwen3-235bOpenCodeโ€”โ€”$10/moQwen3 reasoning
DeepSeek-V3.2BytePlusโ€”โ€”$9/mo ProDeepSeek reasoning
o3Veniceโ€”โ€”UncensoredUncensored reasoning

๐Ÿ‘๏ธ Vision / OCR Models

Multimodal models that can process images and text

ModelProviderSpeedContextPrice TierNotes
step-3-vlStepFunโšก 93 tok/sโ€”$9/moChinese/English OCR
glm-5-visionOpenCodeโšก 52 tok/sโ€”$10/moGLM vision
minimax-vl-01OpenCodeโ€”โ€”$10/moMiniMax vision
gemini-3-flash-previewOllamaโšก 47 tok/sโ€”Unlimited๐Ÿ†• Exclusive
llama-3.2-90b-visionGroqโšก Fastโ€”FreeLlama vision at speed
gemini-2.5-proZenMuxโšก 41 tok/s1MFree 700/dBest Gemini for vision
gemini-2.5-flashZenMuxโšก 90 tok/s1MFree 700/dFast Gemini vision
google/gemini-2.5-proOpenRouterโšก 67 tok/s1MPaidGemini Pro vision
google/gemini-2.5-flashOpenRouterโšก 74 tok/s1MFree/PaidGemini Flash vision
moonshotai/Kimi-K2.5-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE, Vision+Chat
Qwen/Qwen3.5-397B-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE, Vision+Chat
Qwen/Qwen3.6-27B-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE, Vision
moonshotai/Kimi-K2.6-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE, Vision+Video
google/gemma-4-31B-turbo-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE, Gemma vision

๐ŸŽค Speech-to-Text (STT) Models

Transcribe audio to text

ModelProviderSpeedContextPrice TierNotes
whisper-large-v3Groqโšก Ultra-fastโ€”Free๐Ÿฅ‡ Best free STT, on LPU
whisper-large-v3-turboGroqโšก Ultra-fastโ€”FreeFaster Whisper variant
step-asrStepFunโ€”โ€”$9/moStepFun ASR engine
step-asrInfermaticโ€”โ€”$20/moVia Infermatic API

๐Ÿ”Š Text-to-Speech (TTS) Models

Generate speech from text

ModelProviderSpeedContextPrice TierNotes
step-ttsStepFunโ€”โ€”$9/moStepFun TTS engine
kokoro-82mInfermaticโ€”โ€”$20/moKokoro TTS engine
Kokoro-82MChutesโ€”โ€”$20/moKokoro TTS on Chutes
Seed SpeechBytePlusโ€”โ€”$9/mo ProBytePlus TTS โœ…

๐Ÿ–ผ๏ธ Image Generation Models

Diffusion and generative image models

ModelProviderTypeContextPrice TierNotes
FLUX.1-schnellChutesDiffusionโ€”$20/moFast FLUX generation
JuggernautXL-RagnarokChutesDiffusionโ€”$20/moCinematic style
DreamShaper-XLChutesDiffusionโ€”$20/moArtistic style
hunyuan-image-3ChutesDiffusionโ€”$20/moPhotorealistic
flux-1VeniceDiffusionโ€”UncensoredUncensored image gen
Various image modelsArliAIDiffusionโ€”From $10/moImage gen API
SDXL / Stable modelsFeatherlessDiffusionโ€”Scale ร—3Massive catalog, slow
Seedream-5.0-liteBytePlusDiffusion4K$9/mo Pro4K image gen โœ…

๐ŸŽฌ Video Generation

Models capable of generating or understanding video content

ModelProviderSpeedContextPrice TierNotes
Seedance-2.0BytePlusโ€”โ€”$9/mo ProVideo generation โœ…
Seedance-2.0-fastBytePlusโ€”โ€”$9/mo ProFast video gen variant โœ…
OmniHumanBytePlusโ€”โ€”$9/mo ProDigital human โœ…
Video gen modelsOpenRouterโ€”โ€”PaidVarious video models
Gemini video understandingZenMuxโ€”โ€”Free/PaidGemini video capability

๐Ÿ“ Embedding Models

Text embedding models for semantic search and similarity

ModelProviderDimensionsContextPrice TierNotes
Qwen3-Embedding-8B-TEEChutesโ€”โ€”$20/mo๐Ÿ”’ TEE embeddings
multilingual-e5-baseInfermatic768โ€”$20/moMultilingual embeddings