๐Ÿชถ

Featherless

Speed: โšก 9 tok/s avg
Models: 16K+
Price: Scale ร—3
Status: โš ๏ธ Very Slow
Avg Speed
9 tok/s
Models
16K+
Price
Scale ร—3
Best For
Obscure Models

๐Ÿ’ฐ Plan & Pricing

โš ๏ธ
Scale ร—3 Pricing โ€” Very Slow
Scale ร—3 pricing means costs multiply quickly. Very slow at 9 tok/s with queue times of 30-50s common. Best as last resort for models not available elsewhere.

๐Ÿ”‘ API Key

rc_96ff...c073b

๐ŸŒ Endpoint

https://api.featherless.ai/v1/chat/completions

๐Ÿ“ฆ Models (16K+ โ€” Key Models)

ModelSpeedCategoryNotes
llama-4-maverickโšก 10 tok/sChatLlama 4 Maverick
deepseek-r1โšก 8 tok/sReasoningDeepSeek R1
qwen3-235bโšก 9 tok/sChat/CodingQwen3 235B
deepseek-v3โšก 8 tok/sChatDeepSeek V3
llama-3.1-70b-instructโšก 10 tok/sChatLlama 3.1 70B
llama-3.1-8b-instructโšก 12 tok/sChatLlama 3.1 8B
mistral-7b-instructโšก 11 tok/sChatMistral 7B
qwen2.5-coder-32b-instructโšก 9 tok/sCodingQwen Coder
various 16K+ modelsโšก ~9 avgAllChat, Coding, Vision, Reasoning, Image Gen, Audio

โš ๏ธ Very slow (9 tok/s avg, 30-50s queue). Best as fallback for obscure models. Categories: Chat, Coding, Vision, Reasoning, Image Gen, Audio.

๐Ÿ’ป cURL Example

curl -X POST https://api.featherless.ai/v1/chat/completions \
  -H "Authorization: Bearer rc_96ff...c073b" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "meta-llama/llama-3-70b-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

๐Ÿ Python Example

from openai import OpenAI

client = OpenAI(
    api_key="rc_96ff...c073b",
    base_url="https://api.featherless.ai/v1"
)

response = client.chat.completions.create(
    model="meta-llama/llama-3-70b-instruct",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

โš ๏ธ Pitfalls & Notes

๐Ÿšจ
Very Slow โ€” 9 tok/s avg with 30-50 second queue times. Expect long waits before responses begin.
โš ๏ธ
Scale ร—3 Pricing โ€” Costs multiply quickly. Scale ร—3 pricing means you pay 3ร— the base model cost.
โ„น๏ธ
Largest Model Catalog โ€” 16,000+ models is the largest catalog of any provider.
โ„น๏ธ
Fallback for Obscure Models โ€” Useful as fallback for obscure/niche models not available elsewhere.

๐Ÿท๏ธ Categories

Chat Long-tail Models Research