๐Ÿค–

ArliAI

Speed: โšก 16 tok/s avg
Models: 54
Price: $10+/mo
Status: โœ… Online
Avg Speed
16 tok/s
Models
54
Price
$10+/mo
Best For
Derestricted & Image Gen

๐Ÿ’ฐ Plan & Pricing

โ„น๏ธ
From $10/mo
Derestricted/uncensored model variants. Also provides image generation API.

๐Ÿ”‘ API Key

44c327...cdff9

๐ŸŒ Endpoint

https://api.arliai.com/v1/chat/completions

๐Ÿ“ฆ Models (54 total โ€” derestricted)

ModelSpeedCategoryNotes
deepseek-r1-0528โšก 16 tok/sReasoning๐Ÿ”ฅ Derestricted R1
qwen3-235bโšก 15 tok/sChat/CodingDerestricted Qwen3
llama-4-maverickโšก 14 tok/sChatDerestricted Llama 4
meta-llama/llama-3.1-70b-instructโšก 16 tok/sChatDerestricted Llama
meta-llama/llama-3.1-8b-instructโšก 18 tok/sChatSmall derestricted Llama
mistralai/mistral-7b-instructโšก 17 tok/sChatDerestricted Mistral
deepseek-v3โšก 14 tok/sChatDerestricted DS-V3
qwen2.5-coder-32b-instructโšก 15 tok/sCodingDerestricted Qwen Coder
various fine-tunesโšก variesChat/CodingMany community fine-tunes

๐Ÿ”ฅ Image Generation API also available โ€” + ~45 more derestricted model variants

โš ๏ธ Speed Note

โš ๏ธ
16 tok/s average โ€” ArliAI is significantly slower than other providers. All models average ~16 tok/s with occasional spikes. Best for derestricted/uncensored access when speed is not critical.
curl -X POST https://api.arliai.com/v1/chat/completions \
  -H "Authorization: Bearer ***" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "meta-llama/llama-3.1-70b-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

๐Ÿ Python Example

from openai import OpenAI

client = OpenAI(
    api_key="44c327...cdff9",
    base_url="https://api.arliai.com/v1"
)

response = client.chat.completions.create(
    model="meta-llama/llama-3.1-70b-instruct",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

โš ๏ธ Pitfalls & Notes

โ„น๏ธ
Offers derestricted/uncensored model variants โ€” ArliAI provides uncensored versions of popular models for unrestricted chat.
โ„น๏ธ
Has image generation endpoint โ€” ArliAI also offers an image generation API alongside text completions.
โš ๏ธ
16 tok/s is slower than most providers โ€” Average speed is below the typical range; expect longer response times on large outputs.
โ„น๏ธ
54+ models with uncensored chat and image gen โ€” Wide model selection covering both text and image generation use cases.

๐Ÿท๏ธ Categories

Chat Image Gen Uncensored