๐ฑ BytePlus
โ Verified & Working โ Coding Plan $9/mo, 9 models + General API
BytePlus
๐ฐ Plan & Pricing
Each LLM model gets 500K free tokens. Vision models get 2M tokens. 200 free 4K image generations via Seedream. No credit card required to start.
Full access to all 9 Coding Plan models including dola-seed-2.0-lite (84 tok/s), dola-seed-2.0-pro (deep-thinking), and ark-code-latest (intelligent routing). Anthropic-compatible endpoint for Claude Code, Cursor, Hermes Agent, and other IDEs.
๐ API Key
๐ Endpoints
https://ark.ap-southeast.bytepluses.com/api/coding/v3
Coding Plan (OpenAI-compatible) โ use this for Coding Plan quota
https://ark.ap-southeast.bytepluses.com/api/coding
Coding Plan (Anthropic-compatible) โ for Claude Code, Hermes Agent, etc.
https://ark.ap-southeast.bytepluses.com/api/v3/
General API โ โ ๏ธ requires ep-xxx endpoint, incurs additional charges
๐ฆ Coding Plan Models $9/mo ยท All โ Verified
| Model | Speed (tok/s) | Type | Notes |
|---|---|---|---|
| dola-seed-2.0-lite | โก 84 tok/s | Coding | โก Fastest |
| dola-seed-2.0-code | โก 80 tok/s | Coding | IDE-optimized |
| dola-seed-2.0-pro | โก 50 tok/s | Reasoning | Deep-thinking multimodal |
| glm-4.7 | โก 71 tok/s | Chat | GLM 4.7 |
| bytedance-seed-code | โก 63 tok/s | Coding | ByteDance Seed |
| glm-5.1 | โก 30 tok/s | Chat | GLM 5.1 |
| kimi-k2.5 | โก 26 tok/s | Chat | Moonshot Kimi |
| ark-code-latest | โก 15 tok/s | Auto | Intelligent routing |
| gpt-oss-120b | โก 15 tok/s | Chat | Open-source GPT |
๐ General API Models Requires ep-xxx Endpoint
| Model | Type | Context | Notes |
|---|---|---|---|
| Doubao-Pro-32K | LLM | 32K | Doubao standard |
| Doubao-Pro-128K | LLM | 128K | Doubao long context |
| Doubao-Lite-32K | LLM | 32K | Fast & cheap |
| Doubao-Lite-128K | LLM | 128K | Fast cheap long context |
| Dola-Seed-2.0-pro | Multimodal | โ | Deep-thinking |
| Dola-Seed-2.0-Code | Coding | โ | IDE-agent optimized |
| DeepSeek-V3.2 | LLM | โ | Reasoning |
General API models require dedicated inference endpoints (ep-xxx). Create them in the BytePlus console under ModelArk.
๐จ Image & Video Models
| Model | Type | Notes |
|---|---|---|
| Dola-Seedream-5.0-lite | Image Gen | 4K output, text+image input, batch generation |
| Dreamina-Seedance-2.0 | Video Gen | Multi-modal video generation |
| Dreamina-Seedance-2.0-fast | Video Gen | Fast video generation |
๐ฌ Other Models
| Model | Type | Notes |
|---|---|---|
| OmniHuman | Digital Human | Film-grade from photo + audio + text |
| Seed Speech | TTS | Large model voice synthesis |
๐ง Configuration โ Claude Code / Hermes IDE
Use these environment variables for Anthropic-compatible IDEs (Claude Code, Cursor, Hermes Agent, etc.)
ANTHROPIC_AUTH_TOKEN=ark-add3...297be ANTHROPIC_BASE_URL=https://ark.ap-southeast.bytepluses.com/api/coding ANTHROPIC_MODEL=ark-code-latest
Supported IDEs: Claude Code, Cursor, OpenCode, Cline, Roo Code, TRAE, Kilo Code, Hermes Agent
๐ง Configuration โ OpenAI Style
For tools using the OpenAI SDK format
base_url=https://ark.ap-southeast.bytepluses.com/api/coding/v3 api_key=ark-add3...297be
๐ป cURL Example
curl -X POST https://ark.ap-southeast.bytepluses.com/api/coding/v3/chat/completions \
-H "Authorization: Bearer ***" \
-H "Content-Type: application/json" \
-d '{
"model": "ark-code-latest",
"messages": [{"role": "user", "content": "Hello!"}]
}'
๐ Python Example
from openai import OpenAI
client = OpenAI(
api_key="ark-add3...297be",
base_url="https://ark.ap-southeast.bytepluses.com/api/coding/v3"
)
response = client.chat.completions.create(
model="ark-code-latest",
messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
๐จ Error Reference
| Error | Meaning | Fix |
|---|---|---|
| 401 โ key doesn't exist | Wrong domain | Use bytepluses.com โ NOT volces.com |
| 400 โ InvalidSubscription | Coding Plan not activated | Subscribe to ArkCode Pro ($9/mo) at Volcengine console |
| 404 โ InvalidEndpointOrModel | Wrong endpoint for model type | General API models need an ep-xxx endpoint, not /api/coding |
๐ฅ๏ธ Console & Management
Volcengine Coding Plan: https://console.volcengine.com/ark/region:ark+cn-beijing/openManagement?OpenModelVisible=false&tab=CodingPlan BytePlus Console: https://console.byteplus.com โ ModelArk
Use Volcengine console to activate/manage Coding Plan subscription. Use BytePlus console for General API endpoints and model management.
โ ๏ธ Pitfalls & Notes
bytepluses.com./api/v3 with a Coding Plan key incurs ADDITIONAL charges, not plan quota. Always use /api/coding/v3 for Coding Plan.ep-xxx inference endpoints in the console before use.