๐Ÿ“ฑ

BytePlus

Speed: โšก 15โ€“84 tok/s
Models: 22+ (9 Coding + 7 General + 6 Media)
Price: Free 500K + $9/mo Pro
Status: โœ… Verified & Working
Avg Speed
48 tok/s avg
Models
22+
Price
Free + $9/mo
Best For
IDE Coding Agents

๐Ÿ’ฐ Plan & Pricing

โœ…
Free Tier โ€” 500K Tokens/LLM, 2M/Vision, 200 Images
Each LLM model gets 500K free tokens. Vision models get 2M tokens. 200 free 4K image generations via Seedream. No credit card required to start.
โœ…
ArkCode Pro โ€” $9/mo Coding Plan (Verified โœ…)
Full access to all 9 Coding Plan models including dola-seed-2.0-lite (84 tok/s), dola-seed-2.0-pro (deep-thinking), and ark-code-latest (intelligent routing). Anthropic-compatible endpoint for Claude Code, Cursor, Hermes Agent, and other IDEs.

๐Ÿ”‘ API Key

ark-add3...297be

๐ŸŒ Endpoints

https://ark.ap-southeast.bytepluses.com/api/coding/v3

Coding Plan (OpenAI-compatible) โ€” use this for Coding Plan quota

https://ark.ap-southeast.bytepluses.com/api/coding

Coding Plan (Anthropic-compatible) โ€” for Claude Code, Hermes Agent, etc.

https://ark.ap-southeast.bytepluses.com/api/v3/

General API โ€” โš ๏ธ requires ep-xxx endpoint, incurs additional charges

๐Ÿ“ฆ Coding Plan Models $9/mo ยท All โœ… Verified

ModelSpeed (tok/s)TypeNotes
dola-seed-2.0-liteโšก 84 tok/sCodingโšก Fastest
dola-seed-2.0-codeโšก 80 tok/sCodingIDE-optimized
dola-seed-2.0-proโšก 50 tok/sReasoningDeep-thinking multimodal
glm-4.7โšก 71 tok/sChatGLM 4.7
bytedance-seed-codeโšก 63 tok/sCodingByteDance Seed
glm-5.1โšก 30 tok/sChatGLM 5.1
kimi-k2.5โšก 26 tok/sChatMoonshot Kimi
ark-code-latestโšก 15 tok/sAutoIntelligent routing
gpt-oss-120bโšก 15 tok/sChatOpen-source GPT

๐ŸŒ General API Models

ModelTypeContextNotes
Doubao-Pro-32KLLM32KDoubao standard
Doubao-Pro-128KLLM128KDoubao long context
Doubao-Lite-32KLLM32KFast & cheap
Doubao-Lite-128KLLM128KFast cheap long context
Dola-Seed-2.0-proMultimodalโ€”Deep-thinking
Dola-Seed-2.0-CodeCodingโ€”IDE-agent optimized
DeepSeek-V3.2LLMโ€”Reasoning

General API models require dedicated inference endpoints (ep-xxx). Create them in the BytePlus console under ModelArk.

๐ŸŽจ Image & Video Models

ModelTypeNotes
Dola-Seedream-5.0-liteImage Gen4K output, text+image input, batch generation
Dreamina-Seedance-2.0Video GenMulti-modal video generation
Dreamina-Seedance-2.0-fastVideo GenFast video generation

๐Ÿ”ฌ Other Models

ModelTypeNotes
OmniHumanDigital HumanFilm-grade from photo + audio + text
Seed SpeechTTSLarge model voice synthesis

๐Ÿ”ง Configuration โ€” Claude Code / Hermes IDE

Use these environment variables for Anthropic-compatible IDEs (Claude Code, Cursor, Hermes Agent, etc.)

ANTHROPIC_AUTH_TOKEN=ark-add3...297be
ANTHROPIC_BASE_URL=https://ark.ap-southeast.bytepluses.com/api/coding
ANTHROPIC_MODEL=ark-code-latest

Supported IDEs: Claude Code, Cursor, OpenCode, Cline, Roo Code, TRAE, Kilo Code, Hermes Agent

๐Ÿ”ง Configuration โ€” OpenAI Style

For tools using the OpenAI SDK format

base_url=https://ark.ap-southeast.bytepluses.com/api/coding/v3
api_key=ark-add3...297be

๐Ÿ’ป cURL Example

curl -X POST https://ark.ap-southeast.bytepluses.com/api/coding/v3/chat/completions \
  -H "Authorization: Bearer ***" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "ark-code-latest",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

๐Ÿ Python Example

from openai import OpenAI

client = OpenAI(
    api_key="ark-add3...297be",
    base_url="https://ark.ap-southeast.bytepluses.com/api/coding/v3"
)

response = client.chat.completions.create(
    model="ark-code-latest",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

๐Ÿšจ Error Reference

ErrorMeaningFix
401 โ€” key doesn't existWrong domainUse bytepluses.com โ€” NOT volces.com
400 โ€” InvalidSubscriptionCoding Plan not activatedSubscribe to ArkCode Pro ($9/mo) at Volcengine console
404 โ€” InvalidEndpointOrModelWrong endpoint for model typeGeneral API models need an ep-xxx endpoint, not /api/coding

๐Ÿ–ฅ๏ธ Console & Management

Volcengine Coding Plan:
https://console.volcengine.com/ark/region:ark+cn-beijing/openManagement?OpenModelVisible=false&tab=CodingPlan

BytePlus Console:
https://console.byteplus.com โ†’ ModelArk

Use Volcengine console to activate/manage Coding Plan subscription. Use BytePlus console for General API endpoints and model management.

โš ๏ธ Pitfalls & Notes

๐Ÿšจ
CRITICAL: Use bytepluses.com domain! โ€” Do NOT use volces.com โ€” it returns 401 for the same key. Always use bytepluses.com.
๐Ÿšจ
Don't use /api/v3 with Coding Plan key! โ€” Using the General API endpoint /api/v3 with a Coding Plan key incurs ADDITIONAL charges, not plan quota. Always use /api/coding/v3 for Coding Plan.
โœ…
Coding Plan Verified & Working โ€” All 9 Coding Plan models tested and confirmed operational (May 2026). The $9/mo ArkCode Pro subscription is active and functional.
โ„น๏ธ
Coding Plan Supported IDEs โ€” Claude Code, Cursor, OpenCode, Cline, Roo Code, TRAE, Kilo Code, Hermes Agent
โ„น๏ธ
Dola-Seed-2.0-Code โ€” A unique coding-optimized model designed specifically for IDE workflows.
โ„น๏ธ
General API Endpoints โ€” Doubao, DeepSeek, and other General API models require creating dedicated ep-xxx inference endpoints in the console before use.

๐Ÿท๏ธ Categories

Chat Coding Vision Reasoning Image Gen Video Gen TTS