Doubao LLM API logo

Doubao LLM API

Doubao LLM API

Doubao is ByteDance's self-developed LLM series, served via Volcengine Ark — Pro/Lite tiers covering dialog, long-context, vision, and embeddings.

Use it when

Aggressively competitive pricing (initiator of the 2024 Chinese LLM price war)

Watch for

No open-source release (unlike Qwen / Baichuan)

First check

Sign up at volcengine.com → enable Doubao models in the Volcengine Ark console → create an inference endpoint (endpoint_id) → POST /api/v3/chat/completions (OpenAI-compatible).

Auth
api_key
CORS
?
HTTPS
Yes
Signup
?
Protocol
REST
Pricing
paid
Full pricing breakdown 2026
Per-model rates, free tier, GPT-4 comparison. Verified May 14, 2026.

Uptime · 30-day window

Health history will appear here after the next daily check.
01

About this API

Doubao is ByteDance's self-developed LLM series, commercialized in 2024 and offered via the Volcengine Ark platform. The model matrix spans general dialog (Doubao-pro / Doubao-lite), 128K long-context (Doubao-pro-128k), vision (Doubao-vision), embeddings (Doubao-embedding), function calling, and image generation (Doubao-Seedream). Doubao is famous for initiating the 2024 Chinese LLM 'price war', dramatically lowering API prices and forcing Qwen / Zhipu / Baichuan to follow. Trained by ByteDance's internal AI Lab and battle-tested in the Doubao app, Douyin search, and CapCut AI. API access uses an 'inference endpoint' (endpoint_id) pattern — each model version maps to an endpoint for easy version switching. A natural fit for mainland-China developers building cost-sensitive Chinese LLM apps, especially enterprise scenarios needing low latency and compliance.

02

What you can build

  • 1Chinese LLM apps (very low cost; battle-tested at ByteDance's own scale)
  • 2Long-context processing (Doubao-pro-128k)
  • 3Visual understanding (Doubao-vision)
  • 4Integration scenarios within the Douyin / Toutiao ecosystem
03

Strengths & limitations

Strengths

  • Aggressively competitive pricing (initiator of the 2024 Chinese LLM price war)
  • Battle-tested at ByteDance scale (Doubao app, Douyin search, etc.)
  • Enterprise-grade SLA and compliance support via Volcengine

Limitations

  • No open-source release (unlike Qwen / Baichuan)
  • Overseas developers need a Volcengine account and verification
  • Some advanced models require enterprise vetting
04

Example request

Generic template — replace <endpoint> with the real path from the docs.
curl https://www.volcengine.com/product/doubao/<endpoint> \
  -H "Authorization: Bearer $API_KEY"
# Some providers use X-Api-Key instead — verify in the docs.
05

Getting started

Sign up at volcengine.com → enable Doubao models in the Volcengine Ark console → create an inference endpoint (endpoint_id) → POST /api/v3/chat/completions (OpenAI-compatible).

06

FAQ

Doubao vs. Qwen — which to pick?+

Doubao is slightly cheaper; capability is comparable. Choose Doubao for ByteDance-ecosystem scenarios (Douyin etc.), Qwen for open-source breadth and multilingual reach.

Why is an endpoint_id required before calling?+

Volcengine Ark abstracts model versions behind endpoints, enabling gradual rollout — an externalization of ByteDance's internal ML-engineering practice, making multi-model multi-version management cleaner.

07

Technical details

CORS: ?HTTPS: YesSignup: ?Open source: No
Auth type
api_key
Pricing
paid
Rate limit
Per-endpoint and per-model; enterprise verification scales limits.
Free tier quota
Doubao Pro / Lite tiers each include 500K free tokens valid 30 days on signup. Vision and image endpoints carry separate quotas.
Protocols
REST
SDKs
python, javascript, java, go
08

Tags