Doubao LLM API

Doubao is ByteDance's self-developed LLM series, served via Volcengine Ark — Pro/Lite tiers covering dialog, long-context, vision, and embeddings.

Visit site ↗Documentation ↗Health checked 3h ago

Use it when

Aggressively competitive pricing (initiator of the 2024 Chinese LLM price war)

Watch for

No open-source release (unlike Qwen / Baichuan)

First check

Sign up at volcengine.com → enable Doubao models in the Volcengine Ark console → create an inference endpoint (endpoint_id) → POST /api/v3/chat/completions (OpenAI-compatible).

Auth

api_key

CORS

HTTPS

Yes

Signup

Latency

1343 ms

Protocol

REST

Pricing

paid

Full pricing breakdown 2026

Per-model rates, free tier, GPT-4 comparison. Verified May 14, 2026.

Uptime · 30-day window

Probes: 30Uptime: 97%Avg latency: 1872ms

About this API

Doubao is ByteDance's self-developed LLM series, commercialized in 2024 and offered via the Volcengine Ark platform. The model matrix spans general dialog (Doubao-pro / Doubao-lite), 128K long-context (Doubao-pro-128k), vision (Doubao-vision), embeddings (Doubao-embedding), function calling, and image generation (Doubao-Seedream). Doubao is famous for initiating the 2024 Chinese LLM 'price war', dramatically lowering API prices and forcing Qwen / Zhipu / Baichuan to follow. Trained by ByteDance's internal AI Lab and battle-tested in the Doubao app, Douyin search, and CapCut AI. API access uses an 'inference endpoint' (endpoint_id) pattern — each model version maps to an endpoint for easy version switching. A natural fit for mainland-China developers building cost-sensitive Chinese LLM apps, especially enterprise scenarios needing low latency and compliance.

What you can build

1Chinese LLM apps (very low cost; battle-tested at ByteDance's own scale)
2Long-context processing (Doubao-pro-128k)
3Visual understanding (Doubao-vision)
4Integration scenarios within the Douyin / Toutiao ecosystem

Strengths & limitations

Strengths

Aggressively competitive pricing (initiator of the 2024 Chinese LLM price war)
Battle-tested at ByteDance scale (Doubao app, Douyin search, etc.)
Enterprise-grade SLA and compliance support via Volcengine

Limitations

No open-source release (unlike Qwen / Baichuan)
Overseas developers need a Volcengine account and verification
Some advanced models require enterprise vetting

Official quickstart

Read the official quickstart at www.volcengine.com.

Getting started

Sign up at volcengine.com → enable Doubao models in the Volcengine Ark console → create an inference endpoint (endpoint_id) → POST /api/v3/chat/completions (OpenAI-compatible).

FAQ

Doubao vs. Qwen — which to pick?+

Doubao is slightly cheaper; capability is comparable. Choose Doubao for ByteDance-ecosystem scenarios (Douyin etc.), Qwen for open-source breadth and multilingual reach.

Why is an endpoint_id required before calling?+

Volcengine Ark abstracts model versions behind endpoints, enabling gradual rollout — an externalization of ByteDance's internal ML-engineering practice, making multi-model multi-version management cleaner.

Technical details

CORS: ?HTTPS: YesSignup: ?Open source: No

Auth type: api_key
Pricing: paid
Rate limit: Per-endpoint and per-model; enterprise verification scales limits.
Free tier quota: Doubao Pro / Lite tiers each include 500K free tokens valid 30 days on signup. Vision and image endpoints carry separate quotas.
Protocols: REST
SDKs: python, javascript, java, go
Response time: 1343 ms
Last health check: 6/29/2026, 6:32:37 AM