Moonshot Kimi API logo

Moonshot Kimi API

Moonshot Kimi API

Moonshot Kimi API provides ultra-long context Chinese LLM — Kimi supports 200K-1M tokens context, especially strong at long document analysis.

Visit site ↗Documentation ↗Health checked 12h ago
Use it when

Earliest in China to do ultra-long context LLM (200K-1M tokens)

Watch for

Ultra-long context isn't cheap

First check

Sign up at platform.moonshot.cn for API key. OpenAI-compatible: POST /v1/chat/completions with model: "moonshot-v1-128k" + messages.

Auth
api_key
CORS
?
HTTPS
Yes
Signup
?
Latency
2166 ms
Protocol
REST
Pricing
paid
Full pricing breakdown 2026
Per-model rates, free tier, GPT-4 comparison. Verified May 14, 2026.

Uptime · 30-day window

Probes: 30Uptime: 70%Avg latency: 2610ms
01

About this API

Moonshot AI is an AI company founded 2023 in Beijing from Tsinghua alums, main product Kimi LLM. Differentiator is "ultra-long context" — early 2024 first to launch 1M tokens context (equivalent to a book or 100k lines of code in one prompt), enabling scenarios like long document analysis, entire codebase understanding. Kimi web app is popular among Chinese young users, hailed as "one of China's ChatGPT alternatives". API design is OpenAI-compatible (just change base_url); low switching cost. Same first-tier China LLM as Qwen / GLM / DeepSeek, differentiated in long context. But since late 2024, Qwen-Long, GPT-4o caught up to 1M context; Moonshot's differentiation narrowed, pivoting to explore AI agents and new directions.

02

What you can build

  • 1Long document analysis (academic papers, legal contracts, full book QA)
  • 2Chinese LLM apps
  • 3Research assistants (multiple PDFs at once)
  • 4Codebase understanding (entire repo in context)
03

Strengths & limitations

Strengths

  • Earliest in China to do ultra-long context LLM (200K-1M tokens)
  • Top-tier Chinese quality among China LLMs
  • Kimi web app has large consumer user base (brand effect)

Limitations

  • Ultra-long context isn't cheap
  • Advanced features like function calling less mature than GPT-4
  • Mainly Chinese; English scenarios weaker than GPT-4
04

Official quickstart

Read the official quickstart at platform.moonshot.cn.

05

Getting started

Sign up at platform.moonshot.cn for API key. OpenAI-compatible: POST /v1/chat/completions with model: "moonshot-v1-128k" + messages.

06

FAQ

Is 1M context actually usable?+

Yes but expensive and slow. Regular scenarios suffice with 128k; 1M suits special scenarios fitting entire books / codebases.

07

Technical details

CORS: ?HTTPS: YesSignup: ?Open source: No
Auth type
api_key
Pricing
paid
Rate limit
RPM/TPM by account tier; request higher in console.
Free tier quota
No persistent free tier. New accounts receive ¥15 trial credit (≈500K Kimi tokens on the standard 8K-context model).
Protocols
REST
SDKs
python, javascript
Response time
2166 ms
Last health check
6/26/2026, 6:23:48 AM
08

Tags