Kimi K2.7 Code
Moonshot AI's open-weight coding model (June 2026): a 1T-parameter MoE (32B active) that uses ~30% fewer reasoning tokens than K2.6 and posts strong tool-use scores. Input cache-hit as low as $0.19/1M; third-party hosts (OpenRouter/DeepInfra) ~$0.74 in / $3.50 out. Modified-MIT open weights, deployable via vLLM/SGLang. Benchmarks are vendor-reported — verify.
Kimi K2.7 Code strengths
- Open weights (Modified MIT)
- Repository-scale agentic coding
- ~30% fewer reasoning tokens vs K2.6
- Strong tool-use (MCP)
- Self-hostable (vLLM/SGLang)
Pricing & context
| Context window | 256K tokens |
| Input price /1M | $0.95 |
| Output price /1M | $4.00 |
| Modalities | text, image, video |
Cost guide: a typical call of ~10K input + 2K output tokens runs roughly $0.95 × 0.01 + $4.00 × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.
When to choose Kimi K2.7 Code
Kimi K2.7 Code is best for Repository-scale refactoring and long multi-turn agentic coding with an open-weight, cost-efficient model. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.
Kimi K2.7 Code FAQ
How much does Kimi K2.7 Code cost?
Kimi K2.7 Code is priced at $0.95 per 1M input tokens and $4.00 per 1M output tokens (public API list price), with a 256K tokens context window.
What is Kimi K2.7 Code best for?
Kimi K2.7 Code by Moonshot AI is best for Repository-scale refactoring and long multi-turn agentic coding with an open-weight, cost-efficient model.
Is Kimi K2.7 Code multimodal?
Kimi K2.7 Code supports text, image, video.
Other models
All models →| 01 | GPT-5.5 | OpenAI | $5.00 | → |
| 02 | GPT-5.4 | OpenAI | $2.50 | → |
| 03 | GPT-5.4 mini | OpenAI | $0.75 | → |
| 04 | Claude Opus 4.8 | Anthropic | $5.00 | → |