Skip to content

Kimi K2 Thinking

Kimi K2 Thinking adds extended chain-of-thought (CoT) reasoning to the K2 architecture, supporting many sequential tool calls for agentic workflows through AI Gateway.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'moonshotai/kimi-k2-thinking',
prompt: 'Why is the sky blue?'
})

More models by Moonshot AI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
256K
1.6s
47tps
$0.95/M$4.00/M
Read:$0.19/M
Write:
+3
moonshotai logo
06/12/2026
262K
0.3s
183tps
$0.95/M$4.00/M
Read:$0.16/M
Write:
+3
baseten logo
fireworks logo
moonshotai logo
+2
04/20/2026
262K
0.2s
72tps
$0.60/M$3.00/M
Read:$0.1/M
Write:
+2
baseten logo
bedrock logo
fireworks logo
+2
01/26/2026
131K
1.2s
26tps
$0.57/M$2.30/M
novita logo
09/05/2025