BEAM

Features

Rankings

Pro

Docs

GitHub

LAUNCH APP

Use Moonshot Models in Big-AGI.

Bring your own key: Moonshot's API rates, no markup. Keys and chats stay in your browser. Run Moonshot in parallel with other models, then compare and merge the answers.

Kimi K3

Kimi K2.7 Code

Kimi K2.7 Code Highspeed

Launch Big-AGI

All supported Moonshot models

ModelContextInputOutputReleased

Kimi K3

NEW

Native multimodal flagship (text, image, video inputs) with thinking on by default. 1M context.

$15

Jul 2026

Kimi K2.7 Code

Code-focused multimodal model (text, image, video inputs) with always-on thinking. ~180 tok/s output (up to 260 in short contexts for highspeed). 256K context.

262K

$0.95

Jun 2026

Kimi K2.7 Code Highspeed

High-speed code variant with ~180 tok/s output (up to 260 in short contexts). Native multimodal with always-on thinking. 256K context.

262K

$1.9

Jun 2026

Kimi K2.6

Native multimodal flagship (text, image, video inputs) with thinking and non-thinking modes. Stronger long-form coding, improved instruction compliance and sel…

262K

$0.95

Apr 2026

Kimi K2.5

Supports vision (images/videos), thinking mode, and Agent tasks. 256K context.

262K

$0.6

Jan 2026

V1 8K Vision (Preview)

Legacy vision model with 8K context. Preview variant - use moonshot-v1-vision for production.

$0.2

Jan 2025

V1 32K Vision (Preview)

Legacy vision model with 32K context. Preview variant - use moonshot-v1-vision for production.

33K

Jan 2025

V1 128K Vision (Preview)

Legacy vision model with 128K context. Preview variant - use moonshot-v1-vision for production.

131K

Jan 2025

V1 128K

Legacy V1 model with 128K context. Deprecated - use Kimi K2 Instruct instead.

131K

Feb 2024

V1 32K

Legacy V1 model with 32K context. Deprecated - use Kimi K2 Instruct instead.

33K

Feb 2024

V1 8K

Legacy V1 model with 8K context. Deprecated - use Kimi K2 Instruct instead.

$0.2

Feb 2024

Kimi K3

NEW

Jul 2026

Native multimodal flagship (text, image, video inputs) with thinking on by default. 1M context.

1M · in $3 · out $15

Kimi K2.7 Code

Jun 2026

Code-focused multimodal model (text, image, video inputs) with always-on thinking. ~180 tok/s output (up to 260 in short contexts for highspeed). 256K context.

262K · in $0.95 · out $4

Kimi K2.7 Code Highspeed

Jun 2026

High-speed code variant with ~180 tok/s output (up to 260 in short contexts). Native multimodal with always-on thinking. 256K context.

262K · in $1.9 · out $8

Kimi K2.6

Apr 2026

Native multimodal flagship (text, image, video inputs) with thinking and non-thinking modes. Stronger long-form coding, improved instruction compliance and sel…

262K · in $0.95 · out $4

Kimi K2.5

Jan 2026

Supports vision (images/videos), thinking mode, and Agent tasks. 256K context.

262K · in $0.6 · out $3

V1 8K Vision (Preview)

Jan 2025

Legacy vision model with 8K context. Preview variant - use moonshot-v1-vision for production.

8K · in $0.2 · out $2

V1 32K Vision (Preview)

Jan 2025

Legacy vision model with 32K context. Preview variant - use moonshot-v1-vision for production.

33K · in $1 · out $3

V1 128K Vision (Preview)

Jan 2025

Legacy vision model with 128K context. Preview variant - use moonshot-v1-vision for production.

131K · in $2 · out $5

V1 128K

Feb 2024

Legacy V1 model with 128K context. Deprecated - use Kimi K2 Instruct instead.

131K · in $2 · out $5

V1 32K

Feb 2024

Legacy V1 model with 32K context. Deprecated - use Kimi K2 Instruct instead.

33K · in $1 · out $3

V1 8K

Feb 2024

Legacy V1 model with 8K context. Deprecated - use Kimi K2 Instruct instead.

8K · in $0.2 · out $2

11 models · sorted by release date · prices in USD per 1M tokens · refreshed every 30 minutesCompare every model across vendors →

Get started in 3 steps

Create an API key at the Moonshot console.

Paste it into Big-AGI's model settings.

Start chatting, or Beam it against other models and fuse the answers.

Running Moonshot in Big-AGI

Add your Moonshot API key and run the Kimi models at Moonshot's own API rates. Big-AGI adds no markup and no intermediary: billing runs directly between you and Moonshot, and your keys stay in your browser.

Your key, your billing. Usage is billed by Moonshot to your account. Big-AGI does not meter or charge for model usage.
Built for long context and tools. Kimi holds a very large context window and drives tool calls well, so it keeps up on agentic work and whole-codebase reads that shorter-context models choke on.
Freshness, judged honestly. Moonshot's API reports the same creation date for every model, so Big-AGI ignores that field rather than badging the whole catalog as new.

Why not just Kimi?

Kimi's own app is a solid way to chat, but it only ever shows you Kimi's take. Beam sends the same prompt to Kimi and to GPT, Claude, or Gemini at once, so a long-context read of your codebase or document gets checked against other labs' models before you trust it. You also get parameters Kimi's app does not expose: temperature, system prompt, per-turn model switching, and a real thinking on/off toggle for the Kimi models that support it (the flagship coding model reasons on every turn, so Big-AGI doesn't show a fake switch for it). Your keys stay in your browser instead of sitting on Moonshot's servers.

Your keys and your data

Turn on Direct Connection and the browser calls Moonshot directly, bypassing the Big-AGI server, whenever your key is client-side and Moonshot allows it. Your keys stay in your browser. Chats are stored locally first and sync only if you turn it on. The AI Inspector opens on any message to show the exact request sent to Moonshot, the token counts, and a cost estimate for that call.

Kimi in Beam

Run Kimi in parallel with Claude, GPT, and Gemini on the same prompt, then reach for Fusions: several strategies that combine, cross-check, and synthesize the parallel answers, which beats just picking the single best one. Parallel runs use more tokens than a single chat.

Bring your Moonshot key. Keep control.

Your key, your data, your choice of model. Big-AGI is open source and self-hostable, so you can check exactly how Moonshot is called.

Launch Big-AGI

<- All Models

Alibaba

Anthropic

AWS Bedrock

Azure

Cerebras

DeepSeek

Fireworks AI

Google Gemini

Groq

MiniMax

Mistral

Moonshot

OpenAI

OpenRouter

Perplexity

Sakana AI

SpaceXAI

Together AI

Z.ai

BIG-AGI

Product

Features Models Controls Changelog BEAM Technology

Resources

Documentation Discord GitHub

Company

Email Us Privacy Terms