Use Moonshot Models in Big-AGI.

Bring your own Moonshot key and use Moonshot at its own API rates, with no markup. Keys and chats stay in your browser. Run Moonshot in parallel with other models, then compare and merge the answers.

Kimi K2.7 Code
Kimi K2.7 Code Highspeed
Kimi K2.6

All supported Moonshot models

ModelContextInputOutputReleased

Kimi K2.7 Code

NEW
VisionReasoningTools / functions

Code-focused multimodal model (text, image, video inputs) with always-on thinking. ~180 tok/s output (up to 260 in short contexts for highspeed). 256K context.

262K

$0.95

$4

Jun 2026

Kimi K2.7 Code Highspeed

NEW
VisionReasoningTools / functions

High-speed code variant with ~180 tok/s output (up to 260 in short contexts). Native multimodal with always-on thinking. 256K context.

262K

$1.9

$8

Jun 2026

Kimi K2.6

VisionTools / functions

Native multimodal flagship (text, image, video inputs) with thinking and non-thinking modes. Stronger long-form coding, improved instruction compliance and self-correction. 256K context.

262K

$0.95

$4

Apr 2026

Kimi K2.5

VisionTools / functions

Supports vision (images/videos), thinking mode, and Agent tasks. 256K context.

262K

$0.6

$3

Jan 2026

V1 8K Vision (Preview)

Vision

Legacy vision model with 8K context. Preview variant - use moonshot-v1-vision for production.

8K

$0.2

$2

Jan 2025

V1 32K Vision (Preview)

Vision

Legacy vision model with 32K context. Preview variant - use moonshot-v1-vision for production.

33K

$1

$3

Jan 2025

V1 128K Vision (Preview)

Vision

Legacy vision model with 128K context. Preview variant - use moonshot-v1-vision for production.

131K

$2

$5

Jan 2025

V1 128K

Tools / functions

Legacy V1 model with 128K context. Deprecated - use Kimi K2 Instruct instead.

131K

$2

$5

Feb 2024

V1 32K

Tools / functions

Legacy V1 model with 32K context. Deprecated - use Kimi K2 Instruct instead.

33K

$1

$3

Feb 2024

V1 8K

Tools / functions

Legacy V1 model with 8K context. Deprecated - use Kimi K2 Instruct instead.

8K

$0.2

$2

Feb 2024

Kimi K2.7 Code

NEW
Jun 2026

Code-focused multimodal model (text, image, video inputs) with always-on thinking. ~180 tok/s output (up to 260 in short contexts for highspeed). 256K context.

VisionReasoningTools / functions
262K · in $0.95 · out $4

Kimi K2.7 Code Highspeed

NEW
Jun 2026

High-speed code variant with ~180 tok/s output (up to 260 in short contexts). Native multimodal with always-on thinking. 256K context.

VisionReasoningTools / functions
262K · in $1.9 · out $8

Kimi K2.6

Apr 2026

Native multimodal flagship (text, image, video inputs) with thinking and non-thinking modes. Stronger long-form coding, improved instruction compliance and self-correction. 256K context.

VisionTools / functions
262K · in $0.95 · out $4

Kimi K2.5

Jan 2026

Supports vision (images/videos), thinking mode, and Agent tasks. 256K context.

VisionTools / functions
262K · in $0.6 · out $3

V1 8K Vision (Preview)

Jan 2025

Legacy vision model with 8K context. Preview variant - use moonshot-v1-vision for production.

Vision
8K · in $0.2 · out $2

V1 32K Vision (Preview)

Jan 2025

Legacy vision model with 32K context. Preview variant - use moonshot-v1-vision for production.

Vision
33K · in $1 · out $3

V1 128K Vision (Preview)

Jan 2025

Legacy vision model with 128K context. Preview variant - use moonshot-v1-vision for production.

Vision
131K · in $2 · out $5

V1 128K

Feb 2024

Legacy V1 model with 128K context. Deprecated - use Kimi K2 Instruct instead.

Tools / functions
131K · in $2 · out $5

V1 32K

Feb 2024

Legacy V1 model with 32K context. Deprecated - use Kimi K2 Instruct instead.

Tools / functions
33K · in $1 · out $3

V1 8K

Feb 2024

Legacy V1 model with 8K context. Deprecated - use Kimi K2 Instruct instead.

Tools / functions
8K · in $0.2 · out $2
10 models · sorted by release date · prices in USD per 1M tokens · refreshed every 30 minutesCompare every model across vendors →

Running Moonshot Kimi in Big-AGI

Add your Moonshot API key and run the Kimi models at Moonshot's own API rates. Big-AGI adds no markup and keeps your keys and chats in your browser, not on its servers.

  • Your key, your billing. Usage is billed by Moonshot to your account. Big-AGI does not meter or charge for model usage.
  • Direct Connection. Turn it on and the browser calls Moonshot directly, bypassing the Big-AGI server, when your key is client-side and Moonshot allows it.
  • Built for long context and tools. Kimi holds huge context and drives tool calls well, so it keeps up on agentic and whole-codebase work.
  • Beam. Run Kimi in parallel with Claude, GPT, and Gemini, then compare or merge the answers. Parallel runs use more tokens than a single chat.

Bring your Moonshot key. Keep control.

Your key, your data, your choice of model. Big-AGI is open source and self-hostable, so you can check exactly how Moonshot is called.

© 2026 Token Fabrics·Built with passion in San Diego