BEAM

Features

Rankings

Pro

Docs

GitHub

LAUNCH APP

Use Cerebras Models in Big-AGI.

Bring your own key: Cerebras's API rates, no markup. Keys and chats stay in your browser. Run Cerebras in parallel with other models, then compare and merge the answers.

Gemma 4 31B (Preview)

Z.ai GLM 4.7 (Preview)

GPT OSS 120B

Launch Big-AGI

All supported Cerebras models

ModelContextInputOutputReleased

Gemma 4 31B (Preview)

NEW

Google Gemma 4 31B on Cerebras - first multimodal model on wafer-scale inference (~1,850 tok/s). Vision (base64 PNG/JPEG, max 5 images / 10MB), function callin…

131K

$0.99

$1.49

Jun 2026

Z.ai GLM 4.7 (Preview)

Z.ai GLM 4.7 (355B) on Cerebras (~1,000 tok/s). Strong agentic coding, advanced reasoning (on by default), superior tool use. 131K context, 40K max output.

131K

$2.25

$2.75

Jan 2026

GPT OSS 120B

OpenAI flagship open-weight MoE (120B total, 5.1B active) on Cerebras (~3,000 tok/s). Reasoning (default medium effort) and function calling. 131K context, 40K…

131K

$0.35

$0.75

Aug 2025

Gemma 4 31B (Preview)

NEW

Jun 2026

Google Gemma 4 31B on Cerebras - first multimodal model on wafer-scale inference (~1,850 tok/s). Vision (base64 PNG/JPEG, max 5 images / 10MB), function callin…

131K · in $0.99 · out $1.49

Z.ai GLM 4.7 (Preview)

Jan 2026

Z.ai GLM 4.7 (355B) on Cerebras (~1,000 tok/s). Strong agentic coding, advanced reasoning (on by default), superior tool use. 131K context, 40K max output.

131K · in $2.25 · out $2.75

GPT OSS 120B

Aug 2025

OpenAI flagship open-weight MoE (120B total, 5.1B active) on Cerebras (~3,000 tok/s). Reasoning (default medium effort) and function calling. 131K context, 40K…

131K · in $0.35 · out $0.75

3 models · sorted by release date · prices in USD per 1M tokens · refreshed every 30 minutesCompare every model across vendors →

Get started in 3 steps

Create an API key at the Cerebras console.

Paste it into Big-AGI's model settings.

Start chatting, or Beam it against other models and fuse the answers.

Running Cerebras in Big-AGI

Add your Cerebras API key and run open models on Cerebras wafer-scale hardware at their own API rates. Big-AGI adds no markup and no intermediary: the billing relationship runs directly between you and Cerebras.

Your key, your billing. Usage is billed by Cerebras to your account.
Accurate toggles, even when Cerebras' catalog isn't. Cerebras' own public catalog sometimes under-reports what a model can do; Big-AGI's editorial data wins for known models, so vision, tools, and reasoning toggles stay correct regardless.
Built for speed. Cerebras serves open models at very high tokens per second, so long generations and agentic loops feel instant.

Why Big-AGI instead of the playground?

Cerebras' playground proves the speed; it doesn't give you a workspace. Big-AGI adds persistent chats, personas, and attachments on top of the raw API, then puts that speed to work in Beam: run a Cerebras model next to Claude, GPT, and Gemini, and it's usually done before the others have finished a sentence, so running several models in parallel feels instant instead of like a wait. Parameters, the key, and the chats stay yours to control.

Your keys and your data

Turn on Direct Connection and the browser calls Cerebras directly, bypassing the Big-AGI server, when your key is client-side and Cerebras allows it. Your keys stay in your browser. Chats are stored locally first, and sync only if you turn it on. The AI Inspector shows the exact request, the token counts, and a cost estimate.

Cerebras in Beam

Put a Cerebras model into a Beam next to Claude, GPT, and Gemini, and it's usually the first one finished. Fusions then combine, cross-check, and synthesize the parallel answers instead of just picking the best one. Parallel runs use more tokens than a single chat.

Bring your Cerebras key. Keep control.

Your key, your data, your choice of model. Big-AGI is open source and self-hostable, so you can check exactly how Cerebras is called.

Launch Big-AGI

<- All Models

Alibaba

Anthropic

AWS Bedrock

Azure

Cerebras

DeepSeek

Fireworks AI

Google Gemini

Groq

MiniMax

Mistral

Moonshot

OpenAI

OpenRouter

Perplexity

Sakana AI

SpaceXAI

Together AI

Z.ai

BIG-AGI

Product

Features Models Controls Changelog BEAM Technology

Resources

Documentation Discord GitHub

Company

Email Us Privacy Terms