BEAM

Features

Rankings

Pro

Docs

GitHub

LAUNCH APP

Use Fireworks AI Models in Big-AGI.

Bring your own key: Fireworks AI's API rates, no markup. Keys and chats stay in your browser. Run Fireworks AI in parallel with other models, then compare and merge the answers.

Intent En VERIFY1

Intent En 6A34F8

Intent De 3CD022

Launch Big-AGI

All supported Fireworks AI models

ModelContextInputOutputReleased

Intent En VERIFY1

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent En 6A34F8

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent De 3CD022

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent De C9EC41

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent En 3CD022

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent De 6A34F8

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent En C9EC41

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent Es C9EC41

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent Es 3CD022

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent En BC8D96

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

Intent Es 6A34F8

deprecated

Fine-tuned adapter served on Fireworks AI.

Jul 2026

GLM 5.2

NEW

Z.ai flagship with 1M-token context and multi-effort coding for long-horizon agentic tasks. New IndexShare architecture and improved MTP layer cut per-token co…

$1.4

$4.4

Jun 2026

DeepSeek V4 Pro

DeepSeek flagship open MoE (1.6T params) for frontier reasoning, coding, and long-context work up to 1M tokens. Hybrid attention keeps long contexts efficient.

$1.74

$3.48

Apr 2026

Kimi K2.6 (Vision)

Moonshot AI native-multimodal agentic model tuned for long-horizon coding, autonomous execution, and swarm task orchestration.

262K

$0.95

Apr 2026

GLM 5.1

Z.ai 754B-parameter MoE built for agentic engineering, with strong coding and sustained performance across long multi-round tasks.

203K

$1.4

$4.4

Mar 2026

Kimi K2.5 (Vision)

deprecated

Open-weights model served on Fireworks AI.

262K

Jan 2026

GPT-OSS 120B

OpenAI open-weight model for high-reasoning, agentic, general-purpose use that fits on a single H100.

131K

$0.15

$0.6

Aug 2025

Intent En VERIFY1

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent En 6A34F8

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent De 3CD022

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent De C9EC41

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent En 3CD022

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent De 6A34F8

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent En C9EC41

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent Es C9EC41

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent Es 3CD022

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent En BC8D96

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

Intent Es 6A34F8

deprecated

Jul 2026

Fine-tuned adapter served on Fireworks AI.

- · in - · out -

GLM 5.2

NEW

Jun 2026

Z.ai flagship with 1M-token context and multi-effort coding for long-horizon agentic tasks. New IndexShare architecture and improved MTP layer cut per-token co…

1M · in $1.4 · out $4.4

DeepSeek V4 Pro

Apr 2026

DeepSeek flagship open MoE (1.6T params) for frontier reasoning, coding, and long-context work up to 1M tokens. Hybrid attention keeps long contexts efficient.

1M · in $1.74 · out $3.48

Kimi K2.6 (Vision)

Apr 2026

Moonshot AI native-multimodal agentic model tuned for long-horizon coding, autonomous execution, and swarm task orchestration.

262K · in $0.95 · out $4

GLM 5.1

Mar 2026

Z.ai 754B-parameter MoE built for agentic engineering, with strong coding and sustained performance across long multi-round tasks.

203K · in $1.4 · out $4.4

Kimi K2.5 (Vision)

deprecated

Jan 2026

Open-weights model served on Fireworks AI.

262K · in - · out -

GPT-OSS 120B

Aug 2025

OpenAI open-weight model for high-reasoning, agentic, general-purpose use that fits on a single H100.

131K · in $0.15 · out $0.6

17 models · sorted by release date · prices in USD per 1M tokens · refreshed every 30 minutesCompare every model across vendors →

Get started in 3 steps

Create an API key at the Fireworks AI console.

Paste it into Big-AGI's model settings.

Start chatting, or Beam it against other models and fuse the answers.

Running Fireworks AI in Big-AGI

Add your Fireworks API key over its OpenAI-compatible endpoint and reach the open-model catalog at Fireworks' own rates. Big-AGI adds no markup and no intermediary: the billing relationship runs directly between you and Fireworks.

Your key, your billing. Usage is billed by Fireworks AI to your account.
Always current. Fireworks is a 100% live catalog: every model Fireworks adds, from DeepSeek to Llama to Kimi, appears the moment it's live on their API, with real release dates driving the "new" badges, no hand-maintained list to fall behind.
Zero setup beyond the key. Big-AGI recognizes Fireworks endpoints by hostname and turns on the right vision and tool flags per model automatically.
Speed as the product. Fireworks' custom inference stack pushes open models to interactive latencies, priced per token.

Why Big-AGI instead of the playground?

Fireworks' playground is built for one prompt at a time. Big-AGI turns the same key into a persistent workspace: chats that stick around, personas, and file and image attachments, all layered on top of the raw API. It's also the only place a Fireworks model runs next to Claude, GPT, and Gemini in Beam, with the parameters and the key still yours to control.

Your keys and your data

Turn on Direct Connection and the browser talks to Fireworks directly, skipping the Big-AGI server, when your key is client-side and Fireworks allows it. Your keys stay in your browser. Chats are stored locally first, and sync only if you turn it on. The AI Inspector shows the exact request, the token counts, and a cost estimate, so you always know what you're billed for.

Fireworks AI in Beam

Put a Fireworks model into a Beam alongside frontier labs, or run a few open models side by side at Fireworks' speed. Fusions then combine, cross-check, and synthesize the parallel answers instead of just picking the best one. Parallel runs use more tokens than a single chat.

Bring your Fireworks AI key. Keep control.

Your key, your data, your choice of model. Big-AGI is open source and self-hostable, so you can check exactly how Fireworks AI is called.

Launch Big-AGI

<- All Models

Alibaba

Anthropic

AWS Bedrock

Azure

Cerebras

DeepSeek

Fireworks AI

Google Gemini

Groq

MiniMax

Mistral

Moonshot

OpenAI

OpenRouter

Perplexity

Sakana AI

SpaceXAI

Together AI

Z.ai

BIG-AGI

Product

Features Models Controls Changelog BEAM Technology

Resources

Documentation Discord GitHub

Company

Email Us Privacy Terms