Use Fireworks AI Models in Big-AGI.

Bring your own Fireworks AI key and use Fireworks AI at its own API rates, with no markup. Keys and chats stay in your browser. Run Fireworks AI in parallel with other models, then compare and merge the answers.

Glm 5p2
Deepseek V4 Pro
Kimi K2p6 (Vision)

All supported Fireworks AI models

ModelContextInputOutputReleased

Glm 5p2

NEW
Tools / functions

fireworks `HF_BASE_MODEL` type.

1M

-

-

Jun 2026

Deepseek V4 Pro

Tools / functions

fireworks `HF_BASE_MODEL` type.

1M

-

-

Apr 2026

Kimi K2p6 (Vision)

VisionTools / functions

fireworks `HF_BASE_MODEL` type.

262K

-

-

Apr 2026

Glm 5p1

Tools / functions

fireworks `HF_BASE_MODEL` type.

203K

-

-

Mar 2026

Kimi K2p5 (Vision)

VisionTools / functions

fireworks `HF_BASE_MODEL` type.

262K

-

-

Jan 2026

Gpt Oss 120b

Tools / functions

fireworks `HF_BASE_MODEL` type.

131K

-

-

Aug 2025

Glm 5p2

NEW
Jun 2026

fireworks `HF_BASE_MODEL` type.

Tools / functions
1M · in - · out -

Deepseek V4 Pro

Apr 2026

fireworks `HF_BASE_MODEL` type.

Tools / functions
1M · in - · out -

Kimi K2p6 (Vision)

Apr 2026

fireworks `HF_BASE_MODEL` type.

VisionTools / functions
262K · in - · out -

Glm 5p1

Mar 2026

fireworks `HF_BASE_MODEL` type.

Tools / functions
203K · in - · out -

Kimi K2p5 (Vision)

Jan 2026

fireworks `HF_BASE_MODEL` type.

VisionTools / functions
262K · in - · out -

Gpt Oss 120b

Aug 2025

fireworks `HF_BASE_MODEL` type.

Tools / functions
131K · in - · out -
6 models · sorted by release date · prices in USD per 1M tokens · refreshed every 30 minutesCompare every model across vendors →

Get started in 3 steps

1

Create an API key at the Fireworks AI console.

2

Paste it into Big-AGI's model settings.

3

Start chatting, or Beam it against other models and fuse the answers.

Running Fireworks AI in Big-AGI

Add your Fireworks API key over its OpenAI-compatible endpoint and reach the open-model catalog at Fireworks' own rates. Big-AGI adds no markup and no intermediary: the billing relationship runs directly between you and Fireworks.

  • Your key, your billing. Usage is billed by Fireworks AI to your account.
  • Always current. Fireworks is a 100% live catalog: every model Fireworks adds, from DeepSeek to Llama to Kimi, appears the moment it's live on their API, with real release dates driving the "new" badges, no hand-maintained list to fall behind.
  • Zero setup beyond the key. Big-AGI recognizes Fireworks endpoints by hostname and turns on the right vision and tool flags per model automatically.
  • Speed as the product. Fireworks' custom inference stack pushes open models to interactive latencies, priced per token.

Why Big-AGI instead of the playground?

Fireworks' playground is built for one prompt at a time. Big-AGI turns the same key into a persistent workspace: chats that stick around, personas, and file and image attachments, all layered on top of the raw API. It's also the only place a Fireworks model runs next to Claude, GPT, and Gemini in Beam, with the parameters and the key still yours to control.

Your keys and your data

Turn on Direct Connection and the browser talks to Fireworks directly, skipping the Big-AGI server, when your key is client-side and Fireworks allows it. Your keys stay in your browser. Chats are stored locally first, and sync only if you turn it on. The AI Inspector shows the exact request, the token counts, and a cost estimate, so you always know what you're billed for.

Fireworks AI in Beam

Put a Fireworks model into a Beam alongside frontier labs, or run a few open models side by side at Fireworks' speed. Fusions then combine, cross-check, and synthesize the parallel answers instead of just picking the best one. Parallel runs use more tokens than a single chat.

Bring your Fireworks AI key. Keep control.

Your key, your data, your choice of model. Big-AGI is open source and self-hostable, so you can check exactly how Fireworks AI is called.

© 2026 Token Fabrics·Built with passion in San Diego