Bring your own Perplexity key and use Perplexity at its own API rates, with no markup. Keys and chats stay in your browser. Run Perplexity in parallel with other models, then compare and merge the answers.
| Model | Description | Context | Input ($ per 1M tokens) | Output ($ per 1M tokens) | Released |
| --- | --- | --- | --- | --- | --- |
| Sonar Reasoning Pro | Premier reasoning model (DeepSeek R1) with Chain of Thought. | 128K | $2 | $8 | Feb 2025 |
| Sonar Deep Research | Expert-level research model for exhaustive searches and comprehensive reports. | 128K | $2 | $8 | Feb 2025 |
| Sonar | Lightweight, cost-effective search model for quick, grounded answers. | 128K | $1 | $1 | Jan 2025 |
| Sonar Pro | Advanced search model for complex queries and deep content understanding. | 200K | $3 | $15 | Jan 2025 |
1. Create an API key at the Perplexity console.
2. Paste it into Big-AGI's model settings.
3. Start chatting, or Beam it against other models and fuse the answers.
Add your Perplexity API key and run the Sonar models at Perplexity's own rates. Big-AGI isn't a reseller: no markup and no intermediary, so the bill comes straight from Perplexity to your account.
Perplexity's app gives you one search engine in one box. Big-AGI runs a Sonar model next to Claude, GPT, or Gemini in the same conversation, so you can cross-check a live, cited web answer against a frontier model's own reasoning instead of trusting either one alone.
Perplexity Pro runs $20 a month for unlimited use in its own app. The API bills differently: per token, like every other provider's API, plus a per-search fee that scales with how much web context a query pulls in. A few grounded lookups a day can land cheaper than the subscription; a Deep Research report that runs dozens of searches usually won't. Parallel Beam runs add more searches on top, which is the honest tradeoff for comparing models side by side.
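The subscription-versus-API comparison is simple arithmetic, and a rough sketch makes it concrete. The example assumes the Sonar prices listed on this page ($1 input, $1 output) are per million tokens. The `perSearchUsd` value, the token counts, and the five-lookups-a-day usage are hypothetical placeholders, not Perplexity's actual fee schedule, so check the current price list before relying on the result.

```typescript
// Rough monthly cost sketch. Names and numbers are illustrative assumptions.
interface Pricing {
  inputPerMTok: number;  // USD per 1M input tokens (assumed unit)
  outputPerMTok: number; // USD per 1M output tokens (assumed unit)
  perSearchUsd: number;  // hypothetical flat fee per web search
}

interface Usage {
  inputTokens: number;
  outputTokens: number;
  searches: number;
}

// Token cost plus per-search cost for one request (or one batch of requests).
function estimateCostUsd(usage: Usage, pricing: Pricing): number {
  const tokenCost =
    (usage.inputTokens / 1e6) * pricing.inputPerMTok +
    (usage.outputTokens / 1e6) * pricing.outputPerMTok;
  return tokenCost + usage.searches * pricing.perSearchUsd;
}

// Sonar token prices from this page; the search fee is a placeholder.
const sonar: Pricing = { inputPerMTok: 1, outputPerMTok: 1, perSearchUsd: 0.005 };

// One small grounded lookup: 500 tokens in, 500 tokens out, one search.
const oneLookup: Usage = { inputTokens: 500, outputTokens: 500, searches: 1 };

const SUBSCRIPTION_USD = 20;
const monthlyLookups = 5 * 30; // five lookups a day for a month
const monthlyUsd = estimateCostUsd(oneLookup, sonar) * monthlyLookups;

console.log(
  `API estimate: $${monthlyUsd.toFixed(2)} vs subscription: $${SUBSCRIPTION_USD}`
);
```

Raising `searches` to a few dozen per request, as a Deep Research report might, and multiplying by the number of models in a Beam shows how quickly the per-search term comes to dominate the estimate.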
With Direct Connection turned on, the browser calls Perplexity directly and bypasses the Big-AGI server, provided your key is client-side and Perplexity's API allows it. Your keys stay in your browser. Chats are stored locally first, and sync only if you turn it on. The AI Inspector shows the exact request, the token counts, and a cost estimate.
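As a rough illustration of what a direct browser-to-Perplexity call could look like, the sketch below builds such a request. It assumes Perplexity's OpenAI-style chat completions endpoint and the `sonar` model id. `buildPerplexityRequest` and its types are hypothetical helpers for this example, not Big-AGI's actual code.

```typescript
// Sketch of a direct request to Perplexity. Helper names are hypothetical.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface DirectRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Builds the URL and fetch options; the key is only ever placed in the
// Authorization header of the request sent to Perplexity.
function buildPerplexityRequest(
  apiKey: string,
  model: string,
  messages: ChatMessage[]
): DirectRequest {
  return {
    url: "https://api.perplexity.ai/chat/completions", // assumed endpoint
    init: {
      method: "POST",
      headers: {
        "Authorization": `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model, messages }),
    },
  };
}

// Usage in a browser (not executed here):
// const { url, init } = buildPerplexityRequest(key, "sonar", [
//   { role: "user", content: "What changed in the latest release?" },
// ]);
// const response = await fetch(url, init);
```

Keeping request construction separate from the `fetch` call makes it easy to inspect exactly what leaves the browser, in the same spirit as the AI Inspector.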
Put Sonar into a Beam next to Claude, GPT, and Gemini, so a live, cited web answer sits right beside frontier-model reasoning. Fusions then combine, cross-check, and synthesize the parallel answers instead of just picking the best one. Parallel runs mean more tokens, and for Perplexity, more searches too.
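The Beam-then-fuse flow can be sketched as a parallel fan-out followed by a merge prompt. This is a minimal illustration under stated assumptions, not how Big-AGI implements Beam or Fusions: `ModelCall`, `beam`, and `buildFusionPrompt` are hypothetical names, and each model is represented by any function that takes a prompt and returns a promise of text.

```typescript
// Hypothetical sketch of a parallel run plus a fusion prompt.
type ModelCall = (prompt: string) => Promise<string>;

interface BeamResult {
  model: string;
  answer?: string; // set when the model replied
  error?: string;  // set when the call failed
}

// Sends the same prompt to every model in parallel. One failing model
// does not cancel the others; its error is recorded instead.
function beam(
  prompt: string,
  models: Record<string, ModelCall>
): Promise<BeamResult[]> {
  const runs = Object.keys(models).map((name) =>
    models[name](prompt)
      .then((answer): BeamResult => ({ model: name, answer }))
      .catch((err): BeamResult => ({ model: name, error: String(err) }))
  );
  return Promise.all(runs);
}

// Builds a prompt asking a model to merge the successful answers
// rather than just pick one of them.
function buildFusionPrompt(question: string, results: BeamResult[]): string {
  const answers = results
    .filter((r) => r.answer !== undefined)
    .map((r) => `[${r.model}]\n${r.answer}`)
    .join("\n\n");
  return (
    `Question: ${question}\n\nCandidate answers:\n${answers}\n\n` +
    "Combine these answers, cross-check their claims, and note any disagreements."
  );
}
```

Every entry in `models` is a separate billed request, which is why parallel runs cost more tokens and, for Perplexity, more searches.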
Your key, your data, your choice of model. Big-AGI is open source and self-hostable, so you can check exactly how Perplexity is called.
© 2026 Token Fabrics · Built with passion in San Diego