Use Alibaba Models in Big-AGI.

Bring your own Alibaba key and use Alibaba at its own API rates, with no markup. Keys and chats stay in your browser. Run Alibaba in parallel with other models, then compare and merge the answers.

Kimi K2.7 Code (Alibaba)
GLM-5.2 (Alibaba)
DeepSeek V4 Pro (Alibaba)

All supported Alibaba models

ModelContextInputOutputReleased

Kimi K2.7 Code (Alibaba)

NEW
VisionReasoningTools / functions

Moonshot Kimi K2.7 Code served via Alibaba Model Studio. Multimodal, always-on thinking, 256K context. (Alibaba pricing not yet published.)

262K

-

-

Jun 2026

GLM-5.2 (Alibaba)

NEW
ReasoningTools / functions

Zhipu GLM-5.2 served via Alibaba Model Studio. 1M context, thinking. (Alibaba pricing not yet published.)

1M

-

-

Jun 2026

DeepSeek V4 Pro (Alibaba)

NEW
ReasoningTools / functions

DeepSeek V4 Pro served via Alibaba Model Studio (Alibaba pricing, ~5x DeepSeek-direct). 1M context, thinking.

1M

$2.4

$4.8

Jun 2026

DeepSeek V3.2 (Alibaba)

NEW
ReasoningTools / functions

DeepSeek V3.2 served via Alibaba Model Studio (superseded by V4). Thinking.

131K

$0.57

$1.71

Jun 2026

DeepSeek V4 Flash (Alibaba)

NEW
ReasoningTools / functions

DeepSeek V4 Flash served via Alibaba Model Studio. 1M context, thinking.

1M

$0.2

$0.4

Jun 2026

[?] Qwen3.7 Max [2026 05 17]

NEW
ReasoningTools / functions

Flagship agent model with native extended thinking and 1M context. Text-only; strong at coding, productivity, and long-horizon autonomous tasks.

1M

$2.5

$7.5

Jun 2026

Qwen3.6 Flash

NEW
VisionReasoningTools / functions

Fast, cost-effective multimodal model with 1M context, near-flagship quality, vision/video, and built-in tools.

1M

$0.25

$1.5

Jun 2026

[?] Qwen3.7 Max [preview]

NEW
ReasoningTools / functions

Flagship agent model with native extended thinking and 1M context. Text-only; strong at coding, productivity, and long-horizon autonomous tasks.

1M

$2.5

$7.5

Jun 2026

Qwen3.7 Plus

NEW
VisionReasoningTools / functions

Multimodal agent model with 1M context, native thinking, and vision/video understanding. Lower cost than Max.

1M

$0.4

$1.6

Jun 2026

Qwen3 Coder Plus

Tools / functions

Agentic coding model with very long context. Tiered pricing by input length (up to 1M).

1M

$1

$5

May 2026

[?] Qwen3 VL Plus [2025 12 19]

VisionReasoningTools / functions

Current vision-language model with strong visual reasoning and thinking. Tiered pricing by input length (up to 256K).

262K

$0.2

$1.6

Apr 2026

Glm 5.1

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 Next 80b A3b Instruct

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 235b A22b Instruct 2507

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen Plus

ReasoningTools / functions

Balanced quality, speed, and cost with hybrid thinking. 1M context.

1M

$0.4

$1.2

-

Qwen3 Next 80b A3b Thinking

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 30b A3b Thinking 2507

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 8b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 235b A22b Thinking 2507

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.6 35b A3b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 32b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 30b A3b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 Vl 235b A22b Instruct

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 Coder Next

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 235b A22b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 30b A3b Instruct 2507

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 14b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.5 122b A10b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen Max

Tools / functions

Best quality of the stable commercial line. 32K context.

33K

$1.6

$6.4

-

Qwen3.5 Flash 2026 02 23

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.5 35b A3b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen Vl Max

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwq Plus 2025 03 05

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 Vl Flash 2025 10 15

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.5 27b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 Coder Flash

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen Vl Plus

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.5 397b A17b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.6 Max Preview

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 Max Preview

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.5 Plus 2026 02 15

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen Coder Plus

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.6 Plus

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen Flash

ReasoningTools / functions

Fast and very low cost with hybrid thinking. 1M context.

1M

$0.05

$0.4

-

Qwen3 Max

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3.6 27b

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qvq Max

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 Vl 235b A22b Thinking

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen3 Coder 480b A35b Instruct

Tools / functions

Alibaba model (not yet curated).

131K

-

-

-

Qwen Turbo

Tools / functions

Fastest and cheapest for simple tasks. 1M context.

1M

$0.05

$0.2

-

[?] Qwen Plus [latest]

ReasoningTools / functions

Balanced quality, speed, and cost with hybrid thinking. 1M context.

1M

$0.4

$1.2

-

Kimi K2.7 Code (Alibaba)

NEW
Jun 2026

Moonshot Kimi K2.7 Code served via Alibaba Model Studio. Multimodal, always-on thinking, 256K context. (Alibaba pricing not yet published.)

VisionReasoningTools / functions
262K · in - · out -

GLM-5.2 (Alibaba)

NEW
Jun 2026

Zhipu GLM-5.2 served via Alibaba Model Studio. 1M context, thinking. (Alibaba pricing not yet published.)

ReasoningTools / functions
1M · in - · out -

DeepSeek V4 Pro (Alibaba)

NEW
Jun 2026

DeepSeek V4 Pro served via Alibaba Model Studio (Alibaba pricing, ~5x DeepSeek-direct). 1M context, thinking.

ReasoningTools / functions
1M · in $2.4 · out $4.8

DeepSeek V3.2 (Alibaba)

NEW
Jun 2026

DeepSeek V3.2 served via Alibaba Model Studio (superseded by V4). Thinking.

ReasoningTools / functions
131K · in $0.57 · out $1.71

DeepSeek V4 Flash (Alibaba)

NEW
Jun 2026

DeepSeek V4 Flash served via Alibaba Model Studio. 1M context, thinking.

ReasoningTools / functions
1M · in $0.2 · out $0.4

[?] Qwen3.7 Max [2026 05 17]

NEW
Jun 2026

Flagship agent model with native extended thinking and 1M context. Text-only; strong at coding, productivity, and long-horizon autonomous tasks.

ReasoningTools / functions
1M · in $2.5 · out $7.5

Qwen3.6 Flash

NEW
Jun 2026

Fast, cost-effective multimodal model with 1M context, near-flagship quality, vision/video, and built-in tools.

VisionReasoningTools / functions
1M · in $0.25 · out $1.5

[?] Qwen3.7 Max [preview]

NEW
Jun 2026

Flagship agent model with native extended thinking and 1M context. Text-only; strong at coding, productivity, and long-horizon autonomous tasks.

ReasoningTools / functions
1M · in $2.5 · out $7.5

Qwen3.7 Plus

NEW
Jun 2026

Multimodal agent model with 1M context, native thinking, and vision/video understanding. Lower cost than Max.

VisionReasoningTools / functions
1M · in $0.4 · out $1.6

Qwen3 Coder Plus

May 2026

Agentic coding model with very long context. Tiered pricing by input length (up to 1M).

Tools / functions
1M · in $1 · out $5

[?] Qwen3 VL Plus [2025 12 19]

Apr 2026

Current vision-language model with strong visual reasoning and thinking. Tiered pricing by input length (up to 256K).

VisionReasoningTools / functions
262K · in $0.2 · out $1.6

Glm 5.1

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 Next 80b A3b Instruct

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 235b A22b Instruct 2507

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen Plus

-

Balanced quality, speed, and cost with hybrid thinking. 1M context.

ReasoningTools / functions
1M · in $0.4 · out $1.2

Qwen3 Next 80b A3b Thinking

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 30b A3b Thinking 2507

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 8b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 235b A22b Thinking 2507

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.6 35b A3b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 32b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 30b A3b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 Vl 235b A22b Instruct

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 Coder Next

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 235b A22b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 30b A3b Instruct 2507

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 14b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.5 122b A10b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen Max

-

Best quality of the stable commercial line. 32K context.

Tools / functions
33K · in $1.6 · out $6.4

Qwen3.5 Flash 2026 02 23

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.5 35b A3b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen Vl Max

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwq Plus 2025 03 05

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 Vl Flash 2025 10 15

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.5 27b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 Coder Flash

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen Vl Plus

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.5 397b A17b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.6 Max Preview

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 Max Preview

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.5 Plus 2026 02 15

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen Coder Plus

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.6 Plus

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen Flash

-

Fast and very low cost with hybrid thinking. 1M context.

ReasoningTools / functions
1M · in $0.05 · out $0.4

Qwen3 Max

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3.6 27b

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qvq Max

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 Vl 235b A22b Thinking

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen3 Coder 480b A35b Instruct

-

Alibaba model (not yet curated).

Tools / functions
131K · in - · out -

Qwen Turbo

-

Fastest and cheapest for simple tasks. 1M context.

Tools / functions
1M · in $0.05 · out $0.2

[?] Qwen Plus [latest]

-

Balanced quality, speed, and cost with hybrid thinking. 1M context.

ReasoningTools / functions
1M · in $0.4 · out $1.2
51 models · sorted by release date · prices in USD per 1M tokens · refreshed every 30 minutesCompare every model across vendors →

Running Alibaba models in Big-AGI

Add your Alibaba Cloud Model Studio API key and run the Qwen family, plus the other models Alibaba serves, at Alibaba's own API rates. Big-AGI adds no markup and keeps your keys and chats in your browser, not on its servers.

  • Your key, your billing. Usage is billed by Alibaba Cloud to your account. Big-AGI does not meter or charge for model usage.
  • Direct Connection. Turn it on and the browser calls Alibaba directly, bypassing the Big-AGI server, when your key is client-side and Alibaba allows it.
  • Qwen, end to end. Multilingual chat, strong coding, vision, and long context. Pick the right Qwen for the job and switch mid-conversation.
  • Beam. Run Qwen in parallel with Claude, GPT, and Gemini, then compare or merge the answers. Parallel runs use more tokens than a single chat.

Bring your Alibaba key. Keep control.

Your key, your data, your choice of model. Big-AGI is open source and self-hostable, so you can check exactly how Alibaba is called.

© 2026 Token Fabrics·Built with passion in San Diego