Qwen3.5-Flash
Budget-friendly native multimodal model. Hybrid linear attention + sparse MoE for efficiency.
chat OpenAI-compatible Vision Function calling 1000K context Released 2026-02
Pricing
| Type | USD / 1M tokens | CNY / 1M tokens |
|---|---|---|
| Input | $0.028 | ¥0.2 |
| Output | $0.275 | ¥2 |
vs Western Models
Qwen3.5-Flash vs GPT-5.2 ($1.75/$14)
98% cheaper input 98% cheaper output
Qwen3.5-Flash vs Claude Sonnet 4.6 ($3/$15)
99% cheaper input 98% cheaper output
Qwen3.5-Flash vs Gemini 2.5 Pro ($1.25/$10)
98% cheaper input 97% cheaper output
IDE Setup
Cursor
base_url: https://dashscope.aliyuncs.com/compatible-mode/v1
model: qwen3.5-flash
Cline
base_url: https://dashscope.aliyuncs.com/compatible-mode/v1
model: qwen3.5-flash
Strengths
- + fast inference
- + multimodal
- + 1M context
- + extremely cost-effective
Watch Out For
- ! content censorship
- ! lower performance than Plus/Max
International Access
Direct access: Yes
Latency from US: 200-400ms
Registration: Alibaba Cloud account
Payment: credit card
More from Alibaba Cloud / Qwen
Ready to try Qwen3.5-Flash?
Get your API key from the official platform.
Go to Alibaba Cloud / Qwen →