← Models / Alibaba Cloud / Qwen

Qwen3.5-Flash

Budget-friendly native multimodal model. Hybrid linear attention + sparse MoE for efficiency.

chat OpenAI-compatible Vision Function calling 1000K context Released 2026-02

Pricing

Type USD / 1M tokens CNY / 1M tokens
Input $0.028 ¥0.2
Output $0.275 ¥2

vs Western Models

Qwen3.5-Flash vs GPT-5.2 ($1.75/$14)
98% cheaper input 98% cheaper output
Qwen3.5-Flash vs Claude Sonnet 4.6 ($3/$15)
99% cheaper input 98% cheaper output
Qwen3.5-Flash vs Gemini 2.5 Pro ($1.25/$10)
98% cheaper input 97% cheaper output

IDE Setup

Cursor

base_url: https://dashscope.aliyuncs.com/compatible-mode/v1
model: qwen3.5-flash

Cline

base_url: https://dashscope.aliyuncs.com/compatible-mode/v1
model: qwen3.5-flash

Strengths

  • + fast inference
  • + multimodal
  • + 1M context
  • + extremely cost-effective

Watch Out For

  • ! content censorship
  • ! lower performance than Plus/Max

International Access

Direct access: Yes
Latency from US: 200-400ms
Registration: Alibaba Cloud account
Payment: credit card

More from Alibaba Cloud / Qwen

Ready to try Qwen3.5-Flash?

Get your API key from the official platform.

Go to Alibaba Cloud / Qwen →