Alibaba Open-Sources 3 Qwen3.5 Models for Consumer GPUs

💡New open-weight Qwen3.5 beats GPT-5 mini, runs on consumer GPUs—deploy locally now
⚡ 30-Second TL;DR
What Changed
Open-sourced Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, Qwen3.5-27B models
Why It Matters
Enables affordable local deployment of top-tier LLMs, lowering barriers for developers and boosting open-source competition against closed models. Alibaba Cloud's cheap API accelerates enterprise adoption.
What To Do Next
Download Qwen3.5-27B from Hugging Face and test on your consumer GPU for agent tasks.
🧠 Deep Insight
Web-grounded analysis with 9 cited sources.
🔑 Enhanced Key Takeaways
- •Qwen3.5-397B-A17B achieves 19x faster decoding on long-context tasks (256k tokens) compared to Qwen3-Max while matching its reasoning and coding performance[7].
- •Qwen3.5 excels in agentic terminal coding with 52.5 on Terminal-Bench 2.0, surpassing Qwen3-Max (22.5) and approaching Gemini 3 Pro (54.2)[7].
- •In document recognition, Qwen3.5 scores 90.8 on OmniDocBench v1.5, outperforming GPT-5.2 (85.7) and Claude Opus 4.5 (87.7)[7].
📊 Competitor Analysis▸ Show
| Feature | Qwen3.5-Flash/Qwen3 VL | GPT-5 Mini |
|---|---|---|
| Context Window | 262k tokens | 400k tokens[1][2][3] |
| Input Cost | 0.2 yuan (~$0.028)/M tokens (Flash); cheaper overall | ~$0.25/M tokens[2][5][6] |
| Output Cost | Not specified; generally lower | ~$2/M tokens[2][5][6] |
| Coding Benchmarks | 16.5 (Qwen3 VL); strong in Terminal-Bench | 35.3; leads in LiveCodeBench (83.8)[1][7] |
| Speed (tok/s) | 44.4 (Qwen3 VL) | 130.7[1] |
| Intelligence Index | 20.6 (Qwen3 VL) | 41.0[1] |
🔮 Future ImplicationsAI analysis grounded in cited sources
⏳ Timeline
📎 Sources (9)
Factual claims are grounded in the sources below. Forward-looking analysis is AI-generated interpretation.
- pricepertoken.com — Openai Gpt 5 Mini vs Qwen Qwen3 Vl 235b A22b Instruct
- sourceforge.net — Gpt 5 Mini vs Qwen3.5 Plus
- artificialanalysis.ai — Qwen3 5 397b A17b vs Gpt 5 Mini
- llm-stats.com — Gpt 5 Mini 2025 08 07 vs Qwen3.5 397b A17b
- appaca.ai — Gpt 5 Mini vs Qwen3 Max
- blog.galaxy.ai — Gpt 5 Mini vs Qwen3 Coder
- datacamp.com — Qwen3 5
- timetoact-group.at — LLM Benchmarks Summer 2025
- openrouter.ai — Qwen3 Coder
Weekly AI Recap
Read this week's curated digest of top AI events →
👉Related Updates
AI-curated news aggregator. All content rights belong to original publishers.
Original source: 机器之心 ↗