Qwen3.5-35B-A3B Nears Claude Opus on SWE-bench Hard

๐ก3B MoE hits 38% SWE-bench Hard, beats baselines w/ simple verify trick
โก 30-Second TL;DR
What Changed
37.8% on SWE-bench Hard (45 tasks) with verify-on-edit vs 22.2% baseline
Why It Matters
Proves efficient verification boosts small MoE models to top-tier coding agent performance. Enables cost-effective self-hosted SWE agents for practitioners.
What To Do Next
Implement verify-on-edit in your vLLM agent loop for SWE-bench testing.
๐ง Deep Insight
Web-grounded analysis with 6 cited sources.
๐ Enhanced Key Takeaways
- โขQwen3.5-35B-A3B has a 262k token context window, exceeding Claude Opus 4.6's 200k tokens, enabling handling of larger codebases[1][2].
- โขAs an open-weight model runnable locally, it achieves over 50 tokens per second on a single NVIDIA 4090 GPU, avoiding API rate limits[5].
- โขQwen3 offers an 83x lower price per token compared to Claude Opus for coding tasks, making it ideal for high-volume prototyping[6].
๐ Competitor Analysisโธ Show
| Metric | Qwen3.5-35B-A3B | Claude Opus 4.6 |
|---|---|---|
| Creator | Alibaba (inferred open-weight) | Anthropic |
| Context Window | 262k tokens | 200k tokens (API), up to 1M input |
| Pricing | Free/open-weight (local), ~$0.06 equiv. blended | $5 input / $25 output per M tokens |
| Speed | 50+ t/s on 4090 local | Lower on API, optimized for reasoning |
| SWE-bench Hard | 37.8% (verify-on-edit) | 40% |
| Open Source | Yes (local runnable) | Proprietary |
๐ฎ Future ImplicationsAI analysis grounded in cited sources
โณ Timeline
๐ Sources (6)
Factual claims are grounded in the sources below. Forward-looking analysis is AI-generated interpretation.
- artificialanalysis.ai โ Qwen3 5 35b A3b vs Claude Opus 4 6 Adaptive
- artificialanalysis.ai โ Qwen3 5 35b A3b vs Claude Opus 4 5 Thinking
- blog.galaxy.ai โ Claude Opus 4 6 vs Qwen3 30b A3b
- artificialanalysis.ai โ Qwen3 5 35b A3b vs Qwen3 5 35b A3b
- news.ycombinator.com โ Item
- ucstrategies.com โ Qwen 3 in 2026 the Best Free Coding AI with a Catch
Weekly AI Recap
Read this week's curated digest of top AI events โ
๐Related Updates
AI-curated news aggregator. All content rights belong to original publishers.
Original source: Reddit r/LocalLLaMA โ