Ant Group has open-sourced UI-Venus-1.5, a high-performance end-to-end GUI agent that achieves state-of-the-art (SOTA) results on GUI agent benchmarks. A single model handles grounding, mobile, and web tasks and supports 40+ mainstream Chinese apps. It targets three problems: knowledge gaps, the sim-to-real gap, and multi-model coordination. Released resources include code on GitHub, models on Hugging Face, and a technical report.
Key Points
- SOTA performance on GUI agent benchmarks
- A single model handles grounding, mobile, and web scenarios
- Supports 40+ mainstream Chinese mobile apps
- Addresses knowledge gaps, the sim-to-real gap, and multi-model coordination
- Fully open-sourced, with code, models, and an arXiv report
Impact Analysis
The release should accelerate deployable GUI agents for real apps and lower the barrier to practical AI assistants for mobile and web automation.
Technical Details
The model follows a 'high-performance, practical' design: it processes tasks end to end, without a complex multi-component framework. It tackles GUI-specific challenges such as obscure icons and app-specific logic through unified training.
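To make "end to end" concrete, here is a minimal Python sketch of a single-model agent loop: one policy call maps the instruction and the current screenshot directly to the next action, with no separate planner or grounding module in between. It is illustrative only. The `Action` fields, the `run_episode` function, and the `scripted_policy` stand-in are all hypothetical and are not taken from the UI-Venus-1.5 code or report; a real setup would replace the stand-in with a call to the released model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """Hypothetical action format; the real model's output schema may differ."""
    kind: str          # e.g. "click", "type", or "done"
    x: int = 0
    y: int = 0
    text: str = ""


# Hypothetical policy signature: (instruction, screenshot, history) -> next action.
Policy = Callable[[str, bytes, List[Action]], Action]


def run_episode(
    instruction: str,
    policy: Policy,
    capture: Callable[[], bytes],
    execute: Callable[[Action], None],
    max_steps: int = 10,
) -> List[Action]:
    """Run one task: screenshot in, action out, repeated until done."""
    history: List[Action] = []
    for _ in range(max_steps):
        screenshot = capture()                       # observe the current screen
        action = policy(instruction, screenshot, history)
        history.append(action)
        if action.kind == "done":                    # the policy decides when to stop
            break
        execute(action)                              # apply the action to the device
    return history


def scripted_policy(instruction: str, screenshot: bytes, history: List[Action]) -> Action:
    """Stand-in for the model (assumption): click once, then report completion."""
    if not history:
        return Action("click", x=120, y=340)
    return Action("done")


if __name__ == "__main__":
    executed: List[Action] = []
    trace = run_episode("open settings", scripted_policy, lambda: b"", executed.append)
    print([a.kind for a in trace])
```

The loop keeps device I/O (`capture`, `execute`) separate from the policy, so the same code could drive a mobile emulator or a browser by swapping those two callables.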




