WAC integrates multi-agent collaboration where an action model consults a world model for strategic guidance on web tasks, grounding suggestions into executable actions. It employs a two-stage deduction chain with consequence simulation and judge model scrutiny for risk-aware action correction. Experiments show 1.8% gains on VisualWebArena and 1.3% on Online-Mind2Web.
Key Points
- 1.Multi-agent setup: action model consults world model expert for web guidance
- 2.Leverages state transition dynamics to propose better candidate actions
- 3.Two-stage chain simulates outcomes and triggers corrective feedback via judge model
- 4.Achieves 1.8% absolute gain on VisualWebArena benchmark
Impact Analysis
Enhances reliability of LLM-based web agents by reducing risky actions and task failures. Offers practical improvements for automating complex web navigation. Positions world-model integration as key for resilient agentic systems.
Technical Details
World model simulates environmental state transitions for action outcomes. Judge model scrutinizes simulations to provide feedback-driven refinements. Action model uses prior knowledge to ground high-level suggestions into concrete web actions.