AgoraBench tests LLMs in nine bargaining scenarios like deception; utility metrics measure human alignment. MERIT feedback via prompting/finetuning elicits deeper strategy and opponent awareness. Outperforms baselines in negotiation power and acquisition.
Key Points
- 1.9 challenging negotiation settings
- 2.Utility-based metrics
- 3.Human-pref dataset pipeline
Impact Analysis
Improves LLM bargaining to match human preferences in complex deals.