Home Knowledge Base Kuhn Poker

Kuhn Poker

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Safe Equilibrium Policy Optimization for Strategic Agent Policies

arXiv:2605.30854v1 Announce Type: new Abstract: Language models fine-tuned with reinforcement learning typically optimize for task reward, ignoring multi-agent strategic structure. Because these agents condition on natural language game-state descriptions and emit actions through free-form generation, strategic failure modes -- exploiting weaker opponents, coordinating on harmful equilibria, and externalizing costs are inseparable from the language interface itself. We propose Safe...

arXiv CS 9d ago