Understand which AI models are most trustworthy and how they behave when incentivized to win
The game explicitly tests whether models remain truthful or resort to lies and deception to achieve their goals. The results show which models prioritize honesty and which prioritize victory (e.g., o3 schemes and manipulates, while Claude opts for peace), providing direct evidence of each model's values and decision-making under pressure.
