Announcement_5

Check out our latest survey on reward hacking: Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges! 🔥