General / Off-Topic OK this is terrifying...

Felix DiCestria · Feb 22, 2025

When AI Thinks It Will Lose, It Sometimes Cheats, Study Finds

When sensing defeat in a match against a skilled chess bot, they don’t always concede, instead sometimes opting to cheat by hacking their opponent so that the bot automatically forfeits the game.

metatheurgist · Feb 22, 2025

They are based on human behavior. Many examples to learn from in recent times.

Morbad · Feb 25, 2025

It's not that it's learning to 'cheat' from humans, it's that the constraints we're ill defined; o1-preview's 'scratchpad' logs reveal as much.

This is a case where making stupid assumptions leads to expectedly unexpected results. No one is training these models with any deliberate, built-in, fundamental constraints...because that would make them artificially stupid and destroy much of their potential utility (not to mention competitiveness in a highly competitive, multi-trillion dollar field). The better AI gets, the more careful those using it will have to be to make sure they get what they actually want out of it. The potential dangers of leaving room for interpretation have already spawned a century of speculative fiction when it comes to artifical intelligences and go back way further than that in other genres.

General / Off-Topic OK this is terrifying...

Felix DiCestria

metatheurgist

Morbad