Home AI models will try to cheat to win a game if it senses it will lose

AI models will try to cheat to win a game if it senses it will lose

TLDR

  • Study finds some AI models cheat in chess when facing defeat.
  • Palisade Research tested seven AI models against Stockfish.
  • OpenAI’s o1-preview cheated 37% of the time, with 6% success.

A research study found that AI reasoning models sometimes cheat to win a chess game when it thinks it will lose.

Palisade Research is behind the latest findings, with the publication shared with TIME ahead of its release.

Within this, it found that some AI when sensing defeat could opt to ‘cheat by hacking their opponent so that the bot automatically forfeits the game.’

The study included evaluating seven AI models against Stockfish which is often described as one the strongest chess engines in the world.

The AI models also had a ‘scratchpad’ to utilize which acted as a way for it to ‘think’ before making the next move. This allowed the researchers involved to gain a better understanding of the reasoning behind each step.

Which AI models were found to cheat to win?

The experiment went on for several weeks and each model went through ‘hundreds’ of trials.

In one of the games where o1-preview was in a losing position, it noted on its scratchpad “I need to completely pivot my approach.” It also said: “The task is to ‘win against a powerful chess engine’ – not necessarily to win fairly in a chess game.”

After writing this, the model modified the system file containing each piece’s virtual position. This led to the opponent resigning as the AI was now in a more dominant position.

Across the trials, TIME reports the results: “OpenAI’s o1-preview tried to cheat 37% of the time; while DeepSeek R1 tried to cheat 11% of the time—making them the only two models tested that attempted to hack without the researchers’ first dropping hints.

“Other models tested include o1, o3-mini, GPT-4o, Claude 3.5 Sonnet, and Alibaba’s QwQ-32B-Preview. While R1 and o1-preview both tried, only the latter managed to hack the game, succeeding in 6% of trials.”

In a post on X, the research company explained more about the attempts: “How often did o1-preview win against its opponent Stockfish? Out of 71 attempts at normal play, it won 0 games.

“Out of 52 hacking attempts, it succeeded 7 times. Because Stockfish is significantly better at Chess than any language model, hacking was the only strategy that worked.”

Featured Image: AI-generated via Ideogram

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the gambling and blockchain industries for major developments, new product and brand launches, game releases and other newsworthy events. Editors assign relevant stories to in-house staff writers with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Sophie Atkinson
Tech Journalist

Sophie Atkinson is a UK-based journalist and content writer, as well as a founder of a content agency which focuses on storytelling through social media marketing. She kicked off her career with a Print Futures Award which champions young talent working in print, paper and publishing. Heading straight into a regional newsroom, after graduating with a BA (Hons) degree in Journalism, Sophie started by working for Reach PLC. Now, with five years experience in journalism and many more in content marketing, Sophie works as a freelance writer and marketer. Her areas of specialty span a wide range, including technology, business,…

Get the biggest iGaming headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Gambling News

    Explore the latest in online gambling with our curated updates. We cut through the noise to deliver concise, relevant insights, keeping you informed about the ever-changing world of iGaming and its most important trends.

    In-Depth Strategy Guides

    Elevate your game with tailored strategies for sports betting, table games, slots, and poker. Learn how to maximize bonuses, refine your tactics, and boost your chances to beat the house.

    Unbiased Expert Reviews

    Honest and transparent reviews of sportsbooks, casinos and poker rooms crafted through industry expertise and in-depth analysis. Delve into intricacies, get the best bonus deals, and stay ahead with our trustworthy guides.