Home DeepMind claims its AI outperforms Olympiad gold medalists in solving maths problems

DeepMind claims its AI outperforms Olympiad gold medalists in solving maths problems

TLDR

  • DeepMind's AlphaGeometry2 outperformed Olympiad gold medalists in solving geometry problems.
  • It solved 42 out of 50 problems using Google's Gemini model and a symbolic reasoning engine.
  • Trained on 300M theorems, it improves AI’s logical reasoning and reduces errors in solutions.

Google DeepMind has unveiled an upgraded version of its math-focused AI, which has now surpassed an average gold medalist in solving Olympiad geometry problems. Only a year ago, AlphaGeometry, an AI problem solver from Google’s research team, matched the performance of silver medalists in the International Mathematical Olympiad (IMO), a prestigious competition known for its incredibly tough math problems designed for gifted high school students.

AlphaGeometry2 has now reportedly significantly improved on its predecessor, solving around 84% of equations. It combines Google’s Gemini model with a “symbolic engine,” helping it to tackle complex geometry problems that require deep deductive reasoning and rigorous proofs.

Consequently, the researchers say it managed to solve 42 out of 50 IMO problems, beating the average gold medalist score of 40.9. That’s a leap from its previous iteration, which only solved 54% of the problems.

To get there, the model trained on a massive dataset of over 300 million synthetic theorems and proofs, each increasing in difficulty. The upgraded training set is larger and more diverse than what the first AlphaGeometry used.

Why Google DeepMind is working on Olympiad math problems

So, why is DeepMind so interested in a high-school-level math competition? They believe that cracking tough geometry problems, especially Euclidean ones, could be a big step toward building more advanced AI.

Proving mathematical theorems, like the Pythagorean theorem, isn’t just about getting the right answer – it’s about logical reasoning and choosing the right steps to reach a solution. If DeepMind is right, these skills might be a key ingredient for future general-purpose AI models, helping them think more like humans when tackling complex problems.

As a result, AlphaGeometry is built from a mix of powerful components, including a specialized language model and a neuro-symbolic system. Unlike typical neural networks that learn from massive datasets, this system has abstract reasoning directly coded in by humans.

To make sure it stays precise, the team trained the language model to speak a formal mathematical language. This allows it to self-check its own logic, making sure it provides accurate evidence and reducing the risk of hallucinations—the kind of false or nonsensical statements that AI chatbots sometimes generate.

Featured image: Canva / Google DeepMind

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the gambling and blockchain industries for major developments, new product and brand launches, game releases and other newsworthy events. Editors assign relevant stories to in-house staff writers with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Suswati Basu
News Editor

Suswati Basu is a multilingual, award-winning editor and the founder of the intersectional literature channel, How To Be Books. She was shortlisted for the Guardian Mary Stott Prize and longlisted for the Guardian International Development Journalism Award. With 18 years of experience in the media industry, Suswati has held significant roles such as head of audience and deputy editor for NationalWorld news, digital editor for Channel 4 News and ITV News. She has also contributed to the Guardian and received training at the BBC. As an audience, trends, and SEO specialist, she has participated in panel events alongside Google. Her…