Google DeepMind takes AI closer to human capacity in complex math

TL;DR

  • Google DeepMind's AI systems, AlphaProof and AlphaGeometry 2, tackled the International Mathematical Olympiad but fell short of gold, scoring 28/42.
  • The AI systems either solved questions perfectly or failed completely, demonstrating the challenge of matching top human mathematicians.
  • Unlike humans, the AI had no time limits, with some problems taking up to three days to solve, highlighting differences in approach and capability.

Google DeepMind has taken a big step toward bringing artificial intelligence (AI) in line with the human ability to solve complex mathematical problems.

Researchers paired two new systems, AlphaProof and AlphaGeometry 2, and tasked them with questions from the International Mathematical Olympiad. The global maths contest for advanced high school students has run since 1959 and consists of six extremely difficult questions each year. Topics include algebra and geometry, and a gold medal places winners among the best and brightest young mathematicians in the world.

While the results from the AI systems were impressive, they weren’t quite at the standard of the most intelligent humans at this level, at least not yet. The Google DeepMind ‘team’ scored 28 of the 42 points available, one short of the number required for gold, and had to settle for silver.

Unlike typical human performance, the answers submitted by DeepMind’s AlphaProof and AlphaGeometry 2 were all or nothing. The AI solved four questions with precision, earning top marks, but scored nothing on the other two, where the technology could not even begin to work out an answer.
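
The arithmetic behind these figures can be sketched in a few lines. This is only an illustration: it assumes each of the six problems is marked out of seven points (which matches the 42-point total but is not stated in the article), it infers the gold cutoff from the "one short" remark, and the variable names and the order of the marks are arbitrary.

```python
# Assumption: each of the six Olympiad problems is marked out of 7 points,
# which is consistent with the 42-point total quoted in the article.
POINTS_PER_PROBLEM = 7
NUM_PROBLEMS = 6

# Marks as described in the article: four perfect solutions, two with no marks.
# The order of the entries is arbitrary and does not reflect problem numbers.
ai_marks = [7, 7, 7, 7, 0, 0]

max_score = POINTS_PER_PROBLEM * NUM_PROBLEMS
ai_score = sum(ai_marks)

# Assumption: the gold cutoff is inferred from the score being "one short".
gold_cutoff = ai_score + 1

print(f"AI score: {ai_score} / {max_score}")
print(f"Gold cutoff: {gold_cutoff}, shortfall: {gold_cutoff - ai_score}")
```

Because every solved problem earned full marks, the total can only move in steps of seven, so a single partially solved problem would have been enough to change the medal.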

Building a bridge between spheres

Another key point to note is that the DeepMind experiment effectively had no time limits. Some questions were answered in seconds, while others took up to three days of round-the-clock computation. By contrast, human competitors in the Olympiad have a maximum of nine hours to complete the test.

The two AI systems paired by researchers are said to be very different. AlphaProof, which answered three of the questions, works by pairing a large language model (as used in chatbots) with a specialist “reinforcement learning” technique. AlphaGeometry 2 pairs an LLM with a focused, mathematically inclined approach.

Thomas Hubert, lead researcher on AlphaProof, stated, “What we try to do is to build a bridge between these two spheres so that we can take advantage of the guarantees that come with formal mathematics and the data that is available in informal mathematics.”

 

Graeme Hanna
Tech Journalist

