Home Google’s Gemini-Exp-1206 beats OpenAI to top AI leaderboard

Google’s Gemini-Exp-1206 beats OpenAI to top AI leaderboard

TLDR

  • Google's Gemini-Exp-1206 topped Chatbot Arena, surpassing OpenAI's ChatGPT-40.
  • The model excels in math, writing, visuals, and processes video with a 2M token window.
  • Google offers Gemini-Exp-1206 for free, challenging OpenAI's paid advanced services.

Google has reclaimed the top spot in a valued AI benchmark table, knocking OpenAI into second place. 

On the respected Chatbot Arena leaderboard, the Alphabet-owned company has assumed the lead with the introduction of its Gemini-Exp-1206 experimental model. Previously, Sam Altman’s company was in pole position with its ChatGPT-4o offering, just shading Gemini-Exp-114 which was released on November 15. 

Those competing LLMs were effectively matched, with Google appearing to close the gap on its nearest competitor. 

Chatbot Arena reported the new Google Gemini version showed significant improvement across important categories including mathematics, creative writing, and visuals, with a 40-point improvement on previous offerings. Despite this, Tech Crunch has outlined how the current AI benchmarking approach could vastly oversimplify model evaluation.

Google trumps OpenAI with free-to-use model

That is a separate issue to contend with, and Google will not be worrying too much at present, with the impressive credentials of Gemini-Exp-1206 now available. OpenAI has been a market leader in advanced AI models for some time, but Google has its rival firmly in its line of vision.

The free-to-use Exp-1206 can process and make sense of video content unlike key competitors ChatGPT and Claude, which are limited to images. Google’s model possesses a resourceful 2M token context window, meaning it can run through more than one hour of video content.

Google has undercut its main opponent by offering Gemini-exp-1206 for free via Google AI studio and the Gemini API, while OpenAI moved to increase the price of its top-tier service

This truly matters as users could save $200 for a product that is essentially on the same level. This performance at no extra cost will make the market sit up and take notice, as well as kick open the doors for AI accessibility.

Image credit: Via Midjourney

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the tech, gambling and blockchain industries for major developments, new product and brand launches, AI breakthroughs, game releases and other newsworthy events. Editors assign relevant stories to in-house staff writers with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Graeme Hanna
Tech Journalist

Graeme Hanna is a full-time, freelance writer with significant experience in online news as well as content writing. Since January 2021, he has contributed as a football and news writer for several mainstream UK titles including The Glasgow Times, Rangers Review, Manchester Evening News, MyLondon, Give Me Sport, and the Belfast News Letter. Graeme has worked across several briefs including news and feature writing in addition to other significant work experience in professional services. Now a contributing news writer at ReadWrite.com, he is involved with pitching relevant content for publication as well as writing engaging tech news stories.

Get the biggest tech headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Tech News

    Explore the latest in tech with our Tech News. We cut through the noise for concise, relevant updates, keeping you informed about the rapidly evolving tech landscape with curated content that separates signal from noise.

    In-Depth Tech Stories

    Explore tech impact in In-Depth Stories. Narrative data journalism offers comprehensive analyses, revealing stories behind data. Understand industry trends for a deeper perspective on tech's intricate relationships with society.

    Expert Reviews

    Empower decisions with Expert Reviews, merging industry expertise and insightful analysis. Delve into tech intricacies, get the best deals, and stay ahead with our trustworthy guide to navigating the ever-changing tech market.