Home Mysterious AI surfaces online — was it ChatGPT’s succesor?

Mysterious AI surfaces online — was it ChatGPT’s succesor?

tl;dr

Copy code

  • A mysterious AI chatbot 'gpt2-chatbot' briefly appeared on LMSYS Chatbot Arena.
  • Speculation suggests it might surpass GPT-4, leading to theories about it being GPT-5 or an advanced GPT-4.
  • OpenAI CEO's tweet hints at its novelty, indicating it's likely an unreleased model.

A mysterious AI chatbot labelled ‘gpt2-chatbot’ was briefly available online before subsequently disappearing again. The chatbot quietly made its debut on the website LMSYS Chatbot Arena — a website which is used to benchmark, compare, and rank different AI systems.

Based on its name, some are speculating that the tool might be an earlier version of OpenAI‘s chatbot language model, GPT-2. But users have noted that the language model seems equally as powerful — or more powerful than — GPT-4, OpenAI’s more recent and advanced language model.

In fact, some netizens found that the language model performed better than GPT-4 on certain tests. This has led to speculation that the “gpt2-chatbot” could be an early prototype of GPT-5, or perhaps a more updated, advanced version of GPT-4 which, for all intents and purposes, can be considered GPT-4.5.

But users who managed to test the model before it was taken offline noted that there was surprisingly little information about what the language model was and where it came from. Still, it wasn’t long until the language model was taken back offline, with LMSYS saying in a tweet: “In line with our policy, we’ve worked with several model developers in the past to offer community access to unreleased models/checkpoints (e.g., mistral-next, gpt2-chatbot) for preview testing.”

The website then went on to add that it had to “temporarily” take down the gpt2-chatbot due to “high traffic and capacity limit.”

Speculation grows over ‘gpt2-chatbot’

Thanks to a subsequent tweet by OpenAI CEO Sam Altman, it seems like the language model is more likely to be something new rather than an earlier model of GPT-2.

Altman wrote: “I do have a soft spot for GPT-2,” before later editing the tweet so that it appeared as “gpt-2.” And adding further fuel to the fire, OpenAI staff member Steven Heidel wrote a tweet saying: “when gpt-2.”

Based on these responses, it seems more likely than not that, as hinted by LMSYS, this is an unreleased model of some kind.

Featured Image: Franz Bachinger from Pixabay

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the tech industry for major developments, new product launches, AI breakthroughs, video game releases and other newsworthy events. Editors assign relevant stories to staff writers or freelance contributors with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Charlotte Colombo
Freelance Journalist

Charlotte Colombo is a freelance journalist with bylines in Metro.co.uk, Radio Times, The Independent, Daily Dot, Glamour, Stylist, and VICE among others. She most recently worked as a Staff Writer for entertainment outlet The Digital Fix for two years and, prior to that, worked with Business Insider and Dexerto on their digital culture desks. She’s also appeared on BBC Radio 5 and The Guardian podcast to share her expertise on technology, influencers, and niche internet subcultures. She holds an MA in Magazine Journalism from City, University of London and has been freelancing for three years. She has a wide range…

Get the biggest tech headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Tech News

    Explore the latest in tech with our Tech News. We cut through the noise for concise, relevant updates, keeping you informed about the rapidly evolving tech landscape with curated content that separates signal from noise.

    In-Depth Tech Stories

    Explore tech impact in In-Depth Stories. Narrative data journalism offers comprehensive analyses, revealing stories behind data. Understand industry trends for a deeper perspective on tech's intricate relationships with society.

    Expert Reviews

    Empower decisions with Expert Reviews, merging industry expertise and insightful analysis. Delve into tech intricacies, get the best deals, and stay ahead with our trustworthy guide to navigating the ever-changing tech market.