US-based Ai2 releases new AI model, claims it beats DeepSeek

Another AI company has stepped up to the plate as DeepSeek’s V3 model goes viral, with Ai2 claiming its newest model outperforms its Chinese competitor.

The open-source post-trained model, Tülu 3 405B, surpasses the performance of DeepSeek V3, according to the American technology company.

“The last member of the Tülu 3 family demonstrates that our recipe, which includes Reinforcement Learning from Verifiable Rewards (RLVR), scales to 405B – with performance on par with GPT-4o, and surpassing prior open-weight post-trained models of the same size including Llama 3.1,” Ai2 said on X.

Ai2 also published benchmark results on the social media site, comparing its model with Llama, Nous Hermes, GPT-4o, and DeepSeek.

This release follows the launch of Tülu 3 in November. The new model aims to demonstrate the scalability and effectiveness of Ai2's post-training recipe when applied at the 405B-parameter scale.

Ai2 makes big claims as it launches new Tülu 3 405B model

In the announcement, the company claims the model “achieves competitive or superior performance to both Deepseek v3 and GPT-4o, while surpassing prior open-weight post-trained models of the same size including Llama 3.1 405B Instruct and Nous Hermes 3 405B on many standard benchmarks.

“Interestingly, we found that our Reinforcement Learning from Verifiable Rewards (RLVR) framework improved the MATH performance more significantly at a larger scale, i.e., 405B compared to 70B and 8B, similar to the findings in the DeepSeek-R1 report.

“Overall, our results show a consistent edge over DeepSeek V3, especially with the inclusion of safety benchmarks.”

Unlike many competing models, Ai2’s new model is open source: all the components needed to replicate it are freely available and permissively licensed.

A spokesperson for Ai2 was quoted by TechCrunch as saying that the lab believes the model “underscores the U.S.’ potential to lead the global development of best-in-class generative AI models.”

Featured Image: Via Ai2 on X

Sophie Atkinson
Tech Journalist