Home Nvidia unveils its new NVLM 1.0 AI model, rivaling the likes of OpenAI’s GPT-4

Nvidia unveils its new NVLM 1.0 AI model, rivaling the likes of OpenAI’s GPT-4

TLDR

  • Nvidia launched NVLM 1.0, a powerful AI model with 72 billion parameters.
  • NVLM-D-72B excels in vision-language tasks and improves text accuracy by 4.3 points.
  • The model is open for research, but restricted from commercial use and modifications.

Nvidia has released its powerful open-source artificial intelligence model that could outpace the likes of OpenAI’s GPT-4.

The company’s new NVLM 1.0 family of open-source multimodal large language models (LLMs), with its flagship model, NVLM-D-72B, has around 72 billion parameters.

According to Nvidia’s research team, the new AI model excels in vision-language tasks while maintaining and even improving text-only performance compared to their LLM backbones. In their paper, the researchers state: “We introduce NVLM 1.0, a family of frontier-class multimodal large language models that achieve state-of-the-art results on vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models.”

Unlike some of the other proprietary models where there is a significant decline in text performance over time, the NVLM-D-72B reportedly increased its accuracy by an average of 4.3 points across key text benchmarks.

The LLM was also able to interpret charts and tables, analyze images, understand memes, code software, as well as solve mathematical problems. The model weights are publicly available on Hugging Face and Nvidia says it will eventually release the training code.

What the AI community think of Nvidia’s NVLM model

AI researchers on X have called the release “wild,” and praised its ability to understand visual data. One user wrote: “Wow! Nvidia just published a 72B model with is ~on par with llama 3.1 405B in math and coding evals and also has vision ?”

That said, Nvidia itself has reportedly used open-source resources to develop NVLM 1.0, gaining insights from other AI models and various training data. However, the NVLM-D-72B model is restricted under its licensing terms. It cannot be used for commercial purposes or modified for resale. Essentially, Nvidia is providing the model exclusively for research purposes and for hobbyists eager to test the limits of their high-end graphics cards.

The researchers’ use of the term “open” is therefore quite intentional. Although Nvidia’s findings do provide value, the restrictions on commercial use mean it cannot be considered truly open-source, which would require the freedom to use, modify, and distribute the model without any limitations.

ReadWrite has reached out to Nvidia for comment.

Featured image: Midjourney

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the tech industry for major developments, new product launches, AI breakthroughs, video game releases and other newsworthy events. Editors assign relevant stories to staff writers or freelance contributors with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Suswati Basu
Tech journalist

Suswati Basu is a multilingual, award-winning editor and the founder of the intersectional literature channel, How To Be Books. She was shortlisted for the Guardian Mary Stott Prize and longlisted for the Guardian International Development Journalism Award. With 18 years of experience in the media industry, Suswati has held significant roles such as head of audience and deputy editor for NationalWorld news, digital editor for Channel 4 News and ITV News. She has also contributed to the Guardian and received training at the BBC As an audience, trends, and SEO specialist, she has participated in panel events alongside Google. Her…

Get the biggest tech headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Tech News

    Explore the latest in tech with our Tech News. We cut through the noise for concise, relevant updates, keeping you informed about the rapidly evolving tech landscape with curated content that separates signal from noise.

    In-Depth Tech Stories

    Explore tech impact in In-Depth Stories. Narrative data journalism offers comprehensive analyses, revealing stories behind data. Understand industry trends for a deeper perspective on tech's intricate relationships with society.

    Expert Reviews

    Empower decisions with Expert Reviews, merging industry expertise and insightful analysis. Delve into tech intricacies, get the best deals, and stay ahead with our trustworthy guide to navigating the ever-changing tech market.