Home OpenAI shares four major updates at San Francisco DevDay

OpenAI shares four major updates at San Francisco DevDay

TLDR

  • OpenAI announced four updates: Realtime API, Prompt Caching, Model Distillation, and Fine-tuning.
  • Realtime API offers fast speech-to-speech abilities; Prompt Caching cuts costs and latency.
  • Vision fine-tuning lets developers fine-tune GPT-4o with images, available to paid users.

The OpenAI platform is getting some major changes as the company announced four updates at its ‘DevDay’ in San Francisco on Tuesday (October 1).

This includes ‘Realtime API,’ ‘Prompt Caching,’ ‘Model Distillation,’ and support for fine-tuning.

The team made it so the new additions would be visible and able to be used from the announcement date, with some to undergo further tweaking once feedback has been collected.

OpenAI’s four updates announced

‘Realtime API’

One of the most significant OpenAI updates is ‘Realtime API’ which boasts abilities for developers to “build fast speech-to-speech experiences into their applications.”

The public beta of the API has been launched which has been dubbed as being similar to ChatGPT’s Advanced Voice Model. It will enable “all paid developers to build low-latency, multimodal experiences in their apps.”

Audio input and output in the Chat Completions API have been introduced to support the use cases that don’t require the low-latency benefits of the Realtime API. This means that developers can now pass any text or audio inputs into GPT-4o and the model will respond with their choice of text, audio, or both.

Previously, creating a similar voice assistant experience would have required several steps including using another model to make it happen.

‘Prompt Caching’

To further help those building AI applications, ‘Prompt Caching’ has been announced which will reduce costs and latency. “By using recently seen input tokens, developers can get a 50% discount and faster prompt processing times,” writes OpenAI in a company-wide news release.

It has been automatically applied on the latest versions of GPT-4o, GPT-4o mini, o1 preview and o1-mini, as well as the fine-tuned versions of the models.

‘Model Distillation’

The new ‘Model Distillation’ aims to provide an integrated workflow that can help manage the entire distillation pipeline directly within the OpenAI platform.

“This lets developers easily use the outputs of frontier models like o1-preview and GPT-4o to fine-tune and improve the performance of more cost-efficient models like GPT-4o mini.”

Before this introduction, distillation required multiple manual steps whereas this new feature should be much easier and quicker.

The full suite includes Stored Completions, Evals, and Fine-tuning, all of which were made available on Tuesday.

‘Vision fine-tuning’ is the fourth OpenAI update

OpenAI implemented fine-tuning on GPT-4o previously which has been used by “hundreds of thousands of developers,” but the team says its new ‘vision fine-tuning’ update will now make it possible to fine-tune with images, as well as text.

This image version works in a similar way to what has been seen with text, with developers able to prepare their image datasets to follow the proper format and then upload this to the platform.

This will only be usable for those on the paid usage tiers and is supported on the latest GPT-4o model snapshot.

Featured Image: Via

That’s a wrap for DevDay SF! We can’t wait to see what you build with these new capabilities. London and Singapore, see you soon. 🇬🇧🇸🇬https://t.co/VI8UNJPbmH

— OpenAI Developers (@OpenAIDevs) October 1, 2024">OpenAIDevs X post

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the tech industry for major developments, new product launches, AI breakthroughs, video game releases and other newsworthy events. Editors assign relevant stories to staff writers or freelance contributors with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Sophie Atkinson
Tech Journalist

Sophie Atkinson is a UK-based journalist and content writer, as well as a founder of a content agency which focuses on storytelling through social media marketing. She kicked off her career with a Print Futures Award which champions young talent working in print, paper and publishing. Heading straight into a regional newsroom, after graduating with a BA (Hons) degree in Journalism, Sophie started by working for Reach PLC. Now, with five years experience in journalism and many more in content marketing, Sophie works as a freelance writer and marketer. Her areas of specialty span a wide range, including technology, business,…

Get the biggest tech headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Tech News

    Explore the latest in tech with our Tech News. We cut through the noise for concise, relevant updates, keeping you informed about the rapidly evolving tech landscape with curated content that separates signal from noise.

    In-Depth Tech Stories

    Explore tech impact in In-Depth Stories. Narrative data journalism offers comprehensive analyses, revealing stories behind data. Understand industry trends for a deeper perspective on tech's intricate relationships with society.

    Expert Reviews

    Empower decisions with Expert Reviews, merging industry expertise and insightful analysis. Delve into tech intricacies, get the best deals, and stay ahead with our trustworthy guide to navigating the ever-changing tech market.