Apple’s new AI model ReALM ‘surpasses GPT-4’

TL;DR

  • Apple's ReALM surpasses GPT-4 in reference resolution
  • Study finds the smallest ReALM model performs comparably to GPT-4, while larger models outperform it
  • Expectations for integration into Siri 2.0 ahead of iOS 18 launch

Apple researchers report that the company’s new AI system, ReALM, outperforms OpenAI’s GPT-4 at reference resolution.

The paper, titled “ReALM: Reference Resolution as Language Modeling”, examines the problem of reference resolution. Reference is a linguistic process in which one word in a sentence or discourse refers to another word or entity. The task of working out what these references point to is known as reference resolution.

The researchers state that while large language models (LLMs) are extremely powerful for a variety of tasks, they remain underutilized for reference resolution, particularly for non-conversational entities.

According to the study, ReALM was benchmarked against GPT-3.5 and GPT-4. The smallest ReALM model achieved performance comparable to that of GPT-4, while the larger ReALM models substantially outperformed GPT-4.

Ahead of WWDC 2024 and the anticipated June launch of iOS 18, expectations are high for the debut of an advanced Siri 2.0. Whether ReALM will be integrated into Siri by then remains uncertain.

Apple’s recent ventures into AI have not gone unnoticed, marked by the introduction of new models and tools aimed at enhancing AI efficiency on smaller devices, as well as strategic partnerships. These developments highlight the company’s strategy to place AI at the forefront of its business operations.

The unveiling of ReALM represents Apple’s AI research team’s latest and most targeted initiative to refine and accelerate existing models, driving them toward greater speed, intelligence, and efficiency.

Key features of Apple’s ReALM AI

ReALM reportedly uses a new way of converting on-screen information into text, allowing it to bypass the need for image recognition parameters and enabling more efficient on-device processing.
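To make the idea of turning a screen into text concrete, here is a minimal Python sketch. It is not Apple's implementation: the `ScreenEntity` type, the `screen_to_text` function and the `line_margin` threshold are all hypothetical names chosen for illustration. It assumes each on-screen item has already been extracted with its text and the centre of its bounding box, and it simply lays the items out top to bottom and left to right.

```python
from dataclasses import dataclass


@dataclass
class ScreenEntity:
    """Hypothetical on-screen item: its text and bounding-box centre."""
    text: str
    x: float  # horizontal centre, in pixels
    y: float  # vertical centre, in pixels


def screen_to_text(entities, line_margin=10.0):
    """Render entities as plain text, top to bottom and left to right.

    Entities whose vertical centres lie within `line_margin` of the first
    entity on a line are treated as sharing that line (an assumed heuristic).
    """
    ordered = sorted(entities, key=lambda e: (e.y, e.x))
    lines = []
    current = []
    for entity in ordered:
        # Start a new line once the entity sits clearly below the current one.
        if current and abs(entity.y - current[0].y) > line_margin:
            lines.append(current)
            current = []
        current.append(entity)
    if current:
        lines.append(current)
    # Tabs separate items on one line, newlines separate lines.
    return "\n".join(
        "\t".join(e.text for e in sorted(line, key=lambda e: e.x))
        for line in lines
    )


if __name__ == "__main__":
    screen = [
        ScreenEntity("555-0100", x=200, y=102),
        ScreenEntity("Joe's Pizza", x=20, y=100),
        ScreenEntity("Open until 9pm", x=20, y=140),
    ]
    print(screen_to_text(screen))
```

The resulting string can be passed to a text-only language model, which is the sense in which no image recognition component is needed at that stage.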

It also takes into account what is on the user’s screen and which tasks are running in the background.

As a result, the LLM should enable users to scroll through a website and instruct Siri to call a business. Siri would then be able to ‘see’ the phone number on the website and directly make the call.

Hence ReALM could significantly improve the context-aware capabilities of voice assistants. With its ability to interpret on-screen information and use additional context, the update to Siri could help deliver a more fluid and hands-free user experience.

ReALM could also handle a wide variety of references, including those that are dependent on conversational context, on-screen content, and even background information. This is critical for developing more intuitive and responsive AI systems that can adapt to the complexities of human language and context.
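One way to picture reference resolution framed as a language-modelling task is as a prompt that lists candidate entities and asks which ones the user means. The Python sketch below is illustrative only and does not come from the paper: `EntityKind`, `Candidate`, `build_prompt` and `parse_answer` are hypothetical names, the prompt wording is invented, and the model call itself is left out.

```python
from dataclasses import dataclass
from enum import Enum


class EntityKind(Enum):
    """The three reference types mentioned in the article."""
    CONVERSATIONAL = "conversational"
    ON_SCREEN = "on-screen"
    BACKGROUND = "background"


@dataclass
class Candidate:
    """Hypothetical candidate entity the user might be referring to."""
    kind: EntityKind
    description: str


def build_prompt(request, candidates):
    """Format the candidates and the user request as one text prompt."""
    lines = ["Candidate entities:"]
    for index, candidate in enumerate(candidates, start=1):
        lines.append(f"{index}. [{candidate.kind.value}] {candidate.description}")
    lines.append(f"User request: {request}")
    lines.append("Which entity numbers does the request refer to?")
    return "\n".join(lines)


def parse_answer(answer, candidates):
    """Map a model reply such as '2' or '1, 3' back to candidates."""
    chosen = []
    for token in answer.replace(",", " ").split():
        # Ignore anything that is not a valid 1-based candidate number.
        if token.isdecimal() and 1 <= int(token) <= len(candidates):
            chosen.append(candidates[int(token) - 1])
    return chosen


if __name__ == "__main__":
    candidates = [
        Candidate(EntityKind.CONVERSATIONAL, "pharmacy mentioned earlier"),
        Candidate(EntityKind.ON_SCREEN, "phone number 555-0100"),
        Candidate(EntityKind.BACKGROUND, "alarm currently ringing"),
    ]
    print(build_prompt("Call the business", candidates))
    # A real system would send the prompt to a model; "2" stands in for its reply.
    print(parse_answer("2", candidates))
```

In the website example above, the phone number would be the on-screen candidate, and the assistant would act on whichever entity the model selects.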

The paper reports large improvements over an existing system with similar functionality, with its smallest model achieving absolute gains of over 5% for on-screen references.

Featured image: Canva

Suswati Basu
Tech journalist

Suswati Basu is a multilingual, award-winning editor and the founder of the intersectional literature channel, How To Be Books. She was shortlisted for the Guardian Mary Stott Prize and longlisted for the Guardian International Development Journalism Award. With 18 years of experience in the media industry, Suswati has held significant roles such as head of audience and deputy editor for NationalWorld news, and digital editor for Channel 4 News and ITV News. She has also contributed to the Guardian and received training at the BBC. As an audience, trends, and SEO specialist, she has participated in panel events alongside Google. Her…
