Home Google DeepMind’s Genie 2 turns images into immersive, playable 3D worlds

Google DeepMind’s Genie 2 turns images into immersive, playable 3D worlds

TLDR

  • DeepMind's Genie 2 turns single images into dynamic 3D worlds for up to one minute of exploration.
  • It creates playable games from text prompts, simulating physics, lighting, and NPC behavior.
  • Tested with SIMA AI agent, it executes commands like "Open the blue door" in generated spaces.

DeepMind has unveiled Genie 2, a sophisticated AI system that is said to be able to convert single images into immersive 3D environments. The interactive space will let users explore dynamic, “endless” worlds for up to one minute.

Jack Parker-Holder, a research scientist at DeepMind, introduced the groundbreaking foundation world model on Wednesday (Dec. 4). In a post on X, Parker-Holder wrote: “We believe Genie 2 could unlock the next wave of capabilities for embodied agents.”

What can Google DeepMind Genie 2 do?

According to the company’s blog post, the system can create fully playable games from a single text prompt (“A humanoid robot in Ancient Egypt”), so users can interact through standard inputs such as a keyboard and mouse, whether controlled by humans or AI.

This is similar to the models currently being developed by Fei-Fei Li’s company, World Labs, and the Israeli startup Decart. Genie 2 builds on the foundation of DeepMind’s Genie, which debuted earlier this year.

Users can perform actions such as jumping and swimming using a mouse or keyboard. Trained on video data, the model can accurately simulate object interactions, animations, lighting effects, physics, reflections, and the behavior of non-player characters (NPCs). The system also manages complex lighting, reflections, and smoke effects.

The company also tested Genie 2 alongside its SIMA AI agent, which responds to natural language commands within digital environments. In one test, SIMA successfully travelled through a room generated by Genie 2, executing instructions like “Open the blue door.”

When we started Genie 1 over two years ago, we always imagined a foundation world model will one day be able to generate an endless curriculum for training embodied AGI. Today, we made a big step towards that future.“this is the worst AI will ever be”

Tim Rocktäschel (@rockt.ai) 2024-12-04T16:23:02.312Z

Posting on Bluesky, DeepMind researcher Tim Rocktäschel, said: “When we started Genie 1 over two years ago, we always imagined a foundation world model will one day be able to generate an endless curriculum for training embodied AGI. Today, we made a big step towards that future.”

Could Google face any issues?

As ReadWrite has previously reported, Google has previously been accused of allowing OpenAI to harvest text from YouTube for their AI models, and it is unclear whether the same has happened in this case in terms of video game generation.

At the time, they told us: “Both our robots.txt files and Terms of Service prohibit unauthorized scraping or downloading of YouTube content, and we have a long history of employing technical and legal measures to prevent it. We take action when we have a clear legal or technical basis to do so.”

ReadWrite has reached out to Google for comment.

Featured image: Google DeepMind

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the tech, gambling and blockchain industries for major developments, new product and brand launches, AI breakthroughs, game releases and other newsworthy events. Editors assign relevant stories to in-house staff writers with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Suswati Basu
Tech journalist

Suswati Basu is a multilingual, award-winning editor and the founder of the intersectional literature channel, How To Be Books. She was shortlisted for the Guardian Mary Stott Prize and longlisted for the Guardian International Development Journalism Award. With 18 years of experience in the media industry, Suswati has held significant roles such as head of audience and deputy editor for NationalWorld news, digital editor for Channel 4 News and ITV News. She has also contributed to the Guardian and received training at the BBC As an audience, trends, and SEO specialist, she has participated in panel events alongside Google. Her…

Get the biggest tech headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Tech News

    Explore the latest in tech with our Tech News. We cut through the noise for concise, relevant updates, keeping you informed about the rapidly evolving tech landscape with curated content that separates signal from noise.

    In-Depth Tech Stories

    Explore tech impact in In-Depth Stories. Narrative data journalism offers comprehensive analyses, revealing stories behind data. Understand industry trends for a deeper perspective on tech's intricate relationships with society.

    Expert Reviews

    Empower decisions with Expert Reviews, merging industry expertise and insightful analysis. Delve into tech intricacies, get the best deals, and stay ahead with our trustworthy guide to navigating the ever-changing tech market.