It’s a wild time in AI right now, and Google DeepMind CEO Demis Hassabis recently shared some striking perspectives on how fast things are moving over there. They’re releasing new tech almost every day, from Gemini 3’s impressive reception to cutting-edge initiatives like their “Deep Think” reasoning systems and the “Game Arena” for AI benchmarks.
Genie 3 and building a world model that truly understands physics
What really grabbed my attention was the concept behind Genie 3. This is not just another generative AI model; it’s designed to build what they call a world model, one that grasps the physical workings of the world, like liquids flowing from a tap or reflections in a mirror, and then generates hyper-consistent virtual environments. The truly mind-blowing part? If you look away and come back, the world remains as you left it.

This speaks volumes about the depth of understanding embedded within Genie 3, moving beyond mere language generation to modeling the spatiotemporal dynamics of reality. Such a world model is critical for robotics, interactive assistants, and eventually an AI that operates seamlessly across real and virtual spaces.
We want to build what we call a world model – a model that actually understands the physics of the world.
It highlights a push to unite perception, physics, and reasoning into one coherent system that can help us understand both the virtual and actual worlds better.
From AlphaZero to thinking models: why reasoning matters so much
DeepMind’s roots in game-playing AIs like AlphaZero are well known, and it turns out their current work on “thinking models” draws deeply on that heritage. These models don’t just spit out an answer, they simulate multiple thought processes in parallel and refine their plans before acting. This capability is essential for progressing toward artificial general intelligence (AGI).
Once you have thinking, you can do deep thinking or extremely deep thinking… parallel planning, then collapse onto the best one.
One key insight is that simply scaling up language models or raw output no longer cuts it. You need models that step back, reason, analyze, and revise internally – much like how humans mull over a problem rather than jumping to the first solution.
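The “parallel planning, then collapse onto the best one” idea can be illustrated with a best-of-N sketch. Everything here is a stand-in: `generate_candidate` would really be a model sampling a full reasoning trace, and `score_candidate` a verifier or reward model; the names and the toy scoring are my own assumptions, not DeepMind’s implementation.

```python
import random

def generate_candidate(problem, rng):
    """Stand-in for one independent 'thought process'.
    A real system would sample a reasoning trace from a model;
    here we just draw a noisy candidate answer."""
    return rng.gauss(problem["target"], 2.0)

def score_candidate(problem, answer):
    """Stand-in verifier: higher is better (negative distance to target)."""
    return -abs(answer - problem["target"])

def think(problem, n_candidates=16, seed=0):
    """Best-of-N 'parallel planning': sample many candidate plans,
    score each one, then collapse onto the best."""
    rng = random.Random(seed)
    candidates = [generate_candidate(problem, rng) for _ in range(n_candidates)]
    return max(candidates, key=lambda a: score_candidate(problem, a))

problem = {"target": 42.0}
best = think(problem, n_candidates=64)
```

The point of the sketch is that extra "thinking" budget (a larger `n_candidates`) can only improve the selected plan, which is one simple way deeper thinking buys better answers.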

This explains why DeepMind’s thinking systems excel in complex domains like math competitions (they’ve even achieved gold-medal performance at the International Mathematical Olympiad) and coding, while still stumbling on simpler logic puzzles. It paints a picture of AI systems with a jagged intelligence profile: brilliant in some realms, still fumbling in others.
Game Arena: Why challenging AI with games matters more than ever
In the midst of all this progress, something struck me as very insightful: despite their leaps, these AI systems often struggle with simple games or with strict rule-following, as in chess. This is where the newly announced Game Arena partnership with Kaggle comes in.
Game Arena pits AI models against each other in a variety of games, with automatic adjustment of difficulty based on model performance. This dynamic benchmarking addresses a big challenge in AI evaluation: traditional benchmarks are saturating, and we need harder, more varied tests that also touch on areas like physical reasoning and safety.
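How do head-to-head games turn into “meaningful scores”? A standard answer is an Elo-style rating updated after each pairwise match. The sketch below shows the classic Elo update; I’m using it as a plausible illustration of game-based leaderboards in general, not as Game Arena’s actual scoring method.

```python
def expected_score(r_a, r_b):
    """Probability that player A beats player B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))

def update_elo(r_a, r_b, score_a, k=32.0):
    """Update both ratings after one game.
    score_a is 1.0 for a win by A, 0.5 for a draw, 0.0 for a loss."""
    e_a = expected_score(r_a, r_b)
    r_a_new = r_a + k * (score_a - e_a)
    r_b_new = r_b + k * ((1.0 - score_a) - (1.0 - e_a))
    return r_a_new, r_b_new

# Two models start level; model_a wins one game.
ratings = {"model_a": 1500.0, "model_b": 1500.0}
ratings["model_a"], ratings["model_b"] = update_elo(
    ratings["model_a"], ratings["model_b"], 1.0
)
```

A nice property of this scheme is that it scales with capability automatically: as models improve, their ratings rise and they get matched against stronger opponents, so the benchmark never saturates the way a fixed test set does.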

This approach also recalls DeepMind’s early successes by framing games as clean, objective tests of intelligence – meaningful scores, less bias, and continual progress tracking. I found it exciting that eventually these AI systems might even invent new games and challenge each other to learn them, pushing their learning capabilities to fresh frontiers.
Game Arena is exciting because games are clean, objective testing grounds that automatically scale with model capability
Key takeaways: what deep learning builders and AI enthusiasts should note
- World models like Genie 3 represent a leap beyond language AI: modeling physical and temporal consistency is crucial for next-level AI applications including robotics and virtual assistants.
- Thinking models that internally plan and refine are essential: raw output generation won’t suffice for truly robust AI capable of complex reasoning and problem solving.
- Evaluation through dynamic, game-based benchmarks is the way forward: new challenges like the Game Arena will better test diverse AI capabilities as we approach AGI.
- Tool use is a powerful new dimension in AI scaling: the ability for models to use external tools like physics simulators or math programs during thinking drastically extends their competence.
- AI capabilities are still uneven: shining in complex tasks yet faltering on simple logical ones, highlighting the path ahead in improving consistency and reasoning.
- Building AI-powered products today requires anticipating rapid tech improvements: products should be designed to seamlessly plug in newer models updated every few months.
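The tool-use point above can be made concrete with a minimal agent loop: the model reasons, optionally requests an external tool mid-“thought”, folds the result back in, and only then commits to an answer. The `model_step` stub and the whitelisted `calculator` are hypothetical stand-ins for a real thinking model and a real tool such as a physics simulator or math program.

```python
import re

def calculator(expression):
    """Toy external tool: evaluate a basic arithmetic expression."""
    if not re.fullmatch(r"[0-9+\-*/(). ]+", expression):
        raise ValueError("unsupported expression")
    return eval(expression)  # acceptable here: input is whitelisted above

def model_step(question, tool_result=None):
    """Stand-in for one reasoning step of the model.
    A real thinking model decides when to call a tool; this stub
    always requests the calculator once, then answers."""
    if tool_result is None:
        return {"tool": "calculator", "input": question}
    return {"answer": tool_result}

def solve_with_tools(question, tools):
    """Minimal tool-use loop: let the model call tools during
    'thinking' until it commits to a final answer."""
    result = None
    for _ in range(8):  # cap the tool-call budget
        step = model_step(question, result)
        if "answer" in step:
            return step["answer"]
        result = tools[step["tool"]](step["input"])
    raise RuntimeError("no answer within tool-call budget")

answer = solve_with_tools("17 * 24", {"calculator": calculator})
```

The design choice worth noting is the loop with a budget cap: tool use extends what the model can do, but the harness, not the model, bounds how long the thinking can run.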
Reflecting on these insights, it’s clear we’re witnessing an extraordinary evolution in AI. The convergence of complex world modeling, advanced reasoning, and dynamic evaluation marks a new phase in creating systems that can truly understand and interact with the world like never before. As DeepMind’s journey shows, it’s not just about bigger models, but smarter, more grounded ones that bring us closer to AGI.
We’re starting to see convergence of models into what we call an omni model, which can do everything.
For those of us fascinated by AI’s future, keeping an eye on developments like Genie 3, thinking models, and innovative benchmarks like Game Arena is a must. They reveal not only how powerful AI is becoming but also where the toughest challenges lie – and that makes for one exciting adventure ahead.



