Google just launched Gemini Omni, their new AI world model that can “create anything from any input,” starting with video 😳
Think Nano Banana, but for motion.
→ Text, image, video, and audio go in.
→ Short video, synced audio, effects, and conversational edits come out.
OpenAI had to shut down Sora because AI video was too expensive to run at scale.
Google can run this forever.
Not because video generation is cheap. Because Google owns the stack around it:
↳ Gemini for chat-based creation
↳ Flow for AI filmmaking
↳ YouTube for distribution
↳ TPUs for compute
↳ DeepMind for model research
↳ Search + ads for monetization
↳ Android + Chrome for reach
Most importantly, Gemini Omni is also teaching Google’s AI to simulate scenes, physics, camera movement, audio, characters, and edits through conversation.
And that’s what “AI world model” actually means in consumer form.
The ironic part?
Google may end up building the metaverse as a side project.
Not as a place you enter, something that Zuck imagined.
Google’s metaverse might be a feed you generate.
Sundar Pichai is playing 3D chess.
P.S. check out How to Turn ChatGPT Images 2.0 & Claude Design Into Your Chief Designer : https://lnkd.in/dcExymFM
Think Nano Banana, but for motion.
→ Text, image, video, and audio go in.
→ Short video, synced audio, effects, and conversational edits come out.
OpenAI had to shut down Sora because AI video was too expensive to run at scale.
Google can run this forever.
Not because video generation is cheap. Because Google owns the stack around it:
↳ Gemini for chat-based creation
↳ Flow for AI filmmaking
↳ YouTube for distribution
↳ TPUs for compute
↳ DeepMind for model research
↳ Search + ads for monetization
↳ Android + Chrome for reach
Most importantly, Gemini Omni is also teaching Google’s AI to simulate scenes, physics, camera movement, audio, characters, and edits through conversation.
And that’s what “AI world model” actually means in consumer form.
The ironic part?
Google may end up building the metaverse as a side project.
Not as a place you enter, something that Zuck imagined.
Google’s metaverse might be a feed you generate.
Sundar Pichai is playing 3D chess.
P.S. check out How to Turn ChatGPT Images 2.0 & Claude Design Into Your Chief Designer : https://lnkd.in/dcExymFM