Google DeepMind’s New AI Models Aim to Make Robots Smarter and More Useful
Date: March 13, 2025
Google DeepMind unveils Gemini Robotics, an AI system designed to make robots smarter, more adaptable, and capable of real-world problem-solving.
Google DeepMind is pushing robotics into new territory with the launch of two groundbreaking AI models: Gemini Robotics and Gemini Robotics-ER. These latest models promise to make robots smarter, more versatile, and far better at understanding our world.
Built on Google’s advanced Gemini 2.0 platform, Gemini Robotics is a vision-language-action (VLA) model that integrates sight, language processing, and physical action into a single system. Unlike traditional AI confined to digital outputs like text or images, this model enables robots to interpret natural language commands and execute complex tasks in real-world environments.
Meanwhile, Gemini Robotics-ER (Embodied Reasoning) adds advanced spatial awareness, allowing robots to navigate and reason about their surroundings with greater precision.
“We’ve been able to bring the world-understanding—the general-concept understanding—of Gemini 2.0 to robotics,” said Kanishka Rao, a robotics researcher at Google DeepMind who spearheaded the project. In a press briefing, Rao highlighted the models’ ability to control various robots across hundreds of scenarios, even those not included in their training data.
Demonstrations showcased the technology’s potential. Robots equipped with Gemini Robotics performed tasks like folding origami, packing snacks into a Ziploc bag, and plugging devices into power strips—all in response to spoken instructions. One video featured an Apptronik-developed humanoid robot, Apollo, rearranging letters on a tabletop while conversing with a human operator. The system’s adaptability shone when objects slipped or environments changed, with robots quickly recalibrating to complete their tasks.
What’s the Big Deal?
The implications are vast. Google DeepMind is partnering with Apptronik, a Texas-based robotics firm, to integrate Gemini 2.0 into next-generation humanoid robots. “We’re excited to explore how these models can push the boundaries of what robots can achieve,” Rao added. The company is also collaborating with select testers (Agile Robots, Agility Robots, Boston Dynamics, and Enchanted Tools) to refine Gemini Robotics-ER, signaling a cautious but ambitious rollout.
Carolina Parada, head of robotics at Google DeepMind, emphasized the models’ advancements in three key areas: generality, interactivity, and dexterity. “This enables us to build robots that are more capable, more responsive, and more robust to changes in their environment,” she said during the briefing. Parada noted that unlike humans, these robots don’t learn on the fly—yet—but the foundation is being laid for future breakthroughs.
Balancing Innovation with Safety
Safety remains a priority amid such rapid progress. Google DeepMind introduced ASIMOV, a new benchmark named after sci-fi author Isaac Asimov, to assess risks in AI-powered robotics. The tool evaluates whether a robot’s actions could lead to unintended consequences, such as grasping an object dangerously close to a human. “We’re building this technology with safety top of mind,” Parada said, acknowledging that commercialization is still years away.
By Arpit Dubey
Arpit is a dreamer, wanderer, and tech nerd who loves to jot down tech musings and updates. With a knack for crafting compelling narratives, Arpit has a sharp specialization in everything: from Predictive Analytics to Game Development, along with artificial intelligence (AI), Cloud Computing, IoT, and let’s not forget SaaS, healthcare, and more. Arpit crafts content that’s as strategic as it is compelling. With a Logician's mind, he is always chasing sunrises and tech advancements while secretly preparing for the robot uprising.
// Recommended
Pinterest Follows Amazon in Layoffs Trend, Shares Fall by 9%
AI-driven restructuring fuels Pinterest layoffs, mirroring Amazon’s strategy, as investors react sharply and question short-term growth and advertising momentum.
Clawdbot Rebrands to "Moltbot" After Anthropic Trademark Pressure: The Viral AI Agent That’s Selling Mac Minis
Clawdbot is now Moltbot. The open-source AI agent was renamed after Anthropic cited trademark concerns regarding its similarity to their Claude models.
Amazon Bungles 'Project Dawn' Layoff Launch With Premature Internal Email Leak
"Project Dawn" leaks trigger widespread panic as an accidental email leaves thousands of Amazon employees bracing for a corporate cull.
OpenAI Launches Prism, an AI-Native Workspace to Shake Up Scientific Research
Prism transforms the scientific workflow by automating LaTeX, citing literature, and turning raw research into publication-ready papers with GPT-5.2 precision.
Have newsworthy information in tech we can share with our community?
