#News

Google Says Gemini 1.5 Pro Can Make AI Robots Smarter

Date: December 07, 2024

Google’s Deepmind is training its robots with Gemini 1.5 Pro, and the results have impressed the researchers beyond any existing AI robot technology.

Google recently released demo videos showcasing the smart capabilities of its AI robots. Google’s dedicated AI research wing, Deepmind, is using Gemini 1.5 Pro to enhance the on-ground smartness of its RT-2 AI robots. The tech giant has been testing the robots for complex task handling against simple text prompt instructions. The Deepminds robotic team has released a research paper on the findings and multiple developments that can revolutionize the progress of AI robots.

The team filmed a video tour of a designated area and shared it with the robot to help it self-learn about its surroundings. A key element that helped the RT-2 AI robot was its longer processing capability using natural language instructions. The robot had a 90% success rate across 50 interactions in a 9000+ square foot operating area.

It was able to perform complex tasks with self-added layers of smartness. For instance, a team member asked the AI robot if it could take him somewhere he could draw. Based on the video tour the robot learned on, it not only identified but also guided the person to the exact spot. The ability of the robot to understand extremely simple and conversational instructions is a breakthrough for on-ground implementation of AI technology.

Researchers also found preliminary evidence that Gemini 1.5 Pro can fulfill conversational instructions that go beyond just navigation. One employee who had multiple empty Coke cans on his desk asked the robot to find out if his favorite drink was available. The AI robot identified the drink through real-time observational learning and went to the refrigerator to check the inventory before answering the question. This added layer of self-awareness to the nuances of instructions is something the Deepmind team is investigating further on priority.

However, though impressive, the robots are still quite slow in processing the information. While the demo videos left out certain details, the research paper revealed that the RT-2 robots took a buffer time of 10-30 seconds before answering or taking action to the instructions. AI apps are achieving advanced milestones, like cracking the UPSC exam in just 7 minutes. With this breakthrough at Google, the possibility of bringing helper AI robots for household and home support will get on the fast lane.

By Arpit Dubey

Arpit is a dreamer, wanderer, and tech nerd who loves to jot down tech musings and updates. With a knack for crafting compelling narratives, Arpit has a sharp specialization in everything: from Predictive Analytics to Game Development, along with artificial intelligence (AI), Cloud Computing, IoT, and let’s not forget SaaS, healthcare, and more. Arpit crafts content that’s as strategic as it is compelling. With a Logician's mind, he is always chasing sunrises and tech advancements while secretly preparing for the robot uprising.

// Recommended

Pinterest Follows Amazon in Layoffs Trend, Shares Fall by 9%

AI-driven restructuring fuels Pinterest layoffs, mirroring Amazon’s strategy, as investors react sharply and question short-term growth and advertising momentum.

Clawdbot Rebrands to "Moltbot" After Anthropic Trademark Pressure: The Viral AI Agent That’s Selling Mac Minis

Clawdbot is now Moltbot. The open-source AI agent was renamed after Anthropic cited trademark concerns regarding its similarity to their Claude models.

Amazon Bungles 'Project Dawn' Layoff Launch With Premature Internal Email Leak

"Project Dawn" leaks trigger widespread panic as an accidental email leaves thousands of Amazon employees bracing for a corporate cull.

OpenAI Launches Prism, an AI-Native Workspace to Shake Up Scientific Research

Prism transforms the scientific workflow by automating LaTeX, citing literature, and turning raw research into publication-ready papers with GPT-5.2 precision.

Have newsworthy information in tech we can share with our community?