Date: January 24, 2025
OpenAI introduced Operator, an AI agent that will perform web-based tasks such as ordering tickets and groceries. Powered by the CUA model, it independently navigates websites to get things done.
OpenAI, the company behind ChatGPT, has introduced "Operator," an AI agent that browses the web on its own to complete tasks for users. From booking flights to ordering groceries, Operator promises a new way of using the internet: carrying out tasks with minimal human intervention.
Operator is powered by OpenAI’s new Computer-Using Agent (CUA) model, which combines the visual processing abilities of GPT-4o with sophisticated reasoning techniques. This allows Operator to navigate websites, fill out forms, click buttons, and scroll through pages - just as a human would.
OpenAI product and engineering lead Yash Kumar said, “It can navigate websites and take actions on websites, much like you and I do.”
Unlike traditional AI tools that rely on APIs to interact with websites, Operator uses screenshots and a virtual browser to visually interpret elements like buttons and text fields.
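The screenshot-and-act approach described above can be pictured as a simple loop: observe the screen, choose the next step, apply it, and repeat. The sketch below is purely illustrative and is not OpenAI's implementation. `FakeBrowser`, `Action`, `choose_action`, and `run_agent` are hypothetical names, the "screenshot" is a plain dictionary rather than pixels, and a hand-written rule stands in for the CUA model.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    kind: str          # "type", "click", or "done"
    target: str = ""   # label of the on-screen element (hypothetical)
    text: str = ""     # text to enter for "type" actions


@dataclass
class FakeBrowser:
    """Stand-in for a virtual browser; a real one would return pixels."""
    fields: dict = field(default_factory=dict)
    submitted: bool = False

    def screenshot(self) -> dict:
        # Assumption: a simple description of the page replaces a real image.
        return {"fields": dict(self.fields), "submitted": self.submitted}

    def apply(self, action: Action) -> None:
        if action.kind == "type":
            self.fields[action.target] = action.text
        elif action.kind == "click" and action.target == "submit":
            self.submitted = True


def choose_action(observation: dict, goal: dict) -> Action:
    """Placeholder for the model: pick the next step from what is visible."""
    for name, value in goal.items():
        if observation["fields"].get(name) != value:
            return Action("type", name, value)
    if not observation["submitted"]:
        return Action("click", "submit")
    return Action("done")


def run_agent(browser: FakeBrowser, goal: dict, max_steps: int = 10) -> list:
    """Observe, decide, act, and repeat until done or out of steps."""
    history = []
    for _ in range(max_steps):
        action = choose_action(browser.screenshot(), goal)
        history.append(action)
        if action.kind == "done":
            break
        browser.apply(action)
    return history


browser = FakeBrowser()
steps = run_agent(browser, {"name": "Ada", "city": "Stockton"})
print([step.kind for step in steps])
```

The `max_steps` cap is a design choice in this sketch: it keeps the loop from running indefinitely if the stand-in model never reports that it is done.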
While Operator represents a major leap forward, OpenAI acknowledges that it is still in its early stages and comes with some limitations. Complex tasks, such as creating slideshows or managing detailed calendar events, remain challenging for the AI. During demos, Operator reportedly achieved an 87% success rate on web-based tasks but struggled with intricate, multi-step processes.
Privacy and security are at the forefront of OpenAI’s rollout strategy. Operator employs a three-layer security system to ensure user control.
Additionally, OpenAI has introduced a "watch mode" and a monitor model to prevent misuse or phishing attacks.
OpenAI is partnering with companies like DoorDash, Instacart, Uber, and OpenTable to help Operator handle real-world tasks such as ordering food and booking reservations. The company is also working with the City of Stockton to simplify access to public services. These collaborations aim to make Operator useful for both businesses and government services while ensuring it meets user needs effectively.
Jamil Niazi, Director of Information Technology at the City of Stockton, emphasized Operator's potential, saying, "As we learn more about Operator, we'll identify ways that AI can make civic engagement easier for our residents."
OpenAI CEO Sam Altman said Operator is the first of many AI agents the company plans to build. Planned next steps include expanding access to more users, integrating Operator into ChatGPT, and offering an API so developers can build their own AI agents.
Operator is currently available exclusively to ChatGPT Pro subscribers in the U.S., with a subscription fee of $200 per month. OpenAI plans to roll it out globally in the future, pending further refinement and feedback.
Despite its potential, Operator still requires human oversight and fine-tuning. However, its debut signals a shift toward AI as an active assistant rather than a passive tool - one that might soon handle everything from your online shopping to your daily scheduling.
By Arpit Dubey