Home OpenAI launches Operator AI agent to perform web-based tasks for you

OpenAI launches Operator AI agent to perform web-based tasks for you

OpenAI has released a research preview of its new Operator AI agent, which is expected to “go to the web to perform tasks for you.” According to a blog post, the tech firm says the agent will be able to interact with its own browser by “typing, clicking, and scrolling.”

The feature is available to ChatGPT Pro tier users in the US, which costs $200 per month. That said, as it is still in research mode, OpenAI acknowledges that there will be limitations and that it will continue to evolve based on user feedback. The company says it’s planning to bring the tool to more users in its Plus, Team, and Enterprise tiers down the line.

During a livestream, CEO Sam Altman stated that “[Operator] will be [in] other countries soon,” but admitted that it would “take a while” for it to be rolled out in Europe.

The initial research preview is available at operator.chatgpt.com, but OpenAI says it’s planning to integrate Operator into all of its ChatGPT apps soon.

OpenAI says it is collaborating with companies like DoorDash, Instacart, OpenTable, Priceline, StubHub, Thumbtack, Uber, and others to ensure the system “addresses real-world needs while respecting established norms.”

How does OpenAI’s Operator work?

OpenAI explains that Operator runs on a new model called the Computer-Using Agent (CUA), which combines GPT-4o’s vision capabilities with advanced reasoning powered by reinforcement learning. It’s trained to interact with graphical user interfaces (GUIs)—the buttons, menus, and text fields you see on a screen.

With Operator, the model can “see” by analyzing screenshots and “interact” using mouse and keyboard actions. This helps it to navigate the web and take actions without relying on custom API integrations.

If it hits a snag or makes a mistake, it uses its reasoning skills to self-correct. And when it really gets stuck, it hands control back to you.

Featured image: OpenAI

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the tech, gambling and blockchain industries for major developments, new product and brand launches, AI breakthroughs, game releases and other newsworthy events. Editors assign relevant stories to in-house staff writers with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Suswati Basu
Tech journalist

Suswati Basu is a multilingual, award-winning editor and the founder of the intersectional literature channel, How To Be Books. She was shortlisted for the Guardian Mary Stott Prize and longlisted for the Guardian International Development Journalism Award. With 18 years of experience in the media industry, Suswati has held significant roles such as head of audience and deputy editor for NationalWorld news, digital editor for Channel 4 News and ITV News. She has also contributed to the Guardian and received training at the BBC As an audience, trends, and SEO specialist, she has participated in panel events alongside Google. Her…

Get the biggest tech headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Tech News

    Explore the latest in tech with our Tech News. We cut through the noise for concise, relevant updates, keeping you informed about the rapidly evolving tech landscape with curated content that separates signal from noise.

    In-Depth Tech Stories

    Explore tech impact in In-Depth Stories. Narrative data journalism offers comprehensive analyses, revealing stories behind data. Understand industry trends for a deeper perspective on tech's intricate relationships with society.

    Expert Reviews

    Empower decisions with Expert Reviews, merging industry expertise and insightful analysis. Delve into tech intricacies, get the best deals, and stay ahead with our trustworthy guide to navigating the ever-changing tech market.