OpenAI Unveils 'Operator' AI for Performing Web Tasks

OpenAI has launched "Operator," an AI agent designed to perform web-based tasks autonomously using its own browser. Currently available as a research preview for Pro users in the United States, the tool aims to automate everyday activities such as filling out forms, ordering groceries, and even creating memes.

Operator represents one of OpenAI's first agents, which are AI systems capable of acting independently to accomplish specific tasks. Users can delegate assignments, such as managing online bookings or restocking household items, freeing up time for other priorities.

"Operator can interact with the web just like a human, using a browser to click, type, and scroll," OpenAI said in a statement. "It broadens the utility of AI, helping people save time on repetitive tasks while opening new engagement opportunities for businesses."

Powered by OpenAI's new Computer-Using Agent (CUA) model, Operator combines GPT-4o's advanced reasoning abilities with visual recognition capabilities to interact with graphical user interfaces (GUIs). The technology allows it to navigate buttons, menus, and text fields without requiring custom APIs.

A Research-Driven Launch

OpenAI emphasized Operator's rollout would be measured and iterative, starting small to refine the technology based on user feedback. "This research preview is crucial to learn from real-world applications and improve the system," OpenAI said. Future plans include expanding access to users on Plus, Team, and Enterprise plans and integrating Operator into the ChatGPT ecosystem.

To address privacy and user control concerns, Operator is designed to transfer tasks back to users whenever sensitive information like login credentials or payment details is needed. Users can also fully customize workflows, adding personalized instructions for specific websites and saving prompts for repeated actions.

A Vision of AI as a Digital Worker

Operator's real-world impact is supported by collaborations with major companies, including DoorDash, Instacart, OpenTable, and Uber. OpenAI is also exploring public sector applications, such as streamlining access to government services through partnerships like its pilot project with the City of Stockton.

The AI has already demonstrated record-breaking performance in WebArena and WebVoyager, two benchmarks measuring browser-use capabilities, the company said. OpenAI remains focused on fine-tuning the agent, learning from early adopters, and paving the way for wider adoption.

About the Author

John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI and future tech. He's been writing about cutting-edge technologies and culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS.  He can be reached at [email protected].

Featured

  • interconnected blocks of data

    Rubrik Intros Immutable Backup for Okta Environments

    Rubrik has announced Okta Recovery, extending its identity resilience platform to Okta with immutable backups and in-place recovery, while separately detailing its integration with Okta Identity Threat Protection for automated remediation.

  • geometric grid of colorful faculty silhouettes using laptops

    Top 3 Faculty Uses of Gen AI

    A new report from Anthropic provides insights into how higher education faculty are using generative AI, both in and out of the classroom.

  • abstract pattern of shapes, arrows and circuit lines

    Internet2 Announces a New President and CEO to Step Up in October

    Internet2, the member-driven nonprofit offering advanced network technology services and cyberinfrastructure to the research and education community has completed its search, which began this past May, for a new president and CEO to take the helm.

  • businessman juggling cubes

    Anthology Restructures, Focuses on Teaching and Learning Business

    Anthology has announced a strategic restructuring, divesting its Enterprise Operations, Lifecycle Engagement, and Student Success businesses and filing for Chapter 11 bankruptcy in an effort to right-size its finances and focus on its core teaching and learning products.