OpenAI Unveils GPT-5.4 with Native Computer Control, Outperforming Humans on OS Tasks

OpenAI has released GPT-5.4, its newest flagship model featuring native computer use capabilities that allow the AI to autonomously operate software, navigate digital environments, and execute complex workflows. This release positions OpenAI at the forefront of the emerging agentic AI movement and marks a significant leap beyond traditional chatbot interfaces.

A New Era of Agentic AI

GPT-5.4 represents OpenAI’s first general-purpose model equipped with the ability to control computers directly. Unlike previous versions that merely provided instructions, this model can now act on them—interpreting screenshots, issuing mouse clicks and keyboard inputs to interact with applications, and managing multi-step processes across different programs.

” This is the core innovation driving GPT-5.4,” OpenAI stated in its announcement. The model can write code for execution, not just generate snippets, and create scripts using frameworks like Playwright to control computer functions autonomously.

Benchmarking Success

OpenAI is showcasing impressive performance metrics to demonstrate GPT-5.4’s capabilities:

OSWorld-Verified: The model achieved a 75.0% success rate, surpassing human performance at 72.4%
WebArena-Verified: Leading positions in computer-use performance tests
APEX-Agents: Top ranking for professional services work

The model also shows significant improvements in accuracy, with individual responses 33% less likely to contain errors compared to GPT-5.2, and an 18% overall reduction in mistakes and hallucinations.

Professional Workflow Revolution

For enterprise users, GPT-5.4 enables AI assistants to manage entire inbox workflows, extract action items from emails, update project management tools, and schedule follow-ups autonomously. Financial analysts could task the AI with gathering data, building comparison models, and generating investor reports without constant human oversight.

This moves AI from a “suggestion engine” to a true “workflow orchestrator”—a vision OpenAI is expanding with its “ChatGPT Agent” concept, where networks of AI agents coordinate across different applications.

Strategic Timing

The release comes at a critical juncture for OpenAI following public scrutiny over a Department of Defense collaboration that reportedly led to losing 1.5 million ChatGPT users. GPT-5.4 is widely seen as a strategic effort to course-correct and win back public trust, particularly by emphasizing unprecedented capabilities and safety measures.

The competitive landscape is intensifying. Anthropic has introduced similar computer control features with Claude Opus 4.5, Microsoft is integrating AI agents into Windows 11, and Google is testing capabilities with Gemini models. This “agentic AI arms race” signals that major AI players believe autonomous systems are the future of artificial intelligence.

Availability

GPT-5.4 is now available across OpenAI platforms: - ChatGPT (Plus, Teams, and Pro subscribers) - Codex (AI coding environment) - OpenAI API

The “GPT-5.4 Thinking” variant, designed for advanced reasoning, offers a unique “mid-response modification” feature for subscribers.