Date: December 11, 2025

OpenAI has officially released GPT-5.2, a groundbreaking new model series designed specifically to dominate professional knowledge work and complex, long-running agentic tasks. Rolling out today for ChatGPT Plus, Pro, and Enterprise users, as well as developers via the API, GPT-5.2 represents a massive leap forward in general intelligence, coding, and reasoning.

The headline feature of this release is the new GPT-5.2 Thinking model, which OpenAI claims is the first AI to perform at or above a human expert level on well-specified professional tasks.

Breaking the “Expert” Barrier: GDPval Benchmarks

The most significant metric from this release is the model’s performance on GDPval, a benchmark evaluating tasks across 44 distinct occupations (including sales, accounting, and manufacturing).

GPT-5.2 Thinking achieved a 70.9% win rate against industry professionals, a staggering increase from the 38.8% score held by the original GPT-5. According to expert judges, the model produced deliverables—such as complex spreadsheets and strategic presentations—at >11x the speed and less than 1% of the cost of human experts.

Three New Tiers: Instant, Thinking, and Pro

The GPT-5.2 release is categorized into three distinct versions to suit different workflows:

  1. GPT-5.2 Instant: A fast, low-latency “workhorse” model optimized for quick info-seeking, technical writing, and translation.
  2. GPT-5.2 Thinking: The flagship model for deep reasoning. It excels at multi-step projects, reducing hallucinations by 30% compared to GPT-5.1.
  3. GPT-5.2 Pro: The powerhouse for scientific and mathematical research, achieving 93.2% on the graduate-level GPQA Diamond benchmark.

State-of-the-Art Coding and Agentic Capabilities

For developers, GPT-5.2 creates a new paradigm in “agentic” coding—where the AI acts as an autonomous engineer rather than just a chatbot.

  • SWE-Bench Pro: GPT-5.2 Thinking scored 55.6% on this rigorous software engineering benchmark, which tests capabilities across four languages and real-world repositories.
  • SWE-bench Verified: The model reached a new high of 80.0%.
  • Front-End Engineering: Early testers report that GPT-5.2 is significantly better at complex UI work, capable of generating interactive 3D elements (like ocean simulations) from a single prompt.

Unmatched Vision and Long-Context Mastery

Professional workflows often involve analyzing massive documents or interpreting visual data. GPT-5.2 sets a new standard here as well:

  • Long Context: The model achieved near 100% accuracy on the OpenAI MRCRv2 “needle-in-a-haystack” evaluation, even when processing up to 256k tokens.
  • Vision: Error rates on chart reasoning and software interface understanding have been cut in half. The model can now accurately identify components on a motherboard or interpret dashboard screenshots with high spatial awareness.

Availability

GPT-5.2 Instant, Thinking, and Pro are rolling out starting today, December 11, 2025.

  • ChatGPT: Available immediately for users on paid plans (Plus, Team, Enterprise).
  • API: Developers can access the models now for building next-generation applications.

With improvements that span from abstract reasoning (ARC-AGI-2) to practical customer support (Tau2-bench), GPT-5.2 is positioning itself not just as a tool, but as a reliable, expert-level partner for the modern workforce.