OpenAI's GPT Image 2 Leaked: Near-Perfect Text, No More Yellow Tint

OpenAI's next-generation image model, GPT Image 2, briefly surfaced on LM Arena on April 4, 2026, under three mysterious codenames before disappearing within hours. The leaked models showed marked gains in text accuracy, color rendering, and photorealism, which points to a complete architectural overhaul. With DALL-E shutting down on May 12, 2026, the leak signals that OpenAI has a replacement ready to launch.

What Makes GPT Image 2 Different From GPT Image 1.5?

The three anonymous models, codenamed maskingtape-alpha, gaffertape-alpha, and packingtape-alpha, were tested extensively by community members, including developer Pieter Levels and venture investor Justine Moore, before being pulled from the platform. The episode mirrors a pattern from December 2025, when two anonymous models appeared and vanished, and OpenAI shipped GPT Image 1.5 weeks later. The tape models follow the same playbook, suggesting an imminent release.

Testers documented several capabilities that represent a meaningful jump over the current generation. The improvements span multiple dimensions of image quality and usability:

  • Text Rendering: Achieved near-perfect accuracy on text in images, with readable signs, product labels, code snippets, UI mockups, and even handwritten medical notes with convincing penmanship. Comic book panels displayed readable speech bubbles, and watch faces showed correctly positioned hands matching the time described in prompts.
  • Color Accuracy: Eliminated the warm yellow tint that plagued GPT Image 1.5 and that photographers and designers consistently complained about, producing images with natural, neutral color rendering.
  • Photorealism: Generated portraits described as "indistinguishable from real photographs," with complex beach scenes showing accurate hand anatomy and realistic sunglass reflections, a noticeable jump from occasional uncanny-valley artifacts in GPT Image 1.5.
  • World Knowledge: Demonstrated understanding of real-world details, generating IKEA storefronts with architectural accuracy, YouTube and Windows interfaces realistic enough to pass as screenshots, and geographically accurate world maps with proper labels and topographic shading.
  • Aspect Ratio Support: Confirmed support for 16:9 widescreen format, expanding beyond GPT Image 1.5's three aspect ratios (1:1, 3:2, and 2:3), making the model far more useful for presentations, thumbnails, and video content.

GPT Image 1.5 achieved roughly 90 to 95 percent accuracy on text in images, which sounds adequate until users see the misspelled signs and garbled labels in its outputs. The leaked models produced text that testers described as "near-perfect" and "finally usable," addressing what has been the biggest weakness of every AI image generator to date.
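
To see why 90 to 95 percent can still look bad in practice, here is a rough sketch in Python. It assumes the figure is a per-character accuracy and that errors are independent; the article does not say how the accuracy was measured, so both are illustrative assumptions, and the function name is hypothetical.

```python
def clean_text_probability(per_char_accuracy: float, length: int) -> float:
    """Chance that every one of `length` characters renders correctly.

    Assumes independent per-character errors, which is a simplification
    and not something reported in the leak.
    """
    return per_char_accuracy ** length


# A 20-character sign at several assumed per-character accuracies.
for accuracy in (0.90, 0.95, 0.99):
    p = clean_text_probability(accuracy, 20)
    print(f"{accuracy:.0%} per character: {p:.1%} chance of a flawless sign")
```

Under these assumptions, a 20-character sign comes out fully correct only about 12 percent of the time at 90 percent accuracy and about 36 percent of the time at 95 percent, which is consistent with testers seeing frequent misspellings.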

How Does GPT Image 2's Architecture Differ From Previous Models?

One of the most significant technical details from the leak is that GPT Image 2 appears to be built on an entirely new architecture. Previous GPT Image models were built on top of GPT-4o's image pipeline, but GPT Image 2 is reportedly a standalone model. Multiple sources report a shift from a two-stage inference process to single-pass inference, which would explain both the quality improvement and the expected generation time of under three seconds per image, down from eight to twelve seconds.
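
As a quick check on what those timings imply, the short Python sketch below computes the speedup range from the reported figures. The function name is illustrative, and because the new time is described only as "under three seconds," the results are lower bounds.

```python
def speedup_range(old_min_s: float, old_max_s: float, new_max_s: float) -> tuple[float, float]:
    """Return the (lowest, highest) speedup implied by the reported timings."""
    return old_min_s / new_max_s, old_max_s / new_max_s


low, high = speedup_range(old_min_s=8.0, old_max_s=12.0, new_max_s=3.0)
print(f"At least {low:.1f}x to {high:.1f}x faster per image")
```

With the reported numbers, that works out to at least roughly 2.7 to 4 times faster per image.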

New metadata tags have been detected in PNG file outputs from suspected GPT Image 2 generations, further supporting the theory of a fundamentally different system. This architectural overhaul matters because it suggests OpenAI is not simply iterating on existing technology but rebuilding its image generation stack from the ground up.
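
For readers who want to inspect PNG metadata themselves, the following is a minimal Python sketch that walks a PNG's chunks and collects uncompressed tEXt entries using only the standard library. The reports do not name the actual tags, so the "Software" value in the synthetic sample is purely hypothetical, and compressed or international text chunks (zTXt, iTXt) are not decoded here.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def iter_chunks(data: bytes):
    """Yield (chunk_type, payload) pairs from raw PNG bytes."""
    if data[:8] != PNG_SIGNATURE:
        raise ValueError("not a PNG file")
    pos = 8
    while pos + 8 <= len(data):
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        payload = data[pos + 8:pos + 8 + length]
        yield ctype.decode("ascii"), payload
        pos += 12 + length  # 4 length + 4 type + payload + 4 CRC
        if ctype == b"IEND":
            break


def text_metadata(data: bytes) -> dict:
    """Collect keyword/value pairs from uncompressed tEXt chunks."""
    found = {}
    for ctype, payload in iter_chunks(data):
        if ctype == "tEXt":
            key, _, value = payload.partition(b"\x00")
            found[key.decode("latin-1")] = value.decode("latin-1")
    return found


def make_chunk(ctype: bytes, payload: bytes) -> bytes:
    """Build one PNG chunk (used only to create the demo file below)."""
    crc = zlib.crc32(ctype + payload) & 0xFFFFFFFF
    return struct.pack(">I", len(payload)) + ctype + payload + struct.pack(">I", crc)


# Synthetic 1x1 grayscale PNG with a made-up tag, so the sketch runs offline.
ihdr = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)
sample = (
    PNG_SIGNATURE
    + make_chunk(b"IHDR", ihdr)
    + make_chunk(b"tEXt", b"Software\x00example-generator")  # hypothetical value
    + make_chunk(b"IDAT", zlib.compress(b"\x00\x00"))
    + make_chunk(b"IEND", b"")
)

print([name for name, _ in iter_chunks(sample)])
print(text_metadata(sample))
```

To examine a real file, read its bytes with `open(path, "rb").read()` and pass them to `text_metadata`. Comparing the resulting keys across outputs from different models is one way an unfamiliar tag might be spotted.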

The performance improvements over GPT Image 1.5 are substantial, and the leaked models also held up against competitors. They beat Google's Nano Banana Pro in blind comparisons on LM Arena, the platform formerly known as Chatbot Arena where users compare AI models in blind tests. Testers noted that GPT Image 2 leads on text accuracy and prompt-following, while Midjourney V7 and V8 still maintain stronger artistic style control. On realism, text rendering, and world knowledge, however, the tape models demonstrated clear advantages.

Steps to Understand What's Coming Next for OpenAI's Image Generation

  • Track the DALL-E Shutdown: DALL-E is officially shutting down on May 12, 2026, which creates an immediate need for a replacement. GPT Image 2's leaked appearance suggests OpenAI is preparing to migrate users to the new model before the deadline.
  • Monitor Pricing Changes: GPT Image 1 Mini launched in October 2025 at 80 percent cheaper API pricing than the original GPT Image 1. GPT Image 1.5 followed with 20 percent cheaper costs in December 2025. Expect GPT Image 2 to introduce new pricing tiers that reflect its improved speed and quality.
  • Watch for A/B Testing in ChatGPT: Some ChatGPT users reported being randomly served a noticeably different and better image model during regular usage, suggesting active A/B testing in the product itself. This testing phase typically precedes a wider rollout by weeks.
  • Compare Against Competitors: The AI image generation space has become extremely competitive. GPT Image 2 appears to lead on text and prompt-following, but understanding how it compares to Midjourney, Google's Nano Banana Pro, and other models will help determine which tool best fits specific use cases.

The timeline of OpenAI's image generation releases shows a clear pattern of rapid iteration. DALL-E 2 launched in April 2022 as the first widely available AI image generator. DALL-E 3 arrived in October 2023 with better prompt following and ChatGPT integration. GPT Image 1 launched in March 2025 as the first autoregressive image model inside ChatGPT, and 130 million users generated over 700 million images with it in the first week alone. GPT Image 1 Mini followed in October 2025, and GPT Image 1.5 shipped in December 2025 with four times faster generation and better quality.

The leaked models are not flawless. They were stumped by a Rubik's Cube mirror reflection test, an industry-wide challenge that no current image model handles correctly. On most other tests that testers reported, however, the tape models outperformed everything currently available. The models also showed significantly improved handling of non-Latin scripts, and they likely feature persistent embeddings for better character consistency across multiple generations, addressing a known weakness in GPT Image 1.5, where a character's appearance would drift.

OpenAI's decision to build GPT Image 2 as a standalone model rather than layering it on top of GPT-4o suggests the company is prioritizing image generation as a core capability rather than a secondary feature. The shift to single-pass inference and the architectural overhaul indicate that OpenAI is investing significant engineering resources into competing directly with specialized image generation tools like Midjourney and Stable Diffusion. The expected maximum resolution of 2048 by 2048 pixels or higher, compared to GPT Image 1.5's 1536 by 1024 maximum, further demonstrates the company's commitment to delivering professional-grade output quality.
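
To put the resolution figures in perspective, the short Python sketch below reduces each size to its aspect ratio and compares pixel counts. The helper name is illustrative, and 2048 by 2048 is the expected figure from the leak, not a confirmed specification.

```python
from math import gcd


def describe(width: int, height: int) -> tuple[str, float]:
    """Return the reduced aspect ratio and the size in megapixels."""
    d = gcd(width, height)
    return f"{width // d}:{height // d}", width * height / 1_000_000


old_ratio, old_mp = describe(1536, 1024)   # GPT Image 1.5 maximum
new_ratio, new_mp = describe(2048, 2048)   # expected GPT Image 2 maximum

print(f"GPT Image 1.5: {old_ratio}, {old_mp:.2f} MP")
print(f"GPT Image 2:   {new_ratio}, {new_mp:.2f} MP")
print(f"Pixel increase: {new_mp / old_mp:.2f}x")
```

By these numbers, the expected maximum holds about 2.67 times as many pixels as GPT Image 1.5's largest output (roughly 4.19 versus 1.57 megapixels).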