ChatGPT Images 2.0 arrives with visual reasoning and improved text generation

Beatriz

em April 28, 2026

On Tuesday, OpenAI released ChatGPT Images 2.0, an update to the image generation model integrated into the chatbot. The system, called gpt-image-2, brings improvements in text rendering, support for multiple languages and the ability to create more complex compositions. Usuários of all ChatGPT plans now access the standard version.

The launch took place on April 21, 2026. The company highlighted the inclusion of a “think” mode that allows the model to search the web, check details and generate up to eight images from a single prompt. Isso facilitates the production of storyboards, infographics and materials with a structured layout. Resolution reaches 2K in some cases.

OpenAI highlights reasoning and fidelity in new features

The model now follows instructions more accurately and preserves requested elements such as icons, interfaces, and small text. OpenAI cited examples of scientific posters, recipe cards and presentation slides as areas of gain. Support for Japanese, Korean, Hindi, and Bengali texts has also improved.

Desenvolvedores gain access via API, with prices based on quality and resolution. Paid Usuários have more capacity in advanced mode. The update comes months after previous improvements to the imaging system.

Geração of multiple interrelated images
Modo thinking with web search
Melhor text rendering in multiple languages
Suporte in varying proportions, from 3:1 to 1:3
Resolução up to 2K on selected outputs

https://twitter.com/OpenAI/status/2046670978890276918?ref_src=twsrc%5Etfw

Crítico points out flaws in functional understanding

Gary Marcus, a researcher known for questioning current AI capabilities, tested the new system with bicycle diagrams. In an automatic label, the model confused rear brake with seat tube and gear with brake. A label pointed to empty space.

In a more difficult test, Marcus asked for a taller than average tandem bike, with a luggage rack and saddlebags. The image generated presented problems such as a rear derailleur inserted into the wheel, a poorly positioned brake lever and a saddle-shaped rear handlebar. Marcus observed that the system matches visual patterns without understanding the actual function of the parts.

Especialistas compare with human limitations

Marcus recognized that the average human would also have difficulty drawing the tandem accurately. However, mechanics, experienced cyclists or designers would identify errors quickly. The example serves to discuss the extent to which the model understands the physical world.

The debate takes place as the sector celebrates leaps in professional imaging. Independent Testes confirmed gains in readable text and dense layouts, but specific cases still expose gaps in causal reasoning.

Ficha ChatGPT Technique Images 2.0

Modelo base: gpt-image-2
Disponibilidade: all ChatGPT users
Modo Advanced: Paid Subscribers
Recursos main: reasoning, multiple outputs, multilingual text
Resolução: up to 2K
API: released with variable pricing

OpenAI has not publicly commented on the Marcus testing at this time. The system continues to evolve, with regular updates based on user feedback.

ChatGPT Images 2.0 bietet visuelle Argumentation und verbesserte Textgenerierung »

« ChatGPT Images 2.0 chega com raciocínio visual e geração de texto aprimorada

Tags: ChatGPT Images 2.0gpt-image-2 templateOpenAI image generationvisual reasoning AI