ChatGPT 4o’s new image generation capabilities bring impressive quality but frustrating text restrictions, creating a contradictory user experience. While OpenAI claims the system excels at text-forward images like business cards and instructional posters, its content policies paradoxically prevent users from creating images with readable text on physical objects—a limitation that undermines one of its most promising technical achievements.
The big picture: OpenAI has integrated native image generation directly into ChatGPT 4o, replacing its previous reliance on DALL-E. The results are higher in quality but come with significant policy limitations.
Key capabilities: The new image generation system produces exceptionally detailed images, though each one takes longer to generate than in previous iterations.
Ethical guardrails: ChatGPT appropriately refuses to remove watermarks from uploaded images, showing improvement over Google’s Gemini in respecting copyright protections.
The contradiction: Although OpenAI markets the system as excelling at “text-forward” images like instructional posters and business cards, ChatGPT 4o refuses to generate images containing readable text on physical objects.
Why this matters: The restriction frustrates users because content policy, not technical ability, limits what the system will produce, blocking practical applications such as the business cards and posters OpenAI itself highlights.