OpenAI’s new ChatGPT image generator makes faking photos easy


For most of photography’s roughly 200-year history, altering a photo convincingly required either a darkroom, some Photoshop expertise, or, at minimum, a steady hand with scissors and glue. On Tuesday, OpenAI released a tool that reduces the process to typing a sentence.

It’s not the first company to do so. Although OpenAI has had a conversational image-editing model in the works since GPT-4o in 2024, Google beat it to market in March with a public prototype, then refined that prototype into a popular model called Nano Banana (and Nano Banana Pro). The enthusiastic response to Google’s image-editing model in the AI community got OpenAI’s attention.

OpenAI’s new GPT Image 1.5 is an AI image synthesis model that reportedly generates images up to four times faster than its predecessor and costs about 20 percent less through the API. The model rolled out to all ChatGPT users on Tuesday and represents another step toward making photorealistic image manipulation a casual process that requires no particular visual skills.

The “Galactic Queen of the Universe” added to a photo of a room with a sofa using GPT Image 1.5 in ChatGPT.

GPT Image 1.5 is notable because it’s a “native multimodal” image model, meaning image generation happens inside the same neural network that processes language prompts. (In contrast, DALL-E 3, an earlier OpenAI image generator previously built into ChatGPT, used a different technique called diffusion to generate images.)

This newer type of model, which we covered in more detail in March, treats images and text as the same kind of thing: chunks of data called “tokens” to be predicted, patterns to be completed. If you upload a photo of your dad and type “put him in a tuxedo at a wedding,” the model processes your words and the image pixels in a unified space, then outputs new pixels the same way it would output the next word in a sentence.
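
To make the "everything is a token" idea concrete, here is a toy sketch in Python. It is not OpenAI's implementation and says nothing about how GPT Image 1.5 actually encodes images. The vocabulary, the discrete image-patch codes, and the `toy_next_token` function are all hypothetical stand-ins for a trained network. The only point it illustrates is that text and image pieces can share one ID space and be extended by the same predict-and-append loop.

```python
from typing import Callable, List

# Hypothetical unified vocabulary: text tokens and image-patch tokens
# share a single ID space (all sizes here are toy assumptions).
TEXT_VOCAB = ["put", "him", "in", "a", "tuxedo"]
NUM_IMAGE_CODES = 8                  # assumed number of discrete patch codes
IMAGE_OFFSET = len(TEXT_VOCAB)       # image token IDs start after text IDs


def encode_text(words: List[str]) -> List[int]:
    """Map words to their text-token IDs."""
    return [TEXT_VOCAB.index(w) for w in words]


def encode_image(patch_codes: List[int]) -> List[int]:
    """Map discrete patch codes to image-token IDs in the shared space."""
    return [IMAGE_OFFSET + c for c in patch_codes]


def is_image_token(token: int) -> bool:
    return token >= IMAGE_OFFSET


def toy_next_token(context: List[int]) -> int:
    """Stand-in for a trained model: a deterministic function of the
    context that always returns some image-token ID."""
    return IMAGE_OFFSET + (sum(context) % NUM_IMAGE_CODES)


def generate(context: List[int],
             predict: Callable[[List[int]], int],
             n_new: int) -> List[int]:
    """Autoregressive loop: predict one token, append it, repeat."""
    seq = list(context)
    for _ in range(n_new):
        seq.append(predict(seq))
    return seq[len(context):]


# An uploaded "photo" (four patch codes) followed by a text instruction.
photo = encode_image([3, 1, 4, 1])
prompt = encode_text(["put", "him", "in", "a", "tuxedo"])
new_patches = generate(photo + prompt, toy_next_token, n_new=4)
```

In a real system, a learned network would replace `toy_next_token`, and a separate decoder would turn the predicted image tokens back into pixels. The loop itself is the same one used to produce the next word in a sentence.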

Using this technique, GPT Image 1.5 can more easily alter visual reality than earlier AI image models, changing someone’s pose or position, or rendering a scene from a slightly different angle, with varying degrees of success. It can also remove objects, change visual styles, adjust clothing, and refine specific areas while preserving facial likeness across successive edits. You can converse with the AI model about a photograph, refining and revising, the same way you might workshop a draft of an email in ChatGPT.


