Google’s generative AI video model is available in private preview

Date:

Share:


Google has begun rolling out private access to its Veo and Imagen 3 generative AI models. Starting today, customers of the company’s Vertex AI Google Cloud package can begin using Veo to generate videos from text prompts and images. Then, as of next week, Google will make Imagen 3, its latest text-to-image framework, available to those same users.

With Veo’s rollout, Google says it’s the first hyperscale cloud provider to offer an image-to-video model. To that point, OpenAI’s Sora model is still only available to select artists, academics and researchers — though that could change quickly with the company teasing 12 days of product demos starting December 5.

Example footage of Google's Veo video model.

Of Veo, Google says the model creates 1080p footage “that’s consistent and coherent” and can run “beyond a minute.” The tool is also capable of working with both text prompts and images. In the latter case, it’s possible to use either AI-generated or human-made pictures as the starting point for a video.

Looking at the sample footage Google shared, it’s evident Veo, like all AI models, can struggle with cause and effect. For example, in the clip of the roasting marshmallows, the treats don’t yellow and char as they’re exposed to the heat of a campfire flame. Artifacting is also an issue, as is apparent if you look closely at the hands in the concert footage.

Example outputs from Google's Imagen toolExample outputs from Google's Imagen tool

Google

As for Imagen 3, Google says the model generates “the most realistic and highest quality images from simple text prompts, surpassing previous versions of Imagen in detail, lighting, and artifact reduction.” Here again, however, you don’t have to look too closely to see Google has more work to do.

In the first example of a group of friends sitting on the trunk of a car, the original prompt includes mention of “flash photography,” but the subjects are clearly backlit. One could argue that a flash was used to create intense backlighting, but if the idea behind the prompt was to create something representative of flash photography from the 1960s, this image isn’t it.

Still, Google is keen to get more of its enterprise customers using generative AI. Citing its own research, the tech giant says among companies using generative AI in production, 86 percent report an increase in revenue. However, a recent Appen survey found return on investment from AI projects fell by 4.6 percentage points from 2023 to 2024.

If you buy something through a link in this article, we may earn commission.



Source link

━ more like this

Want a cordless vacuum for under $100? This one’s just $65!

If you’re searching for the ultimate bargain from the available cordless vacuum deals, look no further than Walmart’s offer for the PrettyCare W200....

Save $100 on the perfect starter gaming PC at Best Buy

Looking for great gaming PC deals while keeping costs down? Right now, you can buy a CyberPowerPC Gamer Master Gaming Desktop for $600...

Experiment showcases 3D dental scanner capable of running Counter Strike: Source

One would assume that medical equipment is not as capable as a modern PC. However, in a surprising and creative tech experiment, Redditor...

Peloton is introducing a new audio-focused strength training app

Peloton is continuing to expand into products other than stationary bikes and treadmills with a new strength training app called Peloton Strength+. The...

High-Tech, High-End: Must-Have Luxury Tech Gadgets to Gift This Holiday

Table of Contents Table of Contents Terra Kaffe Super Automatic Espresso Machine – For the Sustainable Coffee Snob Oura Ring 4 – For the One with...
spot_img