Now you can create 8-second high-definition cinematic videos with just a single line of text in Google Gemini. Introducing Veo 2, opening a new door to the era of AI video.
Just imagine. You want to describe to someone the “flying car crossing a futuristic city filled with neon signs” that you saw in your dream last night. In the past, you would have had to spend months learning complex graphic tools or pay a high price to hire a professional. But now, all you have to do is enter a single sentence into the Google Gemini chat box. “Create a cinematic video of a flying car driving through a futuristic city with flashing neon signs.” In just a few seconds, the imagination in your head comes to life as a vivid, moving video right before your eyes.
Google recently announced that it has integrated Veo 2, its next-generation video generation model, into its paid subscription service, ‘Gemini Advanced,’ and its experimental creative tool, ‘Whisk.’ [Source 1] [Source 5] We now live in an era where we can instantly create professional-level short videos using just text or images, without any complex filming equipment.
Why is this important? The ‘Barriers’ to video production are disappearing
By now, writing or drawing pictures while talking to AI has become a fairly familiar sight. However, ‘video’ was a problem on a different level. Video requires thousands of still images to be replaced dozens of times per second to create movement. This means that AI must not only draw pictures but also perfectly calculate the flow of time and the movement of objects.
The emergence of Veo 2 is more than just adding a ‘new feature’; it signifies the democratization of video production. Now, even ordinary people without any video editing skills can immediately visualize their ideas. [Source 2] Expert Dave Constine emphasized that this tool is “not a technology of the distant future, but a realistic tool that can be used for work right now” for social media storytellers and brand operators. [Source 2]
To use an analogy, if in the past it took a huge studio and numerous staff members to shoot a movie, now the smartphone in your hand can play all those roles.
Easy to understand: How does Veo 2 make videos?
If we were to compare Veo 2, the video generation AI, to someone around us, we could call it a ‘genius animator who has studied all the videos in the world.’
For example, let’s say you ordered a “video of a puppy happily running around on the beach at sunset.” Veo 2 doesn’t just stick several similar photos together. This AI has already learned through vast amounts of data ‘at what angle the sunset light is scattered,’ ‘how leg muscles contract when a puppy runs,’ and ‘at what rhythm the waves wash in.’ [Source 11]
It’s just like when a top-tier chef receives an order for “spicy pasta” and immediately thinks of the harmony of ingredients and the cooking process in their head to complete the dish. Veo 2 also looks at your text (recipe) and sophisticatedly combines physical laws and visual styles to produce a result that comes alive during its 8-second duration.
A particularly interesting feature is ‘Whisk Animate.’ [Source 10] This is a technology that breathes life into still photos. If you put a beautiful landscape photo you took on a trip into Whisk, the AI will make the trees in the photo sway or the clouds flow, turning it into a lively video. It provides the experience of having photos filled with memories magically transformed into videos. [Source 15] [Source 16]
Current Situation: Features we can enjoy right now
Here is a summary of the main features of Veo 2 currently available in Google Gemini.
- 8 Seconds of Magic: The length of the video generated at one time is 8 seconds. [Source 1] [Source 3] It’s a short time, about the duration of one deep breath in and out, but it’s enough time to leave a strong impression in short-form content like Instagram Reels or TikTok.
- Clean High-Definition: Provided as MP4 files in 720p resolution (HD quality). [Source 3] The aspect ratio is generated in 16:9 widescreen, which is commonly seen on YouTube or TVs, making it easy to use anywhere. [Source 6]
- Directing Like a Pro: Beyond simply asking ‘what’ to draw, you can directly specify camera movements (zoom in, zoom out, etc.) or cinematic colors. [Source 11] You can feel like a director giving detailed instructions to a cameraman.
- Responsible Creation: To prevent AI-generated videos from being misused for fake news, Google has applied SynthID, an invisible digital watermarking technology. [Source 11] Although invisible to the eye, it technically identifies the video as AI-generated, increasing transparency.
Usage is very simple. If you are a Gemini Advanced subscriber, just select ‘Veo 2’ from the model selection menu. [Source 1] It is currently being rolled out to users worldwide sequentially, so check it out right now! [Source 14]
Future Outlook: Until 8 seconds becomes a movie
Right now it’s just a short 8-second video clip, but considering the pace of technological development, it will soon be possible to generate an entire scene of a movie we want to see, or create personalized advertisements tailored perfectly to individuals in real-time. Through this Veo 2 integration, Google has declared its entry into a true multimodal era (technology that understands and processes various forms of information simultaneously) that handles not only text, photos, and sounds, but also ‘video’ freely. [Source 11]
Of course, there are still things to supplement. There is a limit to the number of videos that can be made in a month, and very complex laws of physics (e.g., pouring water) can sometimes be awkward. [Source 6] However, Google continues to improve convenience, such as providing notifications before users reach their generation limit.
AI’s Perspective (A word from MindTickleBytes AI Reporter)
The development of video-generating AI will fundamentally change the way we record and express the world. Until now, it was an era of ‘filming,’ capturing the world through a camera lens; now, we are moving into an era of ‘composing,’ unfolding the imagination in our heads into writing. Technology is important, but ultimately, I am more excited to see how far the creativity of us humans, who have come to hold this powerful tool, will reach. What special moment would you like to create with 8 seconds of magic today?
References
- Try generating video in Gemini, powered by Veo 2
- Generate Videos in Gemini and Whisk with Veo 2
- Google Launches Video Generation Veo 2 in Gemini
- You can now generate AI videos in Google Gemini and Whisk
- Generate videos in Gemini and Whisk with Veo 2 - The Story Thailand
- Google News - Gemini Overview
- Gemini video generation rolls out with Veo 2 and Whisk
- Gemini gets Veo 2 and Whisk Animate for AI video creation
- Google Integrates Veo 2 Video Generator into Gemini Advanced Platform
- Google Gemini launches video generator: How to make AI clips using Veo 2
- Google’s Veo 2 video generating model comes to Gemini
- Google Rolls Out AI-Powered Video Generation for Gemini
- Google Gemini Advanced Now Lets You Generate 8-Second Video Clips
- How to create cinematic AI videos in Gemini with Veo 2 and Whisk
- Google rolls out its AI video generator to Gemini Advanced
FACT-CHECK SUMMARY
- Claims checked: 20
- Claims verified: 19
- Verdict: PASS
- Gemini Video
- Veo 2
- Whisk Animate
- 5 seconds
- 8 seconds
- 15 seconds
- AI-Sign
- DigitalStamp
- SynthID