Generative AI Publication

Google's New VLOGGER AI Is Next-Level Deepfake Video Generator

VLOGGER is a novel framework that can generate human-like vlogs from an audio clip and a single photo.

Jim Clyde Monge
Mar 24, 2024

AI is moving from simple image deepfakes to videos.

No, I am not talking about the face-swapping deepfake videos we’ve seen before. This is something far more intriguing and potentially unsettling.

Today, Google released a research paper detailing a novel framework called VLOGGER that generates a video of a human vlogger using only an audio clip and a single image as input.

If you thought deepfakes were scary, this takes the use of AI technology to a whole new level.

What is VLOGGER?

VLOGGER uses a multimodal diffusion technique to synthesize video of a human from audio. It can generate photorealistic videos of a person talking with realistic head movements, facial expressions, gaze, and even hand gestures.

Here’s an example:
