About Us
Wednesday, June 10, 2026
  • tr Türkçe
  • en English
KREAblog | Creative News
No Result
View All Result
  • Home
  • World’s Firsts
  • TOP 10
  • Brand / Advertising
  • Artificial Intelligence
  • Technology
  • Design
  • Social Media
KREAblog | Creative News
  • Home
  • World’s Firsts
  • TOP 10
  • Brand / Advertising
  • Artificial Intelligence
  • Technology
  • Design
  • Social Media
No Result
View All Result
KREAblog | Creative News
No Result
View All Result
Home Artificial Intelligence

Another Artificial Intelligence Program from Google “VLOGGER”

23/03/2024
in Artificial Intelligence
A A
Another Artificial Intelligence Program from Google “VLOGGER”
1
SHARES
62
VIEWS
Share on FacebookShare on TwitterShare on Whatsapp
Recently, there has been another important step in the field of artificial intelligence: Google’s new artificial intelligence software called VLOGGER. This software offers a way to create text- and voice-driven talking human video from a single input image of a person. This innovative method builds on the success of recent generative diffusion models and was evaluated on three different criteria: image quality, identity preservation and temporal consistency.VLOGGER consists of a stochastic human-to-3D motion diffusion model and a novel diffusion-based architecture that augments text-to-image models with both temporal and spatial controls. This approach enables the creation of variable-length high-quality videos that can be easily controlled through high-level representations of human faces and bodies. Unlike previous work, VLOGGER does not require separate training for each person, does not rely on face detection and clipping, and considers a wide range of scenarios to accurately synthesize people communicating.

Google VLOGGER

The program is similar to Alibaba’s EMO application, which we have introduced to you before. To understand how VLOGGER works, the goal is to create a variable-length photorealistic video depicting a talking target person, including head and gestures. The first network takes an audio waveform as input to generate intermediate body motion controls responsible for gaze, facial expressions and pose along the target video length. The second network is a temporal image-to-image translation model that extends large image diffusion models by taking the estimated body controls. To condition the process to a specific identity, the network also takes a reference image of a person.

The diversity of the model is an important measure of success. The model provides a significant amount of motion and realism while producing a diverse distribution of videos of the original subject. This emphasizes the realistic appearance and diversity of the generated videos. Furthermore, VLOGGER’s video editing applications are also quite impressive. For example, VLOGGER can take a video and close the mouth or eyes to change the subject’s expression, making video edits consistent with the original unaltered pixels.

Another Artificial Intelligence Program from Google “VLOGGER” 3

Google and Artificial Intelligence

VLOGGER represents an important step forward in the field of human talking video production with artificial intelligence. Standing out from other state-of-the-art methods in terms of image quality, identity preservation, and temporal consistency, this model could shape future developments in this field and offer impressive application areas. You can click on the link for a more detailed review and to see the work done.

ShareTweetSend
Previous Post

Neuralink Chip Subject Manages to Play Chess with Mind Power

Next Post

WhatsApp is also Integrating Artificial Intelligence

Related News

Laptop on a desk displaying lines of code in a dark blue-lit office setting
Artificial Intelligence

Prompt Injection Attacks: AI’s Sneaky Weak Spot

07/06/2026
Large computer monitor showing colorful code on a tidy desk, with keyboard, mouse, mug, and small potted plants under warm indoor lighting.
Artificial Intelligence

AI Coding Tools Shift to Pay-Per-Use Models

31/05/2026
Sunlit government chamber with a curved wooden dais, microphones, scattered papers, and empty chairs.
Artificial Intelligence

AI Security Rules Face a Political Crossroads

24/05/2026
People hiking inside a large volcanic crater at sunset, with orange dust and a bulldozer parked on the slope.
Artificial Intelligence

AI Gold Rush Winners: Who Actually Gets Rich?

17/05/2026
Next Post
WhatsApp is also Integrating Artificial Intelligence

WhatsApp is also Integrating Artificial Intelligence

Taxis You'll Be Willing to Wait in Line

Taxis You'll Be Willing to Wait in Line

Search in KREAblog

No Result
View All Result

Recent News

AI Branding: Trust Now Has Two Audiences

AI Branding: Trust Now Has Two Audiences

10/06/2026
Sunlit city street at dawn shrouded in fog between tall skyscrapers, with pedestrians along the sidewalk and orange glow filling the scene.

AI Companies IPO: What Public Markets Really Mean

09/06/2026
Close-up of a dead mosquito perched on a circuit board among chips and capacitors.

The First Computer Bug Ever Found in Hardware

08/06/2026
Laptop on a desk displaying lines of code in a dark blue-lit office setting

Prompt Injection Attacks: AI’s Sneaky Weak Spot

07/06/2026
The Longest-Running Tech Hoaxes Ever Believed

The Longest-Running Tech Hoaxes Ever Believed

06/06/2026

Popular News

  • Batman Designed Tables

    Batman Designed Tables

    1 shares
    Share 0 Tweet 0
  • New Honda Logo in Step with the Times

    1 shares
    Share 0 Tweet 0
  • OpenAI’s New Multimodal Intelligence “GPT-4o”

    1 shares
    Share 0 Tweet 0
  • Different Hotel Concepts for Those Who Want to Get Away from Classic Hotels

    1 shares
    Share 0 Tweet 0
  • Changi Airport in the Heart of Nature

    1 shares
    Share 0 Tweet 0
KREAblog

Recent Posts

AI Branding: Trust Now Has Two Audiences

AI Companies IPO: What Public Markets Really Mean

The First Computer Bug Ever Found in Hardware

Prompt Injection Attacks: AI’s Sneaky Weak Spot

The Longest-Running Tech Hoaxes Ever Believed

KREAblog Menu

  • Home Page
  • About Us
  • Contact Us
  • Cookie Policy
  • Privacy Policy
© 2024 KREAblog – Designed by KREABAZ.
  • tr Türkçe
  • en English
No Result
View All Result
  • Home
  • World’s Firsts
  • TOP 10
  • Brand / Advertising
  • Artificial Intelligence
  • Technology
  • Design
  • Social Media

© 2024 KREAblog - Designed by KREABAZ.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.