About Us
Thursday, June 19, 2025
  • tr Türkçe
  • en English
KREAblog | Creative News
No Result
View All Result
  • Home
  • Brand / Advertising
  • Artificial Intelligence
  • Technology
  • Design
  • Social Media
  • TOP 10
KREAblog | Creative News
  • Home
  • Brand / Advertising
  • Artificial Intelligence
  • Technology
  • Design
  • Social Media
  • TOP 10
No Result
View All Result
KREAblog | Creative News
No Result
View All Result
Home Artificial Intelligence

Another Artificial Intelligence Program from Google “VLOGGER”

23/03/2024
in Artificial Intelligence
A A
Another Artificial Intelligence Program from Google “VLOGGER”
1
SHARES
43
VIEWS
Share on FacebookShare on TwitterShare on Whatsapp
Recently, there has been another important step in the field of artificial intelligence: Google’s new artificial intelligence software called VLOGGER. This software offers a way to create text- and voice-driven talking human video from a single input image of a person. This innovative method builds on the success of recent generative diffusion models and was evaluated on three different criteria: image quality, identity preservation and temporal consistency.VLOGGER consists of a stochastic human-to-3D motion diffusion model and a novel diffusion-based architecture that augments text-to-image models with both temporal and spatial controls. This approach enables the creation of variable-length high-quality videos that can be easily controlled through high-level representations of human faces and bodies. Unlike previous work, VLOGGER does not require separate training for each person, does not rely on face detection and clipping, and considers a wide range of scenarios to accurately synthesize people communicating.

Google VLOGGER

The program is similar to Alibaba’s EMO application, which we have introduced to you before. To understand how VLOGGER works, the goal is to create a variable-length photorealistic video depicting a talking target person, including head and gestures. The first network takes an audio waveform as input to generate intermediate body motion controls responsible for gaze, facial expressions and pose along the target video length. The second network is a temporal image-to-image translation model that extends large image diffusion models by taking the estimated body controls. To condition the process to a specific identity, the network also takes a reference image of a person.

The diversity of the model is an important measure of success. The model provides a significant amount of motion and realism while producing a diverse distribution of videos of the original subject. This emphasizes the realistic appearance and diversity of the generated videos. Furthermore, VLOGGER’s video editing applications are also quite impressive. For example, VLOGGER can take a video and close the mouth or eyes to change the subject’s expression, making video edits consistent with the original unaltered pixels.

Another Artificial Intelligence Program from Google “VLOGGER” 3

Google and Artificial Intelligence

VLOGGER represents an important step forward in the field of human talking video production with artificial intelligence. Standing out from other state-of-the-art methods in terms of image quality, identity preservation, and temporal consistency, this model could shape future developments in this field and offer impressive application areas. You can click on the link for a more detailed review and to see the work done.

ShareTweetSend
Previous Post

Neuralink Chip Subject Manages to Play Chess with Mind Power

Next Post

WhatsApp is also Integrating Artificial Intelligence

Related News

Introducing The New GPT 4.5 Features 1
Artificial Intelligence

Introducing The New GPT 4.5 Features

04/03/2025
AI in the Technical Director's Chair! 1
Artificial Intelligence

AI in the Technical Director’s Chair!

03/02/2025
China's DeepSeek is the New Leader of AI 1
Artificial Intelligence

China’s DeepSeek is the New Leader of Artificial Intelligence

29/01/2025
First AI Movie with Google Veo 2 (1)
Artificial Intelligence

First AI Movie with Google Veo 2

24/12/2024
Next Post
WhatsApp is also Integrating Artificial Intelligence

WhatsApp is also Integrating Artificial Intelligence

Taxis You'll Be Willing to Wait in Line

Taxis You'll Be Willing to Wait in Line

Search in KREAblog

No Result
View All Result

Recent News

First Synthetic Biological Intelligence CL1

First Synthetic Biological Intelligence CL1

08/06/2025
What Does Google Veo 3 Offer

What Does Google Veo 3 Offer?

26/05/2025
Google Changes Its Logo After 10 Years

Google Changes Its Logo After 10 Years

14/05/2025
The First Electric Car in History

The First Electric Car in History

05/05/2025
Riding Robots Era Begins with Kawasaki CORLEO 1

Riding Robots Era Begins with Kawasaki CORLEO

07/04/2025

Popular News

  • Batman Designed Tables

    Batman Designed Tables

    1 shares
    Share 0 Tweet 0
  • New Honda Logo in Step with the Times

    1 shares
    Share 0 Tweet 0
  • Different Hotel Concepts for Those Who Want to Get Away from Classic Hotels

    1 shares
    Share 0 Tweet 0
  • Changi Airport in the Heart of Nature

    1 shares
    Share 0 Tweet 0
  • OpenAI Launches GPT Store

    1 shares
    Share 0 Tweet 0
KREAblog

Recent Posts

First Synthetic Biological Intelligence CL1

What Does Google Veo 3 Offer?

Google Changes Its Logo After 10 Years

The First Electric Car in History

Riding Robots Era Begins with Kawasaki CORLEO

KREAblog Menu

  • Home Page
  • About Us
  • Contact Us
  • Cookie Policy
  • Privacy Policy
© 2024 KREAblog – Designed by KREABAZ.
  • tr Türkçe
  • en English
No Result
View All Result
  • Home
  • Brand / Advertising
  • Artificial Intelligence
  • Technology
  • Design
  • Social Media
  • TOP 10

© 2024 KREAblog - Designed by KREABAZ.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.