Microsoft’s New AI Can Make Photographs Sing and Talk — and It Already Has the Mona Lisa Lip-Syncing

Microsoft published a research paper this week highlighting a new AI model called VASA-1 that can transform a single picture and audio clip of a person into a realistic video of them lip-syncing — with facial expressions, head movements, and all.

The portraits in the demos are AI-generated images from generators like DALL·E-3, which the researchers then paired with audio clips. The results are still images turned into videos of talking faces.

The researchers built on technology from competitors such as Runway and Nvidia, but state in the paper that their method produces higher-quality, more realistic results and “significantly outperforms” existing methods.

→ Continue reading at Entrepreneur
