Microsoft's VASA-1 model lets photos talk

Microsoft recently published a study in which the company presented the AI model VASA-1. This model uses portrait photos and associated audio files to show realistic “talking heads”. This technology offers creative options, but carries serious risks.

The VASA-1 AI model is still in a research phase. Microsoft does already show that it can make portrait photos of people talk ‘realistically’ in combination with audio files. The facial expressions shown are context sensitive, adapting to the detected tone of the audio.

Very realistic

The individuals in the portrait photographs used do not have to look directly into the camera. In addition, the AI model has many features such as determining eye gaze, head distance and even emotional expressions.

This gives the processed images a very realistic “look and feel” when they appear to be talking, Microsoft states. The technology enables these “talking pictures” to sing songs, among other things.

According to Microsoft, VASA-1 was designed specifically for animating virtual characters. The images released by the tech giant at the inquiry are said to be virtual examples created with OpenAI’s DALL-E.

Collage van verschillende gezichtsuitdrukkingen van meerdere individuen, die emoties zoals vreugde, verwarring en verrassing demonstreren, voor een visuele audiosynchronisatieanalyse.

Use cases and serious risks

The new technology obviously offers many possible uses. Obviously, it can be used to develop more realistic AI characters, complete with “normal” lip-synching and facial expressions for more depth. It also makes it possible to create avatars for social media videos. Microsoft itself also came up with having the Mona Lisa sing as a striking example of the very varied ways the technology can be used.

Yet there are also risks associated with this new AI technology. If the technology were publicly available, it could lead directly to much more convincing deepfakes. The very potential malicious use of the technology is a reason for Microsoft to keep the specific details of VASA-1 to itself for now. In doing so, the researchers warn that although the technology has good intentions especially for the creative sector, the dangers of misuse are most certainly lurking.

Also read: French AI startup Mistral AI again looking for investors

Top story

SAP seeks AI jewels at every S/4HANA corner

At SAP, AI investments are starting to pay off. Users of the S/4HANA ERP can use AI tools such as Joule to wo...

Berry Zwets 21 hours ago

Tech calendar

Whitepapers

Enhance your data protection strategy for 2025

The Data Protection Guide 2025 explores the essential strategies and...

Are you data and AI ready?

Discover the essential strategies and imperatives to create a data an...

Microsoft’s VASA-1 model lets photos talk

Very realistic

Use cases and serious risks

Stay tuned, subscribe!

CISA saves MITRE’s CVE database at the 11th hour

Free VMware ESXi hypervisor returns

MuleSoft meets AI: Powering the Agentforce revolution

Google lets AI agents do the data work in BigQuery and Looker

Data analysts still heavily reliant on spreadsheets

Tableau keeps business intelligence (BI) alive and kicking

47 years of SAS: age gives SAS an edge in current AI landscape

VeeamON 2025

GITEX ASIA

SAS Innovate 2025

.NEXT 2025

LambdaConf 2025

Qlik Connect 2025

Cloud Account Executive – Slack

AI & Data Architect

Try the latest high-end Synology backup system for free

Enhance your data protection strategy for 2025

Strengthen your cybersecurity with DNS best practices

Are you data and AI ready?