Everyone knows sound is Singapore Archivesa critical component to most films and videos. After all, even when films were silent, there was still a musical accompanist letting the audience know how to feel.
This natural law remains the same for the new crop of generative AI videos, which emerge eerily silent. That's part of why Google has been working on "video-to-audio" technology (V2A) which "makes synchronized audiovisual generation possible." On Monday, Google's AI lab, DeepMind, shared progress on generating such audio including soundtracks and dialogue that automatically match up with AI-generated videos.
Google has been hard at work developing multimodal generative AI technology to compete with rivals. OpenAI has its AI video generator Sora (yet to be publicly released) and GPT-4o, which creates AI voice responses. Companies like Meta and Suno have been exploring AI-generated audio and music, but pairing audio with video is relatively new. ElevenLabs has a similar tool that matches audio to text prompts, but DeepMind says V2A is different because it doesn't require text prompts.
V2A can be paired with AI video tools like Google Veo or existing archival footage and silent films. This can be used for soundtracks, sound effects, and even dialogue. It works by using a diffusion model trained with visual inputs, natural language prompts, and video annotations to gradually refine random noise into audio that fits the tone and context of videos.
Google DeepMind says V2A can "understand raw pixels" therefore you don't actually need a text prompt to generate the audio, but it does help with the accuracy. The model can also be prompted to make the tone of the audio sound positive or negative. Along with the announcement, DeepMind released some demo videos, including a video of a dark, creepy hallway accompanied by horror music, a lone cowboy at sunset scored to a mellow harmonica tune, and an animated figure talking about its dinner.
V2A will include Google's SynthID watermarking as a safeguarding measure against misuse, and Deepmind's blog post says the feature is currently undergoing testing before it's released to the public.
Topics Artificial Intelligence Google
New York Comic Con's 'Game of Thrones' cosplayers are all worthy of the Iron ThroneFacebook is spinning off Events into a separate app, but don't freak outIvanka Trump tweets about pizza, remains silent on Donald Trump's leaked audioAll the humans in 'Stranger Things,' replaced by hamstersWhat Donald Trump's comments mean to me as a survivor of sexual assaultCNN contributor blasts colleague for asking her not to quote Trump's remarksEven Speedtest thinks Reliance Jio's internet speeds are slowing downIt's time for Samsung to explain what the hell is wrong with the Note7Webb telescope spots proof of the first stars to light the universeNo, 'Invincible' isn't ending because 'The Walking Dead' is more popularGrubhub and Seamless have gift cards nowStarbucks launches its first premium Reserve store in CambodiaWe found out what you can win playing Xbox ArenaBeyoncé invites Jay Z, Serena Williams onstage at last Formation tour stopDonald Trump audio leak dominates the news... except on Fox NewsJustin Bieber is terrible at disguisesIs this video evidence that creepy clowns really are in the UK?Donald Trump caught on tape: 'I did try and f*ck her, she was married'Dog unitards are here, just in time for fall and not without controversyHow to save for 2017 and 2057 at the same time Spacecraft beams back stunning moon video before ambitious landing AT&T data breach impacts tens of millions of customers China's Ehang and JAC to form joint venture for flying car production · TechNode Musetti vs. Djokovic 2024 livestream: Watch Wimbledon for free NASA just jumped online to correct outrageous space station misinformation Target Circle Week is here — just ahead of Prime Day miHoYo founder’s AI game Whispers From The Star features real What is the newest Xbox? In rare move, BMW and China’s Huawei sign deal for in Manus partners with Alibaba’s Qwen to expand AI capabilities · TechNode Tencent reportedly purchases billions worth of NVIDIA H20 chips · TechNode Luchen Technology becomes first to drop DeepSeek API over cost concerns · TechNode Americans to witness a lunar eclipse blood moon. Here's who'll have good viewing weather. Amazon deals of the day: Kindle Scribe, Samsung Galaxy Tab S9, Coleman camping grill, and more Network of TikTok accounts using AI to spread political misinformation, report finds Wordle today: The answer and hints for July 11 Huawei applies for trademarks on the Monkey King and other fictional figures · TechNode Tongji University purchases 10 Unitree humanoid robots for student training · TechNode The best noise China Mobile, Huawei, and Leju Robot unveil world's first 5G
1.4188s , 8203.9296875 kb
Copyright © 2025 Powered by 【Singapore Archives】,Defense Information Network