AI has entranced the scientific community. While chatbots like ChatGPT might be the most prominent AI we see in our daily lives, there’s a lot more you can do with AI than just talk to it. In fact, some researchers have even found a way to create a sound-based AI image generator that uses soundscapes to create accurate street images.
In a new paper published in Computers, Environment and Urban Systems, researchers showed that it is possible to take the “soundtracks” of real locations of urban and rural settings and recreate them using AI. Researchers at the University of Texas at Austin carried out the study, working to convert sounds from audio recordings into fairly accurate street-view images like you might see on Google Street View.
It’s quite an accomplishment, to be honest, and reminds me quite a bit of the AI-powered camera that takes photos without a lens by using location data to recreate wherever the photographer has pointed it. These researchers used both audio and visual data to train their sound-based AI image generator. They then tested using just audio to recreate some of the locations from which they captured soundscapes.
The results are quite compelling, showcasing just how much the acoustic environments of an area can help represent the visual nature of the location, too. The researchers used a YouTube video, as well as audio clips from cities in North America, Asia, and Europe, to carry out their tests. They created 10-second audio clips and image stills from the locations to train the AI model used in their image generator.
They then compared the images created from 100 audio clips to photos taken of their respective real-world locations using both human and computer evaluations. They discovered that the sound-based AI image generator was capable of capturing the scene accurately just based on the acoustic properties—something that was previously a uniquely human capability.
The post AI can create accurate images of streets just by listening to them appeared first on BGR.
Today’s Top Deals
Cyber Week deals: $180 iPhone SE 3, $199 Bose QC headphones, $29 Roku Stick 4K, $279 Google Nest WiFi Pro, more
Sonos has 10 Black Friday deals that are all at record-low prices
Amazon is giving out free money for Black Friday 2024
Cyber Week deals: $329 Apple Watch S10, $50 off Microsoft Office 2024, $374 PS5 Slim, Vitamix blenders, more
AI can create accurate images of streets just by listening to them originally appeared on BGR.com on Mon, 9 Dec 2024 at 19:38:00 EDT. Please see our terms for use of feeds.