Take a Virtual, Interactive Tour 

Advancing Computer Science, Blogs, Geeky Stuff, vision, Voice, Human Computer Interaction, Artificial Intelligence

  |  
4 Min Read

Computers Are Watching Videos To Learn The World Around Them

Source: wired.com Source: wired.com

The Massachusetts Institute of Technology, MIT, let their AI computer watch a year's worth of videos from the web in the name of learning about sounds. It is more common now that computers can identify objects, faces, animals, and landmarks – but the sounds in our world are still foreign to them.

A group of computer scientists from MIT have built off an existing project "computer vision" to add the capability of listening to what it is seeing. Humans have a natural, simultaneous relationship between sight and sound, so why don't our computers?

After processing a year's worth of online videos, "SoundNet" was created. It is able to determine sounds that we commonly can identify like rain, a baby's babble, human noises, and some animals. Using its vision and hearing, the computers would compare what it saw to the noises it was hearing. It learned things like babies tend to babble a lot. After learning with the help of video, the computer can now identify noises without visual queues.

googlehome-ed Google Home

In a Turing test, SoundNet scored a 92% in identifying noises with a few mistakes like mistaking footsteps for knocking on the door or humans laughing and the sound of hens. It's nothing that cannot be fixed, though - This is only with one year of media.

This type of development will greatly improve the area of computer listening. Voice assistants and sound recognition could use a revamp. Siri is a good example of some of the struggles in voice AI – ambient noise, stuttering of words, and inflections in voices. This may give users the natural communication that our devices lack. Home voice AI could be truly interactive when knowing that a dog is barking or a cat is meowing, or more importantly the sound of a window shatter or smoke alarm.

If we want natural communication, we need to give our computers the ability to experience their surrounding the same way we do. It will know to ignore the dog's bark when we're trying to tell it something or it will be the next best baby monitor distinguishing between a happy baby and a crying one.

Rutkin, Aviva. 3 Nov 2016. "Binge-watching videos teaches computers to recognize sounds". Daily News.

Comment

Telescopes Are Really Just Time Machines

Did you know telescopes are time machines? Not actually, as cool as that would be, but they DO look at the past. The way a telescope works is that light from a celestial object is gathered and ...
Picture of Bailey Nielsen Bailey Nielsen 4 Min Read

Seamless Transition from Person to Computer.

We all know about the recent events in the news. From the start of this year, we have encountered many different events. The current event we're dealing with is the pandemic and many students like ...

2020 Astronomy Events You Can See at UAT

While Phoenix is not exactly known for its night skies, Arizona is. On a new moon night, the desert lights up with the cosmos around us. There are quite a few cool astronomical events that will occur ...
Picture of Bailey Nielsen Bailey Nielsen 4 Min Read