Breakthroughs in Machine Learning

New System Can Recognize Objects from a Verbal Description

Recently, computer scientists at MIT revealed a major breakthrough in machine learning.  They created a system which learned to recognize objects in photographs based upon provided audio descriptions in real-time.  The success of the project illustrates new potential for artificial intelligence due to the ability of the system to utilize voice and object recognition in conjunction.

Low-level speech recognition technology, like the technology presently used by smart phones and voice assistants, are programmed through an arduous training process of analysis, filtering, and digitization.  To train these systems, countless hours of recordings and their associated transcriptions must be processed so that the language can be mapped.  The “training” process becomes more complex when there is a necessity to learn different accents or dialects.

The system that MIT has developed learns more like a human child. Similar to the board books used to teach language and word association to toddlers, the system is given words in an audio format and objects in the form of raw images, which it learns to associate with one-another.

“We wanted to do speech recognition in a way that’s more natural, leveraging additional signals and information that humans have the benefit of using, but that machine learning algorithms don’t typically have access to.”

David Harwath, a researcher in MIT’s Computer Science and Artificial Intelligence Lab

Though the system has only learned a couple hundred words so far, developers have high hopes for the future of the technology. One of the most exciting possibilities is the potential for advanced language translation applications. The ability to correlate objects to spoken words can eliminate the need for transcription data, accelerating the speed in which technology can learn languages, and open the door for real-time personal translation technologies.

For more details on this new technology, read the MIT’s full Press Release here.

 

Media Contact:
Gerald Jonathan
541.335.2283
gerald.jonathan@kghawes.com

 

 

 

Media Inquiries

If you have a press request or inquiry, please fill out our Media Inquiry form.

Looking for a Trusted Partner?