Technology
Microsoft develops first human-like speech recognition system
In a major breakthrough in the field of speech recognition, Microsoft researchers have created a technology that accurately recognises the words in a conversation like humans do.
 
The team from Microsoft Artificial Intelligence and Research reported a speech recognition system that makes the same or fewer errors than professional transcriptionists.
 
The researchers reported a word error rate (WER) of 5.9 percent, down from the 6.3 percent WER the team reported just last month.
 
The 5.9 percent error rate is about equal to that of people who were asked to transcribe the same conversation, and it's the lowest ever recorded against the industry standard "Switchboard" speech recognition task.
 
"We've reached human parity. This is an historic achievement," said Xuedong Huang, the company's chief speech scientist in a Microsoft blog post.
 
The milestone means that, for the first time, a computer can recognise the words in a conversation as well as a person would.
 
In doing so, the team has beat a goal they set less than a year ago - and greatly exceeded everyone else's expectations as well.
 
"Even five years ago, I wouldn't have thought we could have achieved this. I just wouldn't have thought it would be possible," said Harry Shum, executive vice president who heads the Microsoft Artificial Intelligence and Research group.
 
The research milestone comes after decades of research in speech recognition, beginning in the early 1970s with DARPA, the US agency tasked with making technology breakthroughs in the interest of national security.
 
"This accomplishment is the culmination of over 20 years of effort," said Geoffrey Zweig, who manages the Speech & Dialog research group.
 
The milestone will have broad implications for consumer and business products that can be significantly augmented by speech recognition. That includes consumer entertainment devices like the Xbox, accessibility tools such as instant speech-to-text transcription and personal digital assistants such as Cortana.
 
"This will make Cortana (Microsoft personal assistant) more powerful, making a truly intelligent assistant possible," Shum said.
 
To reach the human parity milestone, the team used Microsoft's Computational Network Toolkit (CNTK), a home-grown system for deep learning that the research team has made available on GitHub via an open source license.
 
CNTK's ability to quickly process deep learning algorithms across multiple computers running a specialised chip called a graphics processing unit vastly improved the speed at which the team was able to do research and, ultimately, reach human parity.
 
Moving forward, the researchers are working on ways to make sure that speech recognition works well in more real-life settings.
 
That includes places where there is a lot of background noise, such as at a party or while driving on the highway.
 
In the longer term, researchers will focus on ways to teach computers not just to transcribe the acoustic signals that come out of people's mouths, but instead to understand the words they are saying.
 
"The next frontier is to move from recognition to understanding," Zweig said.
 
Disclaimer: Information, facts or opinions expressed in this news article are presented as sourced from IANS and do not reflect views of Moneylife and hence Moneylife is not responsible or liable for the same. As a source and news provider, IANS is responsible for accuracy, completeness, suitability and validity of any information in this article.

User

Book Ola or Uber directly from Google Search
People can now book Ola or Uber directly from Google Search from their smartphones in India, Google announced on Wednesday.
 
The users can find directions and choose from a menu of ride service options straight from Google Search app or through the mobile search browser.
 
This feature was made available on the latest version of Google Maps earlier this year.
 
"With this, users will be able to order, explore and make quick comparisons about the best estimated taxi fare directly from Google Search results on their mobile," said Sanket Gupta, Programme Manager, Google.
 
Commuters will get to see a tab for ride services, both Uber and Ola, with fare estimates and pick up times if a car is available near their location.
 
On request for specific information such as "Uber to Bangalore airport" or "Ola to Bangalore airport", the app/ browser directs the user to select a service and automatically opens the app to book the ride, all in just one tap.
 
In cases where the app is not installed on the device, Google Search will show a link to install the app.
 
Disclaimer: Information, facts or opinions expressed in this news article are presented as sourced from IANS and do not reflect views of Moneylife and hence Moneylife is not responsible or liable for the same. As a source and news provider, IANS is responsible for accuracy, completeness, suitability and validity of any information in this article.

User

COMMENTS

crajeshwari krishna

1 month ago

Op

How a Leading R&T Agent Let an Investor Down
We, as investors / consumers, tend to believe that when we are dealing with an established /...
Premium Content
Monthly Digital Access

Subscribe

Already A Subscriber?
Login
Yearly Digital+Print Access

Subscribe

Moneylife Magazine Subscriber or MSSN member?
Login

Yearly Subscriber Login

Enter the mail id that you want to use & click on Go. We will send you a link to your email for verficiation

We are listening!

Solve the equation and enter in the Captcha field.
  Loading...
Close

To continue


Please
Sign Up or Sign In
with

Email
Close

To continue


Please
Sign Up or Sign In
with

Email

BUY NOW

The Scam
24 Year Of The Scam: The Perennial Bestseller, reads like a Thriller!
Moneylife Magazine
Fiercely independent and pro-consumer information on personal finance
Stockletters in 3 Flavours
Outstanding research that beats mutual funds year after year
MAS: Complete Online Financial Advisory
(Includes Moneylife Magazine and Lion Stockletter)