Skip to content

Technology & Innovation — December 20, 2014

Chinese Search Engine Says It Has Reached Deep Learning Breakthrough

With its Deep Speech system, Chinese search giant Baidu claims to have created one of the most advanced and successful speech recognition programs in the world.

Robert Montenegro

Sign up for Big Think on Substack

The most surprising and impactful new stories delivered to your inbox every week, for free.

Subscribe

As reported by Derrick Harris of Gigaom, the Chinese search engine company Baidu claims to have made a major breakthrough in the realm of speech recognition software with its new Deep Speech system:

“Chinese search engine giant Baidu says it has developed a speech recognition system, called Deep Speech, the likes of which has never been seen, especially in noisy environments. In restaurant settings and other loud places where other commercial speech recognition systems fail, the deep learning model proved accurate nearly 81 percent of the time.

Featured Videos

That might not sound too great, but consider the alternative: commercial speech-recognition APIs against which Deep Speech was tested, including those for Microsoft Bing, Google and Wit.AI, topped out at nearly 65 percent accuracy in noisy environments.”

Baidu’s chief scientist is prudently avoiding getting too excited. He noted to Harris that the company’s research is still just research. Still, Baidu’s supposed breakthrough could mean a major shift down the line if Deep Speech gets incorporated into products such as smartphones.

Harris explains that the secret behind Deep Speech’s unprecedented accuracy is a massive trove of data plus the training of the system to deal with background noise:

“Baidu gathered about 7,000 hours of data on people speaking conversationally, and then synthesized a total of roughly 100,000 hours by fusing those files with files containing background noise. That was noise from a restaurant, a television, a cafeteria, and the inside of a car and a train.”

For more about Deep Speech and voice recognition software, be sure to check out Harris’ full article linked below. Included is a video demonstration of Baidu’s new software. It’s worth a look.

Read more at Gigaom

Photo credit: Carlos Amarillo / Shutterstock

Sign up for Big Think on Substack

The most surprising and impactful new stories delivered to your inbox every week, for free.

Subscribe

Related

a man sitting in a wheel chair next to a laptop.

Stephen Hawking’s famous voice belonged to this pioneering MIT scientist

Dennis Klatt developed trailblazing text-to-speech systems before losing his own voice to cancer.

A man and a woman in ancient attire sit at a table indoors, engaged in conversation; beside the jug, roses, and scroll lies a small straw man figure.

Mini Philosophy

Daniel Dennett’s 4 rules for a good debate

What’s the point in fighting a made up monster?

A woman sits on a chair against a white backdrop, with yellow graphics of brain wave patterns in the background.

Move your body, grow your brain: the science of hippocampal neurogenesis

“We know that as little as 10 minutes of walking can improve your mood, getting that bubble bath with the dopamine, serotonin, endorphins going. Anybody can do that.”

▸

01:16:33 min

—

with

black hole baby universe

Starts With A Bang

Ask Ethan: Was the Universe “timeless” before the Big Bang?

Here in our Universe, time passes at a fixed rate for all observers: one second-per-second. Before the Big Bang, things were very different.

Musical notes and prime numbers, including 2, 5, 17, 29, and 43, are arranged on a staff with blue horizontal lines on a light background, blending mathematics with music.

Mini Philosophy

From Messiaen to Radiohead: How great music uses prime numbers

The strange, undulating sound of mathematics.

Up Next

Personal Growth

Meditate While Lying in Bed For More Restful Sleep

American-born Buddhist monk Hwansan Sunim has written a series of articles with instructions and advice on how to meditate in various everyday postures and positions.