Listen to New Google AI Program Talk Like a Human and Write Music
Google's DeepMind creates AI that blows away existing speech synthesizers.
Google-owned artificial intelligence company DeepMind presented a deep neural network that generates amazingly human-like speech. Called WaveNet, this AI makes a significant advancement over existing speech synthesizers. What’s more, it can write pretty good classical music.
DeepMind is a British company, previously known for creating machine-learning AI software that beat the world champion of the notoriously intricate game Go. Machine learning allows computer systems to teach themselves and make predictions based on gathered data.
The company claims that WaveNet can mimic any human voice and narrows the gap between existing synthesizers and human speech by more than 50%. In Google’s 500-person blind test, listeners rated WaveNet’s English speech at 4.21 on a five-point scale (5 being realistic human speech), while concatenative speech scored 3.86 and parametric speech an even lower 3.67.
WaveNet also generated speech in Mandarin, which got similar results.
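The English scores above can be turned into a rough check of the "more than 50%" claim. The sketch below assumes the gap is measured from the best earlier system (concatenative, at 3.86) up to a human reference score. The article does not say what score real human recordings received, so that value is left as a parameter, and the function name `gap_closed` is hypothetical.

```python
def gap_closed(new_score, baseline_score, human_score):
    """Fraction of the baseline-to-human gap that the new system closes."""
    return (new_score - baseline_score) / (human_score - baseline_score)

# Scores quoted in the article (English, five-point scale).
WAVENET = 4.21
CONCATENATIVE = 3.86
PARAMETRIC = 3.67

best_baseline = max(CONCATENATIVE, PARAMETRIC)

# Assumption: treat the top of the scale (5) as the human reference.
share_vs_scale_top = gap_closed(WAVENET, best_baseline, 5.0)

# Human reference score below which the closed share exceeds 50%.
threshold = best_baseline + (WAVENET - best_baseline) / 0.5

print(f"Share of gap closed against a perfect 5: {share_vs_scale_top:.1%}")
print(f"Claim holds if human speech scored below: {threshold:.2f}")
```

Measured against a perfect 5, WaveNet closes only about 31% of the gap. Under these assumptions, the 50% figure works out only if real human recordings themselves scored below roughly 4.56 in the same test.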
DeepMind did this by re-imagining the text-to-speech (TTS) processes currently in use. The two most common are concatenative TTS, used by Apple’s Siri, which stitches together pre-recorded fragments of speech, and parametric TTS, which generates speech through computer algorithms and sounds even less natural.
What’s different about WaveNet is that it can directly model the raw waveform of an audio signal, an extremely complicated task that required a novel neural network. WaveNet learns from voice recordings, then creates speech on its own. This independence also allows the program to generate other kinds of audio, like music.
To bolster its claim, DeepMind released some samples, comparing WaveNet with samples made by concatenative and parametric TTS. You be the judge.
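To give a feel for what learning from a recording and then generating a raw waveform means, here is a toy sketch. It is not WaveNet's neural network: it swaps in a two-coefficient linear predictor, fits it to a synthetic pure tone standing in for a recording, and then produces new samples one at a time, each computed from the two before it. All function names and the training signal are hypothetical.

```python
import math

def fit_two_tap_predictor(samples):
    """Least-squares fit of x[t] ~ a * x[t-1] + b * x[t-2]."""
    s11 = s12 = s22 = r1 = r2 = 0.0
    for t in range(2, len(samples)):
        p1, p2, y = samples[t - 1], samples[t - 2], samples[t]
        s11 += p1 * p1
        s12 += p1 * p2
        s22 += p2 * p2
        r1 += p1 * y
        r2 += p2 * y
    # Solve the 2x2 normal equations with Cramer's rule.
    det = s11 * s22 - s12 * s12
    a = (r1 * s22 - r2 * s12) / det
    b = (r2 * s11 - r1 * s12) / det
    return a, b

def generate(seed, coeffs, count):
    """Produce `count` new samples, each predicted from the previous two."""
    a, b = coeffs
    out = list(seed[-2:])
    for _ in range(count):
        out.append(a * out[-1] + b * out[-2])
    return out[2:]

# Stand-in "recording": a pure tone with 20 samples per cycle.
STEP = 2 * math.pi / 20
recording = [math.sin(STEP * t) for t in range(200)]

coeffs = fit_two_tap_predictor(recording)   # learn from the recording
generated = generate(recording, coeffs, 40)  # then continue it sample by sample

print("learned coefficients:", coeffs)
print("first generated samples:", [round(x, 3) for x in generated[:5]])
```

A pure tone is simple enough that two coefficients capture it fully. Speech is vastly more complex, which is why the article describes the real task as requiring a novel neural network and serious computing power.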
And now, this is what WaveNet generated:
After it was trained on a dataset of classical piano music, WaveNet produced these intriguing musical creations of its own:
What are the implications of this new tech? While it may mean our eventual robotic overlords will be easier to talk to, virtual AI assistants like Siri or Cortana could benefit sooner. Google isn’t promising this is headed straight to such applications, however, as WaveNet requires serious computing power.
This achievement again shows the potential of DeepMind's neural networks, which can be, and already are being, used for fraud and spam detection, handwriting recognition, image search, translation and other tasks.
In a very Google move, the paper on WaveNet is available on Google Drive here.
Want to know more about DeepMind? Check out this video: