WaveNet
Latest
Pretty sure Google's new talking AI just beat the Turing test
So that whole Turing test metric, wherein we gauge how human-like an AI system appears to be based on its ability to mimic our vocal affectations? At the 2018 I/O developers conference Tuesday, Google utterly dismantled it. The company did so by having its AI-driven Assistant book a reservation. On the phone. With a live, unsuspecting human on the other end of the line. And it worked flawlessly.
Google's new text-to-speech service has more realistic voices
Google will now let developers use the text-to-speech synthesis that powers the voices in Google Assistant and Maps. Cloud Text-to-Speech is available now through the Google Cloud Platform and the company says it can be used to power voice response systems in call centers, enable IoT device speech and convert media like news articles and books into a spoken format. There are 32 different voice options in 12 languages and users can customize pitch, speaking rate and volume gain.
This is what AI sees and hears when it watches 'The Joy of Painting'
Computers don't dream of electric sheep, they imagine the dulcet tones of legendary public access painter, Bob Ross. Bay Area artist and engineer Alexander Reben has produced an incredible feat of machine learning in honor of the late Ross, creating a mashup video that applies Deep Dream-like algorithms to both the video and audio tracks. The result is an utterly surreal experience that will leave you pinching yourself.
Baidu's Deep Voice can quickly synthesize realistic human speech
Baidu has been quietly working on other projects besides self-driving cars at its AI center in Silicon Valley, and now it has revealed one of them to MIT's Technology Review. Apparently, the Chinese tech titan has created a text-to-speech system called Deep Voice that's faster and more efficient than Google's WaveNet. The company says Deep Voice can be trained to speak in just a few hours with little to no human interaction. And since Baidu can control how it speaks to convey different emotions, it can (quickly) synthesize speech that sounds pretty natural and realistic.
Google DeepMind's AI can mimic realistic human speech
It's still pretty easy to tell whether it's a real person who's talking or a text-to-speech program. But there might come a time when a robot could dupe you into thinking that you're speaking with a real person, thanks to a new AI called WaveNet developed by Google's DeepMind team. They have a pretty good track record when it comes to building neural networks -- you probably know them as the folks who created AlphaGo, the AI that defeated one of the world's best Go players.