WaveNet

Latest

  • Ociacia via Getty Images

    Pretty sure Google's new talking AI just beat the Turing test

    by 
    Andrew Tarantola
    Andrew Tarantola
    05.08.2018

    So that whole Turing test metric, wherein we gauge how human-like an AI system appears to be based on its ability to mimic our vocal affectations? At the 2018 I/O developers conference Tuesday, Google utterly dismantled it. The company did so by having its AI-driven Assistant book a reservation. On the phone. With a live, unsuspecting human on the other end of the line. And it worked flawlessly.

  • NurPhoto via Getty Images

    Google's new text-to-speech service has more realistic voices

    by 
    Mallory Locklear
    Mallory Locklear
    03.27.2018

    Google will now let developers use the text-to-speech synthesis that powers the voices in Google Assistant and Maps. Cloud Text-to-Speech is available now through the Google Cloud Platform and the company says it can be used to power voice response systems in call centers, enable IoT device speech and convert media like news articles and books into a spoken format. There are 32 different voice options in 12 languages and users can customize pitch, speaking rate and volume gain.

  • artBoffin

    This is what AI sees and hears when it watches 'The Joy of Painting'

    by 
    Andrew Tarantola
    Andrew Tarantola
    04.06.2017

    Computers don't dream of electric sheep, they imagine the dulcet tones of legendary public access painter, Bob Ross. Bay Area artist and engineer Alexander Reben has produced an incredible feat of machine learning in honor of the late Ross, creating a mashup video that applies Deep Dream-like algorithms to both the video and audio tracks. The result is an utterly surreal experience that will leave you pinching yourself.

  • Getty Images

    Baidu's Deep Voice can quickly synthesize realistic human speech

    by 
    Mariella Moon
    Mariella Moon
    03.09.2017

    Baidu has been quietly working on other projects besides self-driving cars at its AI center in Silicon Valley, and now it has revealed one of them to MIT's Technology Review. Apparently, the Chinese tech titan has created a text-to-speech system called Deep Voice that's faster and more efficient than Google's WaveNet. The company says Deep Voice can be trained to speak in just a few hours with little to no human interaction. And since Baidu can control how it speaks to convey different emotions, it can (quickly) synthesize speech that sounds pretty natural and realistic.

  • JUNG YEON-JE/AFP/Getty Images

    Google DeepMind's AI can mimic realistic human speech

    by 
    Mariella Moon
    Mariella Moon
    09.10.2016

    It's still pretty easy to tell whether it's a real person who's talking or a text-to-speech program. But there might come a time when a robot could dupe you into thinking that you're speaking with a real person, thanks to a new AI called WaveNet developed by Google's DeepMind team. They have a pretty good track record when it comes to building neural networks -- you probably know them as the folks who created AlphaGo, the AI that defeated one of the world's best Go players.