deepvoice2

Latest

  • Jason Lee / Reuters

    Baidu’s text-to-speech system mimics a variety of accents ‘perfectly'

    by 
    Timothy J. Seppala
    Timothy J. Seppala
    05.25.2017

    Chinese tech giant Baidu's text-to-speech system, Deep Voice, is making a lot of progress toward sounding more human. The latest news about the tech are audio samples showcasing its ability to accurately portray differences in regional accents. The company says that the new version, aptly named Deep Voice 2, has been able to "learn from hundreds of unique voices from less than a half an hour of data per speaker, while achieving high audio quality." That's compared to the 20 hours hours of training it took to get similar results from the previous iteration, for a single voice, further pushing its efficiency past Google's WaveNet in a few months time.