Latest in Gear

Image credit:

Google's Translatotron can translate speech in the speaker's voice

It's the first tool that can directly translate speech from one language into another.
AJ Dellinger, @ajdell
May 15, 2019
1 Shares
Share
Tweet
Share

Sponsored Links

NurPhoto via Getty Images

Speaking another language may be getting easier. Google is showing off Translatotron, a first-of-its-kind translation model that can directly convert speech from one language into another while maintaining a speaker's voice and cadence. The tool forgoes the usual step of translating speech to text and back to speech, which can often lead to errors along the way. Instead, the end-to-end technique directly translates a speaker's voice into another language. The company is hoping the development will open up future developments using the direct translation model.

According to Google, Translatotron uses a sequence-to-sequence network model that takes a voice input, processes it as a spectrogram -- a visual representation of frequencies -- and generates a new spectrogram in a target language. The result is a much faster translation with less likelihood of something getting lost along the way. The tool also works with an optional speaker encoder component, which works to maintain a speaker's voice. The translated speech is still synthesized and sounds a bit robotic, but can effectively maintain some elements of a speaker's voice. You can listen to samples of Translatotron's attempts to maintain a speaker's voice as it completes translations on Google Research's GitHub page. Some are certainly better than others, but it's a start.

Model architecture of Translatotron

Google has been fine-tuning its translations in recent months. Last year, the company introduced accents in Google Translate that can speak a variety of languages in region-based pronunciations and added more langauges to its real-time translation feature. Earlier this year, Google Assistant got an "interpreter mode" for smart displays and speakers that can translate between 26 languages.

All products recommended by Engadget are selected by our editorial team, independent of our parent company. Some of our stories include affiliate links. If you buy something through one of these links, we may earn an affiliate commission.
Comment
Comments
Share
1 Shares
Share
Tweet
Share

Popular on Engadget

The 2020 Engadget Holiday Gift Guide

The 2020 Engadget Holiday Gift Guide

View
DoorDash lets restaurants add their own delivery drivers to its service

DoorDash lets restaurants add their own delivery drivers to its service

View
Justice Department sues Facebook over H-1B hiring practices

Justice Department sues Facebook over H-1B hiring practices

View
macOS Big Sur review: A mix of new and familiar

macOS Big Sur review: A mix of new and familiar

View
The gold, 8th-generation iPad returns to $299 at Amazon

The gold, 8th-generation iPad returns to $299 at Amazon

View

From around the web

Page 1Page 1ear iconeye iconFill 23text filevr