Pretty sure Google's new talking AI just beat the Turing test

Fool me once, shame on you; fool me 150-plus times, that's some sort of record.

Former Senior Editor

Updated Tue, May 8, 2018, 6:04 PM·5 min read

I laughed along with most of the audience at I/O 2018 when, in response to a

So that whole Turing test metric, wherein we gauge how human-like an AI system appears to be based on its ability to mimic our vocal affectations? At the 2018 I/O developers conference Tuesday, Google utterly dismantled it. The company did so by having its AI-driven Assistant book a reservation. On the phone. With a live, unsuspecting human on the other end of the line. And it worked flawlessly.

During the on-stage demonstration, Google played calls to a number of businesses, including a hair salon and a Chinese restaurant. At no point did either of the people on the other end of the line appear to suspect that the entity they were interacting with was a bot. And how could they when the Assistant would even throw in random "ums," "ahhs" and other verbal fillers people use when they're in the middle of a thought? According to the company, it's already generated hundreds of similar interactions over the course of the technology's development.

This robo-vocalization breakthrough comes as the result of Google's Duplex AI system, which itself grew out of earlier Deep Learning projects such as WaveNet. As with Google's other AI programs, like AlphaGo, Duplex is designed to perform a narrowly defined task but do it better than any human. In this case, that task is talking to people over the phone.

Duplex's initial application, according to the company, will be in automated customer-service centers. So rather than repeatedly shouting "operator" into your handset to get competent help the next time you call your bank, cable provider or power company, the digital agent will be able to assist you directly. "For such tasks, the system makes the conversational experience as natural as possible, allowing people to speak normally, like they would to another person, without having to adapt to a machine," the release read.

But don't expect to be able to ask the agent any random question that pops into your head. "Duplex can only carry out natural conversations after being deeply trained in such domains," a Google release points out. "It cannot carry out general conversations." Fingers crossed that the company trained Duplex to at least say "does not compute" in such situations.

This news comes as the customer-service industry continues to lean on automation and robotics to reduce operating costs and streamline wait times. Everyone from DoNotPay and Kodak to Microsoft and Facebook are looking to leverage chatbots and other automated respondents to their advantage. It's not even that hard to simply code your own. However, like every technology before it, Duplex can only be as useful as we make it. And unfortunately, we've not had a particularly sterling record when it comes to training AIs to be upstanding members of online society. Yes, Tay, I'm looking right at you. And your racist chatbot buddy, Zo, too.

Of course, casual racism is only one of a number of ethical and societal challenges that such emerging technologies face. As always, there's the issue of privacy and data collection. "To obtain its high precision, we trained Duplex's RNN on a corpus of anonymized phone-conversation data," Google's release explained. This AI wasn't taught how to talk like people just by conversing with its developers (we're still years away from that level of machine-learning, unfortunately). No, it was trained on hundreds, if not thousands, of hours of recorded phone conversations skimmed from opted-in customer calls. Which brings us, yet again, back to the same debate we've been having for the better part of a decade between maintaining personal data privacy and advancing the boundaries of technological convenience.

Existential solutions to the tragedy of the public commons aside, the emergence of Duplex AI exposes a number of tangible issues that should probably be addressed before we start talking to machines like we do other people. For one thing, WaveNet is really, really good. WaveNet is Google's voice synthesizing program, and unlike conventional text-to-speech engines -- which have a voice actor basically read through the dictionary in order to populate a database of words and speech fragments -- it relies on a smaller grouping of raw audio waveforms on which the computer's words and responses are built. This results in a system that is faster to train and boasts a broader, more natural-sounding response range.

To illustrate, remember when TomToms were still a thing and had installable voices like James Earl Jones or Morgan Freeman to give you driving directions? Those were generated using the conventional "read a dictionary" method and were unable to pass for the real person given their stiff intonations. The new John Legend voice available for Google Assistant sounds far more like the man himself thanks to WaveNet's capabilities.

But what's to prevent someone from illicitly leveraging recordings of a public figure to train an AI and generate falsified audio? I'm not saying we're facing a "my name is my password" situation just yet, but given that people are already able to fake images and video (even porn), being able to incorporate a layer of false audio will only make Google and Facebook's content-moderation jobs even harder. Especially since their omega-level objectionable-content censors are human -- exactly whom the technology is designed to fool.

That said, the mere possibility of misuse should not be a deal-breaker for this technology (or any other). Duplex AI has the potential to revolutionize how we interact with computer systems, effectively making tactile inputs like keyboards, mice and touchpads obsolete. Just, next time a "family member" calls out of the blue asking for your Social Security number, maybe try to independently confirm their identities first by asking them a series of unrelated questions.

Click here to catch up on the latest news from Google I/O 2018!

Images: Google (diagram)

Engadget
Doctor Who: The Devil’s Chord review: Is this madness?
'The Devil's Chord' is a mess, but it's probably an intentional mess.
Engadget
Doctor Who Space Babies review: Bet you didn’t expect that
'Space Babies' is crazy, chaotic and political. Just as Doctor Who should be.
Engadget
Apple’s big AI rollout at WWDC will reportedly focus on making Siri suck less
Apple will reportedly focus its first round of generative AI enhancements on beefing up Siri’s conversational chops. The company will reportedly roll out a new version of Siri powered by generative AI at its WWDC keynote on June 10.
Engadget
Samsung HW-Q990D soundbar review: A small but significant update
Samsung's home theater powerhouse got the one thing it was missing, but not much else for this year's model.
Engadget
Climate protestors clash with police outside Tesla’s German gigafactory
Climate protestors in Germany reportedly broke through police barricades on Friday, amid clashes between activists and law enforcement. The protestors either made it onto (according to protestors) or near (according to local police) the grounds of a Tesla gigafactory in Grünheide, Germany, near Berlin.
Engadget
The world’s largest direct carbon capture plant just went online
Climeworks has just opened the world’s largest direct carbon capture plant. It can suck around 36,000 tons of CO2 from the air each year, burying it underground.
Engadget
Apple's entire AirPods lineup is discounted, plus the rest of the week's best tech deals
This week's best tech deals include the AirPods Pro for $180, the Amazon Kindle for $80 and a year of Paramount+ with Showtime for $60, among others.
Engadget
Amazon's Echo Dot drops to just $28
Amazon's Echo Dot has dropped to $28, which is a great price for our favorite sub-$50 smart speaker.
Engadget
Hulu's Black Twitter documentary is a vital cultural chronicle
Hulu's new documentary, "Black Lives Matter: A People's History," explores the rise and global influence of the community.
Engadget
Google just patched the fifth zero-day exploit for Chrome this year
Google just patched the fifth zero-day exploit for Chrome this year. The company has released a security update for users to download.
Engadget
The Rogue Prince of Persia is delayed because Hades II is a juggernaut
The Rogue Prince of Persia was supposed to debut in early access on May 14, but Evil Empire and Ubisoft have delayed it to get out of the way of Hades II.
Engadget
Engadget Podcast: Is the iPad Pro M4 overkill?
The Rabbit R1 is finally here, and it's yet another useless AI gadget.
Engadget
The Morning After: Apple apologizes for its iPad Pro ad that crushed human creativity
The biggest news stories this morning: How to watch Google's I/O 2024 keynote, Apple apologizes for its iPad Pro ad, Nintendo is done paying Elon Musk for X integration on its consoles.
Engadget
Microsoft's web-based mobile game store opens in July
Microsoft is launching a browser-based store to make sure it's "accessible across all devices, all countries, no matter what."
Engadget
The best smart plugs in 2024
To find out which smart plug you should invite into your connected home, we tested ten, using Alexa, Siri and the Google Assistant.
Engadget
Jack Dorsey claims Bluesky is 'repeating all the mistakes' he made at Twitter
Just in case there was any doubt about how Jack Dorsey really feels about Bluesky, the former Twitter CEO has offered new details on why he left the board and deleted his account.
Engadget
Apple apologizes for its tone-deaf ad that crushed human creativity to make an iPad
Apple has reportedly apologized for its tone-deaf “Crush!” ad that sparked a furious backlash — especially with artists. AdAge reports that Apple said the video “missed the mark” and has scrapped plans to run the cutesy-turned-cringey commercial on TV.
Engadget
Get up to $450 off a Google Pixel Tablet when you trade in your old iPad or Android slab
Google has an offer for iPad owners who are curious about the Pixel Tablet. The company has a trade-in promotion that covers at least the cost of the Pixel Tablet — if not more, depending on which iPad model you have.
Engadget
The best midrange smartphones for 2024
Here's a list of the best midrange smartphones you can buy, as chosen by Engadget editors.
Engadget
Alienware m16 R2 review: When less power makes for a better laptop
Despite featuring a less powerful top-end GPU than the previous model, the new m16 R2's sleeker design and better battery make it a better overall system.

Pretty sure Google's new talking AI just beat the Turing test

Fool me once, shame on you; fool me 150-plus times, that's some sort of record.

Latest Stories

Doctor Who: The Devil’s Chord review: Is this madness?

Doctor Who Space Babies review: Bet you didn’t expect that

Apple’s big AI rollout at WWDC will reportedly focus on making Siri suck less

Samsung HW-Q990D soundbar review: A small but significant update

Climate protestors clash with police outside Tesla’s German gigafactory

The world’s largest direct carbon capture plant just went online

Apple's entire AirPods lineup is discounted, plus the rest of the week's best tech deals

Amazon's Echo Dot drops to just $28

Hulu's Black Twitter documentary is a vital cultural chronicle

Google just patched the fifth zero-day exploit for Chrome this year

The Rogue Prince of Persia is delayed because Hades II is a juggernaut

Engadget Podcast: Is the iPad Pro M4 overkill?

The Morning After: Apple apologizes for its iPad Pro ad that crushed human creativity

Microsoft's web-based mobile game store opens in July

The best smart plugs in 2024

Jack Dorsey claims Bluesky is 'repeating all the mistakes' he made at Twitter

Apple apologizes for its tone-deaf ad that crushed human creativity to make an iPad

Get up to $450 off a Google Pixel Tablet when you trade in your old iPad or Android slab

The best midrange smartphones for 2024

Alienware m16 R2 review: When less power makes for a better laptop

About

Sections

Contribute

Buying Guides