Google Home is the latest embodiment of a virtual assistant. The voice-activated speaker can help you make a dinner reservation, remind you to catch your flight, fire up your favorite playlist and even translate words for you on the fly. While the voice interface is expected to make quotidian tasks easier, it also gives the company unprecedented access to human patterns and preferences that are crucial to the next phase of artificial intelligence.
Comparing an AI agent to a personal assistant, as most companies have been doing of late, makes for a powerful metaphor -- one that signals the human capabilities most major technology companies want their disembodied helpers to adopt. Over the last couple of years, with improvements in speech-recognition technology, Siri, Cortana and Google Now have slowly learned to move beyond the basics of weather updates to take on more complex responsibilities like managing your calendar or answering your queries. But products that invade our personal spaces -- like Amazon's Echo and Google Home -- point to a larger shift in human-device interaction that is currently underway.
Onstage demos of Google Home, which has the company's Assistant built into it, suggest a conversational capability that requires an advanced understanding of human intent and context. The device relies almost entirely on the company's speech-recognition technology, which has been in the making for almost a decade, since the early days of GOOG-411. But over the years, that basic telephone-based directory search has grown into the much more complex Google Now.
The drastic jump in the Android assistant's capabilities has come from neural net training and deep learning techniques that have allowed scientists to boost speech-recognition technology to a point where it is now starting to learn the nuances of human behavior through the medium of voice.
Communicating with an outside entity by voice makes for an intimate and innately human experience. "Speech is the most dominant way that humanity has been communicating with each other," David Nahamoo, speech CTO at IBM Research, said over the phone. "When we communicate with the outside, we speak. But from outside to inside, we absorb information a lot better visually. It's because of our heritage and the evolution that we have gone through. From the standpoint of efficiency, speech is the quickest way to get a point across."
"Voice changes the way people interact with their systems." - Françoise Beaufays, Google
Devices like Echo and Google Home, for instance, are built on speech recognition that can help you stay heads-up and hands-free while you multitask around the house. So instead of spending time swiping and typing, you can tell the personal assistant what you need or what you're looking for. It's that kind of ease and productivity that companies dangle in front of users to get them to adopt chatbots and personal assistants in their daily communications, but talking to devices also opens the door to a new kind of relationship.
"I think voice changes the way people interact with their systems," says Françoise Beaufays, a research scientist who works on speech recognition at Google. "For a long time when people were typing in their browsers for information, they would write something cryptic like 'Eiffel Tower height,' for example." The string of seemingly random words would instantly pull up search results on google.com with pictures, details and dimensions of the iconic French structure. But when speech recognition started to take shape with smartphone assistants, Beaufays says there was a clear change in communication.
"As people started feeling comfortable with speech, instead of being cryptic they started saying: 'Hey, what is the height of the Eiffel Tower?' or 'How tall is the Eiffel Tower?'" she says. "We saw that switch in the way people were addressing their devices in speech first and typing next. Using your voice is bringing in a more discursive type of interaction, and even though you know very well it's a machine, you behave a little more human with it."
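The shift Beaufays describes -- from terse keyword strings to full conversational questions -- comes down to both phrasings carrying the same underlying intent. Here's a deliberately toy sketch of that idea (this is not Google's actual pipeline; the word lists and synonym table are invented for illustration):

```python
import re

# Invented, illustrative word lists -- not from any real search stack.
FILLER = {"hey", "what", "is", "the", "of", "how"}   # conversational padding
SYNONYMS = {"tall": "height"}                        # map phrasing variants

def normalize(query: str) -> set:
    """Reduce a query to its content words, so 'Eiffel Tower height'
    and 'How tall is the Eiffel Tower?' resolve to the same intent."""
    words = re.findall(r"[a-z]+", query.lower())
    words = [SYNONYMS.get(w, w) for w in words]
    return {w for w in words if w not in FILLER}

# All three phrasings collapse to the same content words.
print(normalize("Eiffel Tower height"))
print(normalize("How tall is the Eiffel Tower?"))
print(normalize("Hey, what is the height of the Eiffel Tower?"))
```

The point of the sketch is that the conversational forms add words without adding information -- which is why real assistants need models of natural language, not just keyword matching, to handle them gracefully.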
While a verbal exchange with a virtual assistant can make it easier to get things done, it also makes it easier for the companies to gain invaluable insight into the human world that's filled with vocal cues to feelings and preferences. "We're going from computing to understanding," says James Barrat, author of Our Final Invention: Artificial Intelligence and the End of the Human Era. "It's not just us chatting. These machines are listening to what we like and don't like, how we speak and what we speak about. It's greater access to how we think."
In the world of AI, data is the currency that will set one company apart from another. Through voice searches, millions of vocal samples become available to the companies that are fine-tuning personal assistants. The stream of information is fed back into the system to improve the accuracy of the algorithms, but it also gives the companies access to the complexities of human intent. In effect, using your voice to communicate with an AI helper only makes it smarter.
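That feedback loop -- voice queries coming in, verified transcripts going back into training -- can be sketched in a few lines of Python. Every name here is hypothetical; real speech systems are vastly more complex, and this only mirrors the loop described above:

```python
class SpeechModel:
    """Hypothetical stand-in for a speech-recognition system that
    accumulates training data from the queries it receives."""

    def __init__(self):
        self.training_pairs = []  # (audio, transcript) examples seen so far

    def transcribe(self, audio: bytes) -> str:
        # Placeholder for a real acoustic + language model.
        return "<best-guess transcript>"

    def learn(self, audio: bytes, verified_transcript: str) -> None:
        # Each corrected or confirmed interaction becomes fuel for the
        # next model revision -- the loop that makes the assistant smarter.
        self.training_pairs.append((audio, verified_transcript))

model = SpeechModel()
guess = model.transcribe(b"\x00\x01")          # user speaks a query
model.learn(b"\x00\x01", "how tall is the eiffel tower")  # signal verifies it
```

The design point is simply that the product and the training pipeline are the same system: using the assistant is what generates the data that improves it.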
A lot can be gleaned from vocal communication. Words and intonations start to give away user patterns, preferences and even emotions over time. That kind of insight into the mindset of the user is critical to the next wave of personalized AI that is already taking shape at companies like Google, Amazon and Facebook.
Smart, talking AIs at home will fire up the Internet of Things ecosystem, taking its devices from novelties to necessities. With companies aspiring to make their assistants omnipresent and their machines more interoperable, they need capable speech recognition to get the job done.
"There's a parallel thrust," says Vlad Sejnoha, CTO at Nuance Communications, one of the leaders in voice-recognition technologies. "You'll interact with your smart fridge or printer in a more natural way but also see a portable personal assistant that lives in a cloud and follows you around to help you navigate a complex world." Google Home, much like Amazon's Echo, already comes with partnerships that are useful around the house. You can use the speaker to control your Chromecast, Nest and Philips Hue lights.
In addition to navigating the immediate physical world, an omnipresent assistant could potentially become a gateway to unfamiliar settings or foreign languages too. In the spot aired during the Google event this week, the company demonstrated that Home has the ability to tap Google Translate to respond with accurate translations from English to Spanish. But whether the machine can comprehend foreign accents and translate in the reverse direction remains to be seen.
"These machines are listening to what we like and don't like, how we speak and what we speak about. It's greater access to how we think." - James Barrat
Failing to comprehend different accents has been one of the biggest shortcomings of most digital assistants on smartphones today. Scientists building these systems often cite the lack of data as one of the biggest obstacles to understanding new accents and languages. The copious amounts of information required to make that possible call for massive investments from the companies. Taking the technology straight to people's homes opens up a steady stream of data that can be used for tests back in the research labs.
A lot of the building blocks are starting to fall into place for devices like Google Home to become efficient personal assistants. And even though there's a need to be vigilant about the ways human-device interactions are starting to shift, most voice-interface developers believe it's a necessary change that will extend human capabilities.
"Having an AI that is your agent and helps you exist in the world better, gets you better information and services is hugely exciting," says Sejnoha. "As with anything, there are uses that can be negative -- we're all familiar with privacy and data mining. That's something we have to be thoughtful about, but the benefits far outweigh those scenarios."