Facebook releases its 'Blender' chatbot as an open-source project

It could help tomorrow's AI converse more naturally with people.

Former Senior Editor

Wed, Apr 29, 2020, 11:00 AM·5 min read

The virtual assistants that inhabit our smartphones are helpful, sure, but they’re not going to pass the Turing test any time soon. They’re designed for understanding specific commands and actions like checking on restaurant reservations or getting updates on the weather, rather than, say, carrying on an in-depth conversation with a human. But chatbots could soon become far more loquacious thanks to Facebook, which this morning released a startlingly lifelike chatbot that it’s been developing, dubbed Blender, as an open-source resource for AI research.

Facebook has been pouring money and resources into its natural language processing technologies for a few years now, and those efforts appear to have paid off. The company claims that Blender is the single largest open-source chatbot created to date. Its largest model has a whopping 9.4 billion parameters -- nearly 4x as many as Google’s Meena and more than 10x as many as the previous largest open-source chatbot available on the internet.

“One of the recent findings in the area of NLP, and AI in general, has been that as you scale, as these neural network models [get] larger and larger, they tend to perform better,” Stephen Roller, a research engineer at Facebook’s AI lab (FAIR), told Engadget. “We had a number of issues when we were trying to train this thing. When you start to get that large, these things no longer are able to fit on a single GPU anymore.”

Instead, the team had to devise a means of siloing various aspects of the neural network being trained and working them in parallel across a number of devices while maintaining the overall efficiency of the network as a whole. “There are a lot of sophisticated techniques that you have to use in terms of how you chop this thing up,” Roller continued. “If you split it over different devices and chop it up like the wrong way, then you're going to lose that efficiency that you have and you're not going to be able to scale to these terabyte-sized data sets that we've been working with.”

However, size is only important up to a point. “One interesting thing we found is scale isn't everything,” Roller said. “For instance, our 2.7 billion parameter model actually slightly outperforms the 9.4 billion parameter model. What seems to be really important is that you scale these things at least until a certain point, but then it becomes supercritical to imbue these things with specialized skills and behaviors [to further refine its performance].”

FAIR has focused on three specific behaviors -- the ability to display empathy, personality and knowledge -- to further humanize Blender’s responses. What sets Blender apart is less that it can produce those three behaviors than that it can switch seamlessly between them as the conversation progresses, thanks to its unique Blended Skill Talk feature.

“We, in the past two years of research, have designed tasks for each one of these skills,” Emily Dinan, a research engineer at FAIR, told Engadget. “This is the first time we've really shown that you can blend all of these aspects of conversation seamlessly in one. Our evaluation setup showed that models that were fine-tuned on these nice conversational skill datasets are more engaging and considered more human, more lifelike than models which were not.”

This means that Blender is emotionally smart enough to know to congratulate you if you tell it you just got a promotion at work and offer condolences when you reveal that your dog just died. FAIR has also taught it to give more than rote cursory responses when asked about a particular subject. For example, if you ask Google Assistant about Led Zeppelin, it will typically read off the first couple of lines from the band’s Wikipedia page. “So, we designed a data set that was intending to go beyond this sort of surface level of chitchat and go more in depth about a topic, and sort of imbue these models with more knowledge about the world,” Dinan said.


The research team first pre-trained Blender using the Wizard of Wikipedia system to impart a broad general knowledge base into the chatbot, then fine-tuned it with data that “specifically was collected by having two humans talk to each other about a given topic,” Dinan continued. This helped to normalize the bot’s speech patterns so its replies sounded natural rather than like a robot reading encyclopedia entries at you.

“Getting this great supervised conversational data is really expensive,” Dinan admitted. “And so we typically only have small amounts of this data, relative to the data that we use to pre-train the model.”

And if Blade Runner taught us anything, it’s that your synthetic lifeform is only as convincing as its best backstory, which is why FAIR has started adding backstories to its chatbots. “This is a way of imbuing the bots with a specific personality and showing more personality,” Dinan said. “And so this data set was designed by giving two humans character descriptions like… being a basketball lover from Michigan with three kids.” Keeping the bots “in character” reduces the rate of logical faults produced by the AI system, such as first saying that it owns three cats and then turning around with its next response and denying ever having lived with a trio of felines.

So far, people who have interacted with Blender seem to prefer it to other open-source models. During a recent test between it and Meena using the ACUTE-Eval method, 67 percent of respondents found Blender to sound more human, and 75 percent would rather engage with Blender than with Meena for longer conversations.
