Facebook’s BlenderBot chat AI no longer has the mental capacity of a goldfish

The bot can now recall past conversations that span weeks or even months.

Senior Editor

Fri, Jul 16, 2021, 9:30 AM·4 min read

Last April, Facebook’s AI research lab (FAIR) announced and released as open source its BlenderBot social chat app. While the neophyte AI immediately proved far less prone to racist outbursts than previous attempts, BlenderBot was not without its shortcomings. For one, the system had the recollection capacity of a goldfish — any subject or data point the AI wasn’t initially trained simply didn’t exist in its online reality, as evidenced by the OG BB’s continued insistence that Tom Brady still plays for the New England Patriots. For another, due to its limited knowledge of current events, the system had a strong tendency to hallucinate knowledge, like a digital Dunning-Kruger effect. But the advancements BlenderBot 2.0 displays, which FAIR debuted on Friday, should make the AI far more sociable, knowledgeable, and capable.

While BlenderBot 1.0 could only maintain its memory for a single discussion, its successor can remember topics of conversation over the course of multiple talks that can take days, weeks or even months to complete thanks to the implementation of a long-term memory module. What’s more, the AI can actively update its knowledge base by searching the internet for the latest news and details on any subject that the user wishes to speak about.

“BlenderBot 2 queries the Bing API for search results based on a generated search query, and conditions its response on the top few results,” Kurt Shuster, Research Engineer at Facebook AI, told Engadget. “We rely on Bing to provide high quality search results.” As such, BlenderBot 2.0 is now capable of speaking coherently about breaking news and new media, not just the data it was trained upon.

“BlenderBot 2 is limited only by what a powerful search engine can provide,” Jason Weston, Research Scientist at Facebook AI, added. So for example, if you are more interested in learning about Tom Yewcic (the Patriot’s combo QB/Punter from the 1962 season) than you are about Tom Brady, BB 2.0 has you covered. It’s the same with more scholarly subjects, like photosynthesis or redox reactions, Weston continued. So long as the information is available on the web, “there is no reason BlenderBot 2 cannot discuss this.”

By actively searching the internet for information, BlenderBot 2.0 can also reduce the instances in which it hallucinates knowledge. “Providing the system with more commonsense reasoning will allow BlenderBot to make sure it does not confuse subtle concepts,” Weston explained, “such as a movie director versus a producer or a pitching coach versus a hitting coach.”

The only wrinkle really occurs when discussing non-english based media, such as Demon Slayer: Kimetsu no Yaiba. “It is reasonable to conclude Bing will surface information about it and BlenderBot 2 can use that information accordingly,” he said. “We currently focus on english-based search results, so non-english references may not be fully covered.” The system will, however, recognize that Demon Slayer is of interest to you and will be more likely to bring up manga-centric subjects in future discussions.

FAIR has taken multiple steps to ensure that BlenderBot does not become the next Tay. “BlenderBot 2 does not learn directly from user input, as Tay did,” Shuster said. “We have taken extensive safety steps to ensure that BlenderBot 2 can handle adversarial users. Specifically, we employ both baked-in and two-stage techniques. BlenderBot 2 can detect itself if the incoming context will result in an offensive response, and additional safety layers where a safety classifier can detect if either the user input or the bot's output is offensive. Each handles the response appropriately.”

And while the system is currently focused on chewing its way through the English language corpus, FAIR does see BlenderBot does eventually extend to other languages as well. “While not in our immediate plans, the goal of our team is to build a superhuman conversationalist,” Shuster said. “This kind of agent requires multilingual understanding.”

Recent internal benchmarking processes found that BlenderBot 2.0 outperformed its predecessor by 17 percent in its engagingness score and 55 percent in its use of previous conversation sessions according to human evaluators, per a Friday blog from FAIR. What’s more, BlenderBot's rate of knowledge hallucinations dropped from 9 percent (!) in BB 1.0 down to just 3 percent in the current iteration.

Looking ahead, “humans interacting with AI systems via discourse is the future of AI,” Weston asserted, “and ensuring that humans have an engaging, informative experience is critical to that future. BlenderBot 2 combines the engagingness of BlenderBot 1.0 with the knowledge capabilities of a system with access to the entire internet, so ostensibly we are on the right track.”

Engadget
The 5 best mechanical keyboards for 2024
Here's everything you need to know before buying a mechanical keyboard, plus the best mechanical keyboards you can get right now.
1h ago
Engadget
Slack rolls out its AI tools to all paying customers
Slack just rolled out its AI tools to all paying customers, after releasing it to just a subset of users. These features include thread summaries and conversational search.
1h ago
Engadget
Playdate developers have made more than $500K in Catalog sales
Panic is celebrating Playdate's second birthday this month, and the party favors include some piping-hot statistics about Catalog game sales.
1h ago
Engadget
The Morning After: Is the new Zephyrus G16 any good?
The biggest news stories of the day include Nintendo emulators on the App Store, silly AI bots and Sony's change of heart.
2h ago
Engadget
Nothing's Ear and Ear (a) earbuds with active noise cancellation are now available for pre-order
Nothing has introduced the Ear and the Ear (a) earbuds at an event in Tokyo.
2h ago
Engadget
EU criticizes Meta's 'privacy for cash' business model
The European Data Protection Board states Meta and other big players should forego "consent or pay" models.
3h ago
Engadget
The best Bluetooth speaker for 2024: 15 portable options for every price range
The Bluetooth speaker space is oversaturated at this point. We set out to find the best portable Bluetooth speakers across a number of different price ranges and uses cases. These are our favorites.
7 months ago
Engadget
Google fired 28 workers who protested Israeli government cloud contract
Google has fired 28 employees involved in protests against the company's "Project Nimbus" cloud contract with the Israeli government.
4h ago
Engadget
Supergiant shows off Hades II's gameplay and new god designs
Supergiant Games just treated Hades fans to an extensive look at the game's upcoming sequel.
6h ago
Engadget
Twitch is giving all users access to its discovery feed later this month
The feed will first appear as a new tab in the mobile app.
9h ago
Engadget
Media coalition asks the feds to investigate Google’s removal of California news links
The News/Media Alliance asked US federal agencies to investigate Google’s removal of links to California news media outlets. Google’s tactic is in response to the proposed California Journalism Preservation Act, which would require it to pay for links to California-based publishers’ news content.
16h ago
Engadget
TikTok is trying to clean up its ‘For You’ recommendations
TikTok is ramping up penalties for creators who post potentially “problematic” content and tightening its rules around what can be recommended in the app.
16h ago
Engadget
Nintendo emulator Delta hits the iOS App Store, no sideloading required
Delta (a successor to GBA4iOS) is one of the best-known Nintendo emulators on iOS. Now, you can download Delta for free from the App Store without having to sideload it.
17h ago
Engadget
Amazon says a whopping 140 third-party stores in four countries use its Just Walk Out tech
Amazon published a blog post on Wednesday providing an update about its Just Walk Out technology, which it reportedly pulled from its Fresh grocery stores earlier this month. While extolling Just Walk Out’s virtues as a sales pitch to potential retail partners, the article lists a startlingly minuscule number of businesses using the tech.
18h ago
Engadget
There’s a TV show coming based on Sega's classic arcade game Golden Axe
Comedy Central just greenlit a cartoon based on the classic Sega arcade game Golden Axe. It stars Danny Pudi, Carl Tart, Lisa Gilroy and Matthew Rhys, among others.
18h ago
Engadget
Cheaper Evercade retro consoles will arrive in July
Cheaper versions of Evercade's retro game consoles are on the way. The first three Tomb Raider games will be bundled with the EXP-R and VS-R.
19h ago
Engadget
LG's S95TR soundbar with wireless Dolby Atmos is now available for $1,500
LG's S95TR Dolby Atmos soundbar isn't cheap, but at least it includes a subwoofer and rear surround speakers in the box.
19h ago
Engadget
Apple renews For All Mankind and announces a spinoff series set in the Soviet Union
Apple has renewed For All Mankind for a fifth season. The company also announced a spinoff series that follows the Russian space program.
19h ago
Engadget
X’s AI bot is so dumb it can’t tell the difference between a bad game and vandalism
After misinterpreting user posts about Klay Thompson's poor shooting during an NBA game, X's AI bot Grok created a fictitious story on the social media platform's trending section.
20h ago
Engadget
Creepy monitoring service sells searchable Discord user data for as little as $5
A data scraper is selling information on what it claims to be 600 million Discord users. A report from 404 Media details Spy Pet, an online service that gathers, stores and sells troves of information from the social platform.
20h ago

Facebook’s BlenderBot chat AI no longer has the mental capacity of a goldfish

The bot can now recall past conversations that span weeks or even months.

Latest Stories

The 5 best mechanical keyboards for 2024

Slack rolls out its AI tools to all paying customers

Playdate developers have made more than $500K in Catalog sales

The Morning After: Is the new Zephyrus G16 any good?

Nothing's Ear and Ear (a) earbuds with active noise cancellation are now available for pre-order

EU criticizes Meta's 'privacy for cash' business model

The best Bluetooth speaker for 2024: 15 portable options for every price range

Google fired 28 workers who protested Israeli government cloud contract

Supergiant shows off Hades II's gameplay and new god designs

Twitch is giving all users access to its discovery feed later this month

Media coalition asks the feds to investigate Google’s removal of California news links

TikTok is trying to clean up its ‘For You’ recommendations

Nintendo emulator Delta hits the iOS App Store, no sideloading required

Amazon says a whopping 140 third-party stores in four countries use its Just Walk Out tech

There’s a TV show coming based on Sega's classic arcade game Golden Axe

Cheaper Evercade retro consoles will arrive in July

LG's S95TR soundbar with wireless Dolby Atmos is now available for $1,500

Apple renews For All Mankind and announces a spinoff series set in the Soviet Union

X’s AI bot is so dumb it can’t tell the difference between a bad game and vandalism

Creepy monitoring service sells searchable Discord user data for as little as $5

About

Sections

Contribute

Buying Guides