Facebook’s BlenderBot chat AI no longer has the mental capacity of a goldfish

The bot can now recall past conversations that span weeks or even months.

Former Senior Editor

Fri, Jul 16, 2021, 9:30 AM·4 min read

Last April, Facebook’s AI research lab (FAIR) announced and released as open source its BlenderBot social chat app. While the neophyte AI immediately proved far less prone to racist outbursts than previous attempts, BlenderBot was not without its shortcomings. For one, the system had the recollection capacity of a goldfish — any subject or data point the AI wasn’t initially trained simply didn’t exist in its online reality, as evidenced by the OG BB’s continued insistence that Tom Brady still plays for the New England Patriots. For another, due to its limited knowledge of current events, the system had a strong tendency to hallucinate knowledge, like a digital Dunning-Kruger effect. But the advancements BlenderBot 2.0 displays, which FAIR debuted on Friday, should make the AI far more sociable, knowledgeable, and capable.

While BlenderBot 1.0 could only maintain its memory for a single discussion, its successor can remember topics of conversation over the course of multiple talks that can take days, weeks or even months to complete thanks to the implementation of a long-term memory module. What’s more, the AI can actively update its knowledge base by searching the internet for the latest news and details on any subject that the user wishes to speak about.

“BlenderBot 2 queries the Bing API for search results based on a generated search query, and conditions its response on the top few results,” Kurt Shuster, Research Engineer at Facebook AI, told Engadget. “We rely on Bing to provide high quality search results.” As such, BlenderBot 2.0 is now capable of speaking coherently about breaking news and new media, not just the data it was trained upon.

“BlenderBot 2 is limited only by what a powerful search engine can provide,” Jason Weston, Research Scientist at Facebook AI, added. So for example, if you are more interested in learning about Tom Yewcic (the Patriot’s combo QB/Punter from the 1962 season) than you are about Tom Brady, BB 2.0 has you covered. It’s the same with more scholarly subjects, like photosynthesis or redox reactions, Weston continued. So long as the information is available on the web, “there is no reason BlenderBot 2 cannot discuss this.”

By actively searching the internet for information, BlenderBot 2.0 can also reduce the instances in which it hallucinates knowledge. “Providing the system with more commonsense reasoning will allow BlenderBot to make sure it does not confuse subtle concepts,” Weston explained, “such as a movie director versus a producer or a pitching coach versus a hitting coach.”

The only wrinkle really occurs when discussing non-english based media, such as Demon Slayer: Kimetsu no Yaiba. “It is reasonable to conclude Bing will surface information about it and BlenderBot 2 can use that information accordingly,” he said. “We currently focus on english-based search results, so non-english references may not be fully covered.” The system will, however, recognize that Demon Slayer is of interest to you and will be more likely to bring up manga-centric subjects in future discussions.

FAIR has taken multiple steps to ensure that BlenderBot does not become the next Tay. “BlenderBot 2 does not learn directly from user input, as Tay did,” Shuster said. “We have taken extensive safety steps to ensure that BlenderBot 2 can handle adversarial users. Specifically, we employ both baked-in and two-stage techniques. BlenderBot 2 can detect itself if the incoming context will result in an offensive response, and additional safety layers where a safety classifier can detect if either the user input or the bot's output is offensive. Each handles the response appropriately.”

And while the system is currently focused on chewing its way through the English language corpus, FAIR does see BlenderBot does eventually extend to other languages as well. “While not in our immediate plans, the goal of our team is to build a superhuman conversationalist,” Shuster said. “This kind of agent requires multilingual understanding.”

Recent internal benchmarking processes found that BlenderBot 2.0 outperformed its predecessor by 17 percent in its engagingness score and 55 percent in its use of previous conversation sessions according to human evaluators, per a Friday blog from FAIR. What’s more, BlenderBot's rate of knowledge hallucinations dropped from 9 percent (!) in BB 1.0 down to just 3 percent in the current iteration.

Looking ahead, “humans interacting with AI systems via discourse is the future of AI,” Weston asserted, “and ensuring that humans have an engaging, informative experience is critical to that future. BlenderBot 2 combines the engagingness of BlenderBot 1.0 with the knowledge capabilities of a system with access to the entire internet, so ostensibly we are on the right track.”

Engadget
ISPs are fighting to raise the price of low-income broadband
Internet service providers are objected to the lower rates they need to offer lower income customers if they want to obtain government funds from a new Internet access program.
Engadget
Amazon is giving The Boys the prequel treatment
The cast and crew of Amazon's The Boys announced a bunch of new spinoffs for the supe action series.
Engadget
You can date everything in Date Everything!
Date Everything! is an upcoming dating sim game that lets you date evert
Engadget
The Bioshock movie is still happening but with a reduced budget
The Bioshock movie is still happening, but with steep budget cuts. It’s being reconfigured to become a ‘more personal’ film.
Engadget
Warner Bros. Discovery sues the NBA in a last-ditch effort to block Amazon’s new streaming package
Warner Bros. Discovery followed through on its threat to “take appropriate action” against the NBA for rejecting its broadcasting rights offer. On Friday, the media company sued the league after the NBA turned down its bid to match Amazon’s streaming package.
Engadget
Apple’s M3 MacBook Air with 16GB of RAM is $200 off right now
Apple’s M3 MacBook Air combines Apple’s lightest and thinnest laptop design with the cutting-edge horsepower of the latest Apple silicon chip. You can get the 2024 model on sale for $200 off right now.
Engadget
Here's how to stop Grok's AI models using your tweets for training
X automatically opted users into letting Grok's AI models train on their tweets and interactions with the chatbot. Here's how to opt out.
Engadget
The 10th-generation iPad is back down to $300, plus the rest of this week's best tech deals
The week after Amazon's Prime Day can be a bit sleepy for deals, but we still found a few decent discounts on gear we've tested and recommend.
Engadget
The 65-inch LG C3 OLED TV is nearly half off for today only
The 65-inch LG C3 OLED TV is nearly half off for today only. That brings the set down to a record low of $1,300.
Engadget
NASA's Perseverance rover found a rock on Mars that could indicate ancient life
A Martian rock sample collected by Perseverance contains "chemical signatures and structures" that could've been formed by ancient microbial life from billions of years ago.
Engadget
Apple agrees to stick by Biden administration's voluntary AI safeguards
Apple has joined more than a dozen other tech companies in signing up for the Biden administration's voluntary AI code of practice.
Engadget
North Korean who used ransomware to attack US healthcare providers has been indicted
A grand jury in Kansas City has indicted Rim Jong Hyok, a North Korean intelligence operative who allegedly used ransomware to attack health providers' systems in the US.
Engadget
Samsung Galaxy Ring review: A bit basic, a bit pricey
The Galaxy Ring is comfortable and seemingly basic, but actually delivers detailed insight on your sleep, walks and runs.
Engadget
Apple's 14-inch MacBook Pro laptop with an M3 Pro chip is $300 off at Amazon
Apple's well-specked 14-inch MacBook Pro with an M3 Pro chip, 18GB of memory and 512GB of storage is on sale for the lowest price we've seen yet at Amazon.
Engadget
Gran Turismo 7's more realistic physics update is launching cars into orbit
Gran Turismo 7's latest update is causing some bizarre problems, making cars bounce violently or launch completely into the air.
Engadget
The Morning After: OpenAI reveals its AI-powered search engine, SearchGPT
The biggest news stories this morning: AI video startup Runway reportedly trained on ‘thousands’ of YouTube videos without permission, The best cameras for 2024, WhatsApp hits 100 million monthly active US users.
Engadget
The best fitness trackers for 2024
Here's a list of the best fitness trackers you can buy, as chosen by Engadget editors.
Engadget
The best cameras for 2024
Here's a list of the best cameras you can buy, as chosen by Engadget editors.
Engadget
X's Grok chatbot is misleading voters about the presidential election
Grok's AI chatbot claims that President Biden's name must stay on the ballot in nine states, a claim that is categorically false.
Engadget
Comic-Con leak sparks rumors of two remastered Soul Reaver games
A photo from Comic-Con has leaked possible remasters of two Soul Reaver games from Crystal Dynamics.

Facebook’s BlenderBot chat AI no longer has the mental capacity of a goldfish

The bot can now recall past conversations that span weeks or even months.

Latest Stories

ISPs are fighting to raise the price of low-income broadband

Amazon is giving The Boys the prequel treatment

You can date everything in Date Everything!

The Bioshock movie is still happening but with a reduced budget

Warner Bros. Discovery sues the NBA in a last-ditch effort to block Amazon’s new streaming package

Apple’s M3 MacBook Air with 16GB of RAM is $200 off right now

Here's how to stop Grok's AI models using your tweets for training

The 10th-generation iPad is back down to $300, plus the rest of this week's best tech deals

The 65-inch LG C3 OLED TV is nearly half off for today only

NASA's Perseverance rover found a rock on Mars that could indicate ancient life

Apple agrees to stick by Biden administration's voluntary AI safeguards

North Korean who used ransomware to attack US healthcare providers has been indicted

Samsung Galaxy Ring review: A bit basic, a bit pricey

Apple's 14-inch MacBook Pro laptop with an M3 Pro chip is $300 off at Amazon

Gran Turismo 7's more realistic physics update is launching cars into orbit

The Morning After: OpenAI reveals its AI-powered search engine, SearchGPT

The best fitness trackers for 2024

The best cameras for 2024

X's Grok chatbot is misleading voters about the presidential election

Comic-Con leak sparks rumors of two remastered Soul Reaver games

About

Sections

Contribute

Buying Guides