Facebook deploys AI in its fight against hate speech and misinformation

The company is also launching a $100,000 Hateful Memes Challenge.

Former Senior Editor

Updated Tue, May 12, 2020, 12:00 PM·4 min read

Even in the year 2020, it’s not very hard to be led astray on Facebook. Click a few misleading links and you can find yourself at the bottom of an ethnonationalist rabbit hole facing a flurry of hate speech and medical misinformation. But with the help of AI and machine learning systems, the social media platform is accelerating its efforts to keep this content from spreading.

It’s bad enough that we’re having to deal with the COVID-19 pandemic without being bombarded on Facebook with ads for sham cures and conspiracy theories passed off as the gospel truth. The company is already partnering with 60 fact-checking organizations to fight this misinformation and, since the start of the outbreak in March, has temporarily banned the sale of PPE, hand sanitizers, and cleaning supplies on the platform. The problem is that people can easily get around this ban by slightly altering the text or image in an ad and resubmitting it. The two versions may appear nearly identical to people -- a screenshot of the original image, for example -- but can trip up conventional computer vision systems because those machines are designed to look at individual pixels rather than the image as a whole. They miss the forest for the trees.

But that’s where SimSearchNet comes in. This convolutional neural net-based model is purpose-built to identify nearly identical images, which in turn helps to automate the enforcement of the checks made by human moderators. Once a human fact-checker flags an image as containing false claims about COVID-19, that information is fed back into SimSearchNet, which seeks out near-duplicates so moderators can affix warning labels to those images as well. It essentially scales up the moderator’s reach, autonomously applying their decisions to the thousands (potentially millions) of digital doppelgangers that those false images spawn.
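To make the contrast concrete, here is a minimal sketch of why exact pixel matching is brittle while a coarser comparison tolerates small edits. It is not SimSearchNet: the block-averaging `coarse_descriptor`, the cosine-similarity test and the 0.99 threshold are all illustrative assumptions standing in for a learned embedding, and the three tiny grayscale "images" are made up.

```python
import hashlib
import math


def exact_fingerprint(image):
    """Hash the raw pixel values; any single-pixel edit changes the result."""
    raw = bytes(pixel for row in image for pixel in row)
    return hashlib.sha256(raw).hexdigest()


def coarse_descriptor(image, grid=4):
    """Average brightness over a grid x grid layout of blocks.

    A hand-made stand-in for a learned embedding (assumption, not Facebook's model).
    """
    block_h = len(image) // grid
    block_w = len(image[0]) // grid
    descriptor = []
    for gy in range(grid):
        for gx in range(grid):
            block = [
                image[y][x]
                for y in range(gy * block_h, (gy + 1) * block_h)
                for x in range(gx * block_w, (gx + 1) * block_w)
            ]
            descriptor.append(sum(block) / len(block))
    return descriptor


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def is_near_duplicate(a, b, threshold=0.99):
    """Hypothetical rule: descriptors pointing in almost the same direction match."""
    return cosine_similarity(coarse_descriptor(a), coarse_descriptor(b)) >= threshold


# Made-up 16x16 grayscale images (pixel values 0-255).
flagged = [[x * 15 for x in range(16)] for _ in range(16)]    # image a fact-checker flagged
tweaked = [row[:] for row in flagged]                         # same image, two pixels edited
tweaked[0][0] = 255
tweaked[5][7] = 0
different = [[y * 15 for _ in range(16)] for y in range(16)]  # a genuinely different image

print(exact_fingerprint(flagged) == exact_fingerprint(tweaked))  # exact match fails
print(is_near_duplicate(flagged, tweaked))                       # coarse match survives the edit
print(is_near_duplicate(flagged, different))                     # different image is not matched
```

The threshold is the trade-off Schroepfer describes below: set it too loose and qualitatively different images get labelled, too strict and trivial edits slip through.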

“What we want to be able to do is detect those things as being identical, because they are -- to a person -- the same thing,” Mike Schroepfer, Facebook CTO, explained on a conference call Tuesday. “But we have to do this [with] very high accuracy, because we don't want to take something that looks very similar, but is actually qualitatively different, and either put in [a] misinformation overlay or block it as appropriate. And so our previous systems were very accurate, but they were very fragile and brittle to even very small changes if you change a small number of pixels.”

In a blog post, the company claims to have labelled about 50 million posts related to COVID-19 during the month of April and removed “more than 2.5 million pieces of content for the sale of masks, hand sanitizers, surface disinfecting wipes and COVID-19 test kits.”

Of course, Facebook has had troll troubles for far longer than COVID-19 has had us sheltering in place. The company has long sought to rein in the prevalence of hate speech on its site. According to a separate blog post published on Tuesday, the company said that, per its Community Standards Enforcement Report released the same day, “AI now proactively detects 88.8 percent of the hate speech content we remove, up from 80.2 percent the previous quarter. In the first quarter of 2020, we took action on 9.6 million pieces of content for violating our hate speech policies -- an increase of 3.9 million.”
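For a sense of scale, the quoted figures can be worked through in a few lines. This is only back-of-the-envelope arithmetic on the numbers above, and it assumes the "increase of 3.9 million" is measured against the previous quarter, which the quote does not spell out; the variable names are illustrative.

```python
# Figures from the quoted report, stored in tenths to avoid float rounding.
actioned_q1_2020 = 96   # 9.6 million pieces of content
increase = 39           # 3.9 million more (assumed: versus the previous quarter)
proactive_now = 888     # 88.8 percent detected proactively
proactive_prev = 802    # 80.2 percent the previous quarter

actioned_prev = actioned_q1_2020 - increase              # in tenths of a million
growth_pct = round(100 * increase / actioned_prev, 1)    # relative growth in actioned content
rate_gain_points = (proactive_now - proactive_prev) / 10 # gain in percentage points

print(f"Previous quarter: {actioned_prev / 10} million pieces actioned")
print(f"Quarter-over-quarter growth: {growth_pct} percent")
print(f"Proactive detection gain: {rate_gain_points} percentage points")
```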

Detecting hate speech is no easy feat. There can be layers upon layers of nuance involved -- it’s not just what is being said, but how it is being said, who it is being said to, and what sorts of accompanying content (whether that’s images, audio or video) are presented alongside it. Even more difficult is correctly identifying responses to hate speech, especially when those statements use much of the same language as the offending post. Even human moderators are routinely tripped up. In response, Facebook has spent the past few years improving its natural language processing capabilities, including XLM-R, a cross-lingual model that can understand text in roughly 100 languages, and RoBERTa, the pretraining approach XLM-R builds on, which trains models for longer and on far more data.

The company is even tackling hate memes. As mentioned previously, combining two types of content -- i.e. text and an image -- can severely hamper an AI’s attempts to ID it as hate speech. So, Facebook AI went and created an entire database of multimodal examples -- more than 10,000 professionally created memes using licensed Getty Images photos -- with which to train tomorrow’s AI moderators.

“The memes were selected in such a way that strictly unimodal classifiers would struggle to classify them correctly,” the Facebook AI team wrote in a Tuesday blog post. “We also designed the data set specifically to overcome common challenges in AI research, such as the lack of examples to help machines learn to avoid false positives.”
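A toy illustration of why a strictly unimodal classifier struggles on data built this way: if the label depends on the combination of caption and image, no rule that looks at only one of them can do better than a coin flip. The four examples below are entirely made up (placeholder captions and images, not items from Facebook's data set), and `best_lookup_accuracy` is a hypothetical helper that measures the best accuracy any classifier could reach given only the chosen inputs.

```python
from collections import Counter, defaultdict

# Hypothetical toy data: (caption, image, label), where 1 = hateful and 0 = benign.
# Each caption and each image appears once with each label.
EXAMPLES = [
    ("caption_a", "image_x", 0),
    ("caption_a", "image_y", 1),
    ("caption_b", "image_x", 1),
    ("caption_b", "image_y", 0),
]


def best_lookup_accuracy(examples, key):
    """Best accuracy achievable by a classifier that only sees key(caption, image).

    For every distinct input the classifier can observe, it answers with the
    majority label, which is the most any rule restricted to that input can do.
    """
    groups = defaultdict(Counter)
    for caption, image, label in examples:
        groups[key(caption, image)][label] += 1
    correct = sum(max(counts.values()) for counts in groups.values())
    return correct / len(examples)


text_only = best_lookup_accuracy(EXAMPLES, lambda caption, image: caption)
image_only = best_lookup_accuracy(EXAMPLES, lambda caption, image: image)
multimodal = best_lookup_accuracy(EXAMPLES, lambda caption, image: (caption, image))

print(f"text only:  {text_only:.0%}")
print(f"image only: {image_only:.0%}")
print(f"both:       {multimodal:.0%}")
```

On this toy set the text-only and image-only rules are stuck at chance, while a rule that sees both inputs separates the examples perfectly. Real memes are far messier, but the selection principle the team describes is the same.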

And to help spur the development of those sorts of machine learning content moderation systems, Facebook is partnering with DrivenData to launch the Hateful Memes Challenge. If you can code an AI to automatically identify and label this kind of multimodal hate speech, you could earn yourself a cool $100,000.
