Google’s comment-ranking system will be a hit with the alt-right

The company's API for scoring toxicity in online discussions already behaves like a racist hand dryer.

Journalist

Updated Fri, Sep 1, 2017, 3:00 PM·7 min read

Google’s comment-ranking system will be a hit with the alt-right

A recent, sprawling Wired feature outlined the results of its analysis on toxicity in online commenters across the United States. Unsurprisingly, it was like catnip for everyone who's ever heard the phrase "don't read the comments." According to "The Great Tech Panic: Trolls Across America," Vermont has the most toxic online commenters, whereas Sharpsburg, Georgia, "is the least-toxic city in the US."

There's just one problem.

The underlying API used to determine "toxicity" scores phrases like "I am a gay black woman" as 87 percent toxicity, and phrases like "I am a man" as the least toxic. The API, called Perspective, is made by Google's Alphabet within its Jigsaw incubator.

When reached for a comment, a spokesperson for Jigsaw told Engadget, "Perspective offers developers and publishers a tool to help them spot toxicity online in an effort to support better discussions." They added, "Perspective is still a work in progress, and we expect to encounter false positives as the tool's machine learning improves."

Poking around with the engine behind Wired's data revealed some ugly results, as Vermont librarian Jessamyn West discovered when she read the article and tried out Perspective to see exactly what makes a comment, or a commenter, perceived as toxic (according to Alphabet, at least).

It's strange to wonder that Wired didn't give Perspective a spin to see what made the people behind its troll map "toxic." Wondering exactly that, I decided to try out a variety of comments to see how the results compared to West's. I endeavored to represent the people I seem to see censored the most on social media, and opinions of the day.

My experience typing "I am a black trans woman with HIV" got a toxicity rank of 77 percent. "I am a black sex worker" was 89 percent toxic, while "I am a porn performer" was scored 80. When I typed "People will die if they kill Obamacare" the sentence got a 95 percent toxicity score.

The Wired article analyzed 92 million Disqus comments "over a 16-month period, written by almost 2 million authors on more than 7,000 forums." They didn't look at sites that don't use the comment-management software (so Facebook and Twitter were not included).

The piece explained:

To broadly determine what is and isn't toxic, Disqus uses the Perspective API—software from Alphabet's Jigsaw division that plugs into its system. The Perspective team had real people train the API to rate comments. The model defines a toxic comment as "a rude, disrespectful, or unreasonable comment that is likely to make you leave a discussion."

Discrimination by algorithm

In an online world where moderation, banning and censorship are largely left to automation like the Perspective API, finding out how these things are measured is critical for everyone involved. "Looking into this, the word 'toxic' is a very specific term of art for the tool, this tool Perspective that's made by this company Alphabet, who you may know as Google, that is trying to bring [artificial intelligence] into commenting," West told Vermont Public Radio.

I tested 14 sentences for "perceived toxicity" using Perspectives. Least toxic: I am a man. Most toxic: I am a gay black woman. Come on pic.twitter.com/M4TF9uYtzE
— jessamyn west (@jessamyn) August 24, 2017

Perspective presents itself as a way to improve conversations online, positing that the "threat of abuse and harassment online means that many people stop expressing themselves and give up on seeking different opinions." It's one of the many "make the world safer" Jigsaw projects.

Jigsaw worked with The New York Times and Wikipedia to develop Perspective. The NYT made its comments archive available to Jigsaw "to help develop the machine-learning algorithm running Perspective." Wikipedia contributed "160k human labeled annotations based on asking 5,000 crowd-workers to rate Wikipedia comments according to their toxicity. ... Each comment was rated by 10 crowd-workers."

A February article about Perspective elaborated on the human-trained, machine-learning process behind what wants to become the world's measuring tool for harmful comments and commenters.

"In this instance, Jigsaw had a team review hundreds of thousands of comments to identify the types of comments that might deter people from a conversation," The NYT wrote. "Based on that data, Perspective provided a score from zero to 100 on how similar the new comments are to the ones identified as toxic."

The results from West typing comments into Perspective were shockingly discriminatory. Identifying as black and/or gay was deemed toxic. She also tried it with visible and invisible disabilities, like wheelchair use and deafness, and the most toxic way to identify yourself in a conversation turned out to be saying "I am a woman who is deaf."

Trying it with some visible/invisible disabilities. The man/woman division is concerning. https://t.co/lEs9prSPhb pic.twitter.com/6zVb8v8b4O
— jessamyn west (@jessamyn) August 26, 2017

When the algorithm is taught to be racist, sexist and ableist (among other things), it leads to the silencing and censorship of entire populations. The problem is that when these systems are up and running, the people being silenced and banned disappear without a trace. Discrimination by algorithm happens in a vacuum.

We can only imagine what's underlying the automated comment-policing system at Facebook. In August Mary Canty Merrill, a psychologist who advises corporations on how to avoid racial bias, wrote a short post about defining racism on Facebook.

Reveal News wrote, "She logged in the next day to find her post removed and profile suspended for a week. A number of her older posts, which also used the "Dear white people" formulation, had been similarly erased."

Pasting her "Dear white people" into Perspective's API got a score of 61 percent toxicity.

Unless Google anti-diversity creeper James Damore was the project lead for Perspective, it's hard to imagine that the company would greenlight a product that thinks to identify as a black gay woman is toxic. (Wikipedia, on the other hand, I could imagine.)

It's possible that the tool is seeking out comments with terms like black, gay and woman as high potential for being abusive or negative, but that would make Perspective an expensive, overkill wrapper for the equivalent of using Command-F to demonize words that some people might find upsetting.

Perspective's reach is significant, too. The project is partnered with Wikipedia, The New York Times, The Economist and The Guardian. Abandon all hope, ye gay black women who enter the comments there.

What we've discovered about Perspective doesn't bode well for the future of machine-learning or AI and algorithm-driven comment measurement and moderation. Nor does it look good for accountability with companies like Google, Facebook and others that rely on automation for moderation.

I think we're all tired of Facebook telling us "it was a bug" and companies saying "it's not our fault" and pointing at systems like Perspective. Despite the fact that they're complicit by using it. And they should be trying these things out against problems like not being able to identify as a gay black woman in a comment thread without risking your ability to comment.

Imagine a system like Perspective deciding whether or not you can use business services, like Google AdSense. Take, for instance, the African-American woman who got an email Thursday from Google AdSense saying she'd violated its terms by writing a blog post about dealing with being called the n-word ... on her own website.

Distressingly, what's also being created is a culture where we can't even talk about abuse. As we can see, the implications for speech are huge -- and already we're soaking in it. Moreso when you consider that "competition" for something like Perspective is clearly already at work for social-media networks like Facebook, whose own policies around race and neo-Nazi belief systems are deeply skewed against societies who strive for equality, anti-discrimination and human rights.

It's probable that these terms are getting scored for high toxicity because they're terms used most commonly in attacks on targeted groups. But the instances mentioned in this article are clear failures. It shows that the efforts of Silicon Valley's ostensible best and brightest have steered AI meant to "improve the conversation" the way of racist soap dispensers and facial recognition software that can't see black people.

Insofar as the Wired feature is concerned, the data look flawed from where we're sitting. It may just mean that there are more gay black women and sex workers there who are OK with talking about it than Sharpsburg, Georgia, commenters. Depressingly, the "Internet Troll Map" might just be a map of black people discussing issues of race, LGBTQ identity and health care.

Which, we hope, is the opposite of what everyone intended.

Engadget
The Google Pixel Watch 2 has never been cheaper
Amazon has the Google Pixel Watch 2 on sale for $70 off. The smartwatch arrived last fall with a new stress-tracking feature, slightly longer battery life and a better heart sensor, as Google’s flagship wearable inches closer to rivals Apple and Samsung.
Engadget
Paramount+ with Showtime annual subscriptions are half off right now
Paramount+ with Showtime annual subscriptions are half off right now. A yearly subscription costs $60 instead of $120.
Engadget
FTX plans to refund defrauded customers with interest
FTX plans to repay most of its creditors in full and with interest as it attempts to escape bankruptcy.
Engadget
Pick up the 9th-gen iPad with two years of AppleCare+ for only $298
Apple's ninth-generation iPad with Applecare+ is 25 percent off the bundle's sticker price.
Engadget
US revokes Intel and Qualcomm's licenses for chip sales to Huawei
The US has stopped Intel and Qualcomm from selling and sending chips to Chinese-based Huawei Technologies.
Engadget
Remedy cancels its multiplayer game project with Tencent
Remedy's and Tencent's gaming project hit a snag last year when they had to go in a different direction. Now, that project has been cancelled completely.
Engadget
The Morning After: Apple's new iPad Pro is thinner than an old iPod nano
The biggest news stories this morning: Google announces the $499 Pixel 8a early, Apple’s M4 chip arrives with a big focus on AI, Nintendo to announce Switch successor before March 2025.
Engadget
Bluesky plans to launch DMs for users
Bluesky should launch a DMs feature in the next couple months, among other updates.
Engadget
OpenAI is reportedly working on a search feature for ChatGPT
According to Bloomberg, the company is currently developing the capability, which can scour the web for answers to your queries and spit out results complete with citations to their sources.
Engadget
The 5 best cordless vacuums for 2024
Cordless vacuums are often lighter and easier to use than standard vacuums. We tested a number of the most popular cordless vacuums today to find the ones that are worth your money.
Engadget
The 2023 Echo Show 8 is on sale for $100 right now
The third-gen, 2023 Echo Show 8 is 33 percent off, bringing it down to just $100 ($50 off), only $10 off its all-time low.
Engadget
The best tablets for 2024
You might want a tablet to be your main portable device, or to replace your laptop entirely. These are the best slabs you can get right now.
Engadget
Meta is testing cross-posting from Instagram to Threads
Meta is testing the ability to cross-post photos from Instagram to Threads.
Engadget
OpenAI partners with People publisher Dotdash Meredith
OpenAI is partnering with another publisher as it moves towards a licensed approach to training materials. Dotdash Meredith, the owner of brands like People and Better Homes & Gardens, will license its content for OpenAI to train ChatGPT.
Engadget
The best Android phones for 2024
If you're looking for a new Android phone, check out our guide to the best handsets on the market from budget to flagship and everything in between.
Engadget
Apple's 2023 iMac drops to a record-low price
Apple's 2023 iMac has dropped to its lowest price to date, but you'll need to be comfortable having just 8GB of RAM.
Engadget
iPad Pro 2024 vs. 2022: What’s changed
Apple updated its top-of-the-line tablets at its Let Loose event today. Our comparison lets you quickly glance at all the changes to the new model — big and small — to help you decide whether it’s worth the upgrade.
Engadget
Meta is expanding its paid verification service for businesses
Meta is expanding its paid verification service for businesses, adding three new tiers to the program that offers extra perks to companies willing to pay a monthly subscription.
Engadget
The Beats Fit Pro wireless earbuds are on sale for $160 right now
The Beats Fit Pro, our top pick for workout and running headphones, is 20 percent off at the minute.
Engadget
You can now buy a Pixel Tablet without a dock for $400, if that’s your bag
Google’s Pixel Tablet is now available standalone without the charging dock for $400. However, we found the charging dock to be one of the best parts of the whole experience.

Google’s comment-ranking system will be a hit with the alt-right

The company's API for scoring toxicity in online discussions already behaves like a racist hand dryer.

Discrimination by algorithm

Latest Stories

The Google Pixel Watch 2 has never been cheaper

Paramount+ with Showtime annual subscriptions are half off right now

FTX plans to refund defrauded customers with interest

Pick up the 9th-gen iPad with two years of AppleCare+ for only $298

US revokes Intel and Qualcomm's licenses for chip sales to Huawei

Remedy cancels its multiplayer game project with Tencent

The Morning After: Apple's new iPad Pro is thinner than an old iPod nano

Bluesky plans to launch DMs for users

OpenAI is reportedly working on a search feature for ChatGPT

The 5 best cordless vacuums for 2024

The 2023 Echo Show 8 is on sale for $100 right now

The best tablets for 2024

Meta is testing cross-posting from Instagram to Threads

OpenAI partners with People publisher Dotdash Meredith

The best Android phones for 2024

Apple's 2023 iMac drops to a record-low price

iPad Pro 2024 vs. 2022: What’s changed

Meta is expanding its paid verification service for businesses

The Beats Fit Pro wireless earbuds are on sale for $160 right now

You can now buy a Pixel Tablet without a dock for $400, if that’s your bag

About

Sections

Contribute

Buying Guides