What did Twitter’s ‘open source’ algorithm actually reveal? Not a lot.

Some of the most important questions about Twitter's algorithm remain unanswered

Senior Editor

Thu, Apr 6, 2023, 3:46 PM·4 min read

When Elon Musk first proposed taking over Twitter, one of the first changes he claimed he'd make would be “open-sourcing” Twitter’s algorithm. Last week, Twitter finally followed through on that promise, publishing the underlying code for the site’s "For You" recommendations on GitHub.

Twitter sleuths quickly began sifting through the code to see what they could dig up. It didn’t take long for one eyebrow-raising finding to emerge: that Musk’s tweets have their own category (along with Democrats, Republicans and “power users”). Twitter engineers hastily explained that this was for “stat tracking purposes,” which has since been confirmed by other analyses. And though Twitter removed that section of code from GitHub within hours of its publishing, it’s still fueled speculation that Twitter’s engineers pay special attention to their boss’ engagement and have taken steps to artificially boost his tweets.

But there have been few other major revelations about the contents of the code or how Twitter’s algorithm works since. And anyone hoping this public code would produce new insights into the inner workings of Twitter will likely be disappointed. That’s because the code Twitter released omitted important details about how “the algorithm” actually works, according to engineers who have studied it.

The code Twitter shared was a “highly redacted” version of Twitter’s algorithm, according to Sol Messing, associate professor at NYU’s Center for Social Media and Politics and former Twitter employee. For one, it didn't include every system that plays a role in Twitter’s recommendations.

Twitter said it was withholding code dealing with ads, as well as trust and safety systems, in an effort to prevent bad actors from gaming it. The company also opted to withhold the underlying models used to train its algorithm, explaining in a blog post last week that this was “to ensure that user safety and privacy would be protected.” That decision is even more consequential, according to Messing. “The model that drives the most important part of the algorithm has not been open-sourced,” he tells me. “So the most important part of the algorithm is still inscrutable.”

Musk’s original motivation to make the algorithm open source seemed to stem from his belief that Twitter had used the algorithm to suppress free speech. “One of the things that I believe Twitter should do is open source the algorithm and make any changes to people's tweets — if they're emphasized or de-emphasized —that action should be made apparent,” Musk said last April in an appearance at TED shortly after he confirmed his takeover bid. “So anyone can see that action has been taken, so there's no sort of behind-the-scenes manipulation, either algorithmically or manually.”

But none of the code Twitter released tells us much about potential bias or the kind of “behind-the-scenes manipulation” Musk said he wanted to reveal. “It has the flavor of transparency,” Messing says. “But it doesn’t really give insight into what the algorithm is doing. It doesn't really give insight into why someone's tweets may be down-ranked and why others might be up-ranked.”

Messing also points out that Twitter’s recent API changes have essentially cut off the vast majority of researchers from accessing a meaningful amount of Twitter data. Without proper API access, researchers are unable to conduct their own audits, which would be able to provide new details about how the algorithm works. “So at the same time Twitter is releasing this code, it’s made it incredibly difficult for research to audit this code,” he wrote in his own analysis.

Alex Hanna, director of research at the Distributed AI Research Institute (DAIR), also raised the importance of audits when we talked last year, shortly after Musk first discussed plans to “open source” Twitter’s algorithm. Like Messing, she was skeptical that simply releasing code on GitHub would meaningfully increase transparency into how Twitter works.

“If you're actually interested in public oversight on something like a Twitter algorithm, then you would actually need multiple methods for oversight to happen,” Hanna said.

There is one aspect of Twitter’s algorithm that the GitHub code does shed some new light on, though. Messing points to a file unearthed by data scientist Jeff Allen, which reveals a kind of “formula” for how different types of engagement are given priority by the algorithm. “If we take that at face value, a fav (twitter like) is worth half a retweet,” Messing writes. “A reply is worth 27 retweets, and a reply with a response from a tweet’s author is worth a whopping 75 retweets.”
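To make those ratios concrete, here is a minimal Python sketch that takes the quoted figures at face value and expresses each engagement type in "retweet units." The names `RETWEET_EQUIVALENTS` and `engagement_score` are hypothetical, chosen for illustration, and the numbers come only from the quote above, not from the constants in Twitter's repository. It also leaves out the unreleased model that, per Messing, drives the most important part of the ranking.

```python
# Illustrative only: relative weights in "retweet units", taken from the
# ratios quoted in the article. These are NOT the constants in Twitter's code.
RETWEET_EQUIVALENTS = {
    "like": 0.5,                         # a fav is worth half a retweet
    "retweet": 1.0,                      # baseline unit
    "reply": 27.0,                       # a reply is worth 27 retweets
    "reply_with_author_response": 75.0,  # reply that the author responds to
}


def engagement_score(counts):
    """Sum engagement counts weighted by their retweet equivalents.

    `counts` maps an engagement type (a key of RETWEET_EQUIVALENTS)
    to how many times it occurred on a tweet.
    """
    return sum(RETWEET_EQUIVALENTS[kind] * n for kind, n in counts.items())


if __name__ == "__main__":
    # Ten likes and two retweets: 10 * 0.5 + 2 * 1.0
    print(engagement_score({"like": 10, "retweet": 2}))
    # A single reply that the author answers: 1 * 75.0
    print(engagement_score({"reply_with_author_response": 1}))
```

Under these assumed weights, one reply that draws a response from the author counts as much as 150 likes, which shows how heavily the published formula appears to favor conversation over passive engagement.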

While that’s somewhat revealing, it’s, once again, an incomplete picture of what’s actually happening. “It doesn't mean that much without the actual data,” Messing says. “And Musk just made data so insanely expensive for academics to get. If they want to actually study this now, you basically have to get a giant, massive grants — half a million dollars a year — to get a meaningful amount of data to study what's happening.”
