moderation

Latest

Meta reportedly won't make its AI advertising tools available to political marketers
by
Andrew Tarantola
11.06.2023
Reuters reports that Meta will specifically not make its AI advertising tools available to political marketers ahead of the presidential election cycle.
Xbox’s reporting system for abusive-voice chat arrives this week
by
Will Shanklin
09.05.2023
Microsoft is set to launch its abusive voice reporting feature for Xbox consoles. Announced in July, it lets gamers submit inappropriate remarks heard while playing multiplayer titles. The system captures a 60-second clip saved to the console; you then have 24 hours to complete the report. The feature arrives this week in the September update for Xbox Series X/S and Xbox One. However, it’s initially limited to the “select English-language markets” of the US, UK, Canada, Ireland, Australia and New Zealand.
ChatGPT is easily exploited for political messaging despite OpenAI's policies
by
Andrew Tarantola
08.28.2023
ChatGPT was supposed to be inoculated against political misinformation in March. Doesn't look like that actually happened.
Twitch streamers can soon block banned accounts from tuning in
by
Will Shanklin
08.18.2023
Twitch announced this week that an upcoming change will allow streamers to block banned users from tuning into their streams. “You can choose to have your banned chatters no longer be able to watch the stream,” Senior Product Manager Trevor Fisher revealed on Twitch’s Patch Notes podcast (via TechCrunch), stressing that the feature won’t be enabled by default. The new blocking feature will roll out in the next few weeks.
Anthropic explains how Claude's AI constitution protects it against adversarial inputs
by
Andrew Tarantola
05.09.2023
Anthropic explains how binding its Claude AI to a set of guiding principles will lead to better outputs and prevent racist meltdowns.
Bipartisan bill would require that social networks have 'clear' content policies
by
Jon Fingas
02.16.2023
Senators have introduced a bill that would make social networks provide clearer content moderation policies.
Google is making free anti-terrorism moderation tools for smaller websites
by
Jon Fingas
01.03.2023
Google is helping smaller websites fight terrorist content by making a free moderation tool.
Twitch halts paid stream boosts after viewers abuse them to push porn
by
Amrita Khalid
04.01.2022
The new program lets viewers boost their favorite streams when they purchase subscriptions and bits.
GGWP is an AI system that tracks and fights in-game toxicity
by
Jessica Conditt
03.21.2022
AAA studios won’t fix the problem, so esports pro Dennis Fong is stepping up.
Instagram Live creators can now bring in moderators to handle trolls
by
Kris Holt
03.11.2022
The feature could make broadcasts a more positive experience for almost everyone.
Twitter reportedly knew Spaces could be misused due to a lack of moderation
by
Kris Holt
12.10.2021
Company executives forged ahead with the audio chat feature despite warnings, according to 'The Washington Post.'
Twitch offers slightly more information about suspensions
by
Daniel Cooper
08.10.2021
But won't just come out and say why.
Senate Republicans vote to subpoena Facebook and Twitter CEOs
by
Nathan Ingraham
10.22.2020
Senate Republicans voted to subpoena Facebook and Twitter CEOs to testify about their alleged blocking of a controversial New York Post article.
Sony tries to clear up confusion over voice chat recording on PS5
by
Richard Lawler
10.16.2020
There is no opt-out of this — if you use voice chat on PS5, someone in your chat can clip your audio and submit it for review by the moderation team.
Reddit has banned nearly 7,000 hateful subreddits since June 29th
by
Christine Fisher
08.20.2020
Reddit has banned nearly 7,000 hateful subreddits since its new content policies went into effect.
Pinterest's moderation doesn't catch some abusive and false material
by
Jon Fingas
07.12.2020
Pinterest's emphasis on hiding harmful content over removing it has left some abusive and false material visible.
Twitch forms review board to strengthen moderation policies
by
Jessica Conditt
05.14.2020
Twitch’s reputation needs a spit-shine. The Safety Advisory Council is an attempt to do just that.
Facebook will pay content moderators $52 million in PTSD settlement
by
Kris Holt
05.12.2020
Each of the 11,250 plaintiffs will receive at least $1,000.
YouTube will temporarily increase automated content moderation
by
Christine Fisher
03.16.2020
YouTube will rely more on machine learning and less on human reviewers during the coronavirus outbreak. Normally, algorithms detect potentially harmful content and send it to human reviewers for assessment. But these are not normal times, and in an effort to reduce the need for employees and contractors to come into an office, YouTube will allow its automated system to remove some content without human review.
TikTok will stop using China-based moderators to screen foreign content
by
Jon Fingas
03.15.2020
TikTok has already taken steps to reassure the world that the Chinese government doesn't control its app overseas, including the use of non-Chinese moderators for the US and plans for a transparency center. However, it's taking things one step further. The social media company said it will stop using China-based moderators to screen content in any other country, and that more than 100 moderators will have to either find other jobs inside parent company Bytedance or leave. Teams local to given areas should take over within a few weeks, TikTok said.