Stories filed under: "scunthorpe problem"

Content Moderation Case Study: Facebook Struggles To Correctly Moderate The Word 'Hoe' (2021)

from the language-is-a-funny-thing dept

Thu, Nov 4th 2021 03:43pm - Copia Institute

Summary: One of the many challenges with content moderation is the flexibility of language. When applying blocklists — a list of prohibited terms considered not appropriate for the platform — moderators need to consider innocuous uses of words that, when removed from their context, appear to be violations of the platform’s terms of use.

Multiple platforms have run into the phenomenon known as the “Scunthorpe problem.” In this famous case, a town whose name no one would ever mistake for offensive was deemed offensive by moderation blocklists simply because the name of the town contains the word “cunt,” which many blocklists forbid.
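To make the mechanism concrete, here is a minimal sketch of how a substring-based blocklist check produces this kind of false positive, and how a whole-word check avoids it. The function names and the one-term blocklist are illustrative assumptions, not any platform's actual implementation.

```python
import re

def substring_match(text, blocklist):
    """Naive check: flags any blocked term found anywhere in the text."""
    lowered = text.lower()
    return [term for term in blocklist if term in lowered]

def whole_word_match(text, blocklist):
    """Stricter check: flags a term only when it stands alone as a word."""
    hits = []
    for term in blocklist:
        pattern = r"\b" + re.escape(term) + r"\b"
        if re.search(pattern, text, flags=re.IGNORECASE):
            hits.append(term)
    return hits

# Hypothetical one-term blocklist, using the term named above.
blocklist = ["cunt"]
post = "Greetings from Scunthorpe"

print(substring_match(post, blocklist))   # the town name is flagged
print(whole_word_match(post, blocklist))  # nothing is flagged
```

Whole-word matching only addresses terms embedded inside longer words. It does nothing for a word that is blocklisted in one sense and harmless in another.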

Deploying automated blocklists can be even more challenging when dealing with specialized or niche content, which may use certain terms that are offensive outside of this specific context, but are essential to discussing and understanding the relevant subject matter. A paleontologists’ conference was derailed when the moderation blocklist made it impossible for participants to use words like “bone,” “pubic,” “stream,” and “beaver.”
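One possible mitigation for specialized communities is a per-community allowlist that exempts terms of art from a general blocklist. The sketch below assumes a simple word-level filter; the names GLOBAL_BLOCKLIST and PALEO_ALLOWLIST are hypothetical, and nothing here reflects how the conference's software actually worked.

```python
import re

def flagged_terms(text, blocklist, allowlist=frozenset()):
    """Return blocked words in the text, minus any the community has exempted."""
    words = set(re.findall(r"[a-z']+", text.lower()))
    return sorted((words & set(blocklist)) - set(allowlist))

# Hypothetical lists built from the words mentioned above.
GLOBAL_BLOCKLIST = {"bone", "pubic", "stream", "beaver"}
PALEO_ALLOWLIST = {"bone", "pubic", "stream", "beaver"}

message = "The pubic bone was found near a stream"

print(flagged_terms(message, GLOBAL_BLOCKLIST))                   # general filter
print(flagged_terms(message, GLOBAL_BLOCKLIST, PALEO_ALLOWLIST))  # with exemptions
```

The trade-off is that someone has to maintain each allowlist, and an exempted word is exempt even when it is used abusively in that community.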

Facebook has worked continuously to refine its moderation processes, but it still occasionally makes the wrong call when it comes to its blocklists. In January 2021, residents of (and visitors to) a Devon, England landmark were surprised to find their posts and comments vanishing from the site. After a little investigation, it became clear Facebook was deleting posts containing references to the landmark known as Plymouth Hoe.

In addition to being the name of a common garden tool (more on that in a moment), “hoe” also refers to a “sloping ridge shaped like an inverted foot or heel,” such as Plymouth Hoe, which is known locally as the Hoe. Users were temporarily forced to self-censor the harmless term to avoid moderation, either by adding unnecessary punctuation or dropping the “h.” It appeared Facebook’s automated processes believed these comments and posts were using a derogatory term for a romantic partner who is only in a relationship to better their own financial position.
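The workarounds users adopted show why keyword filters are easy to route around while still catching innocent posts. The sketch below assumes a simple whole-word filter plus an optional normalization step that strips punctuation inserted between letters; both functions are hypothetical illustrations, since Facebook has not described how its filter works.

```python
import re

def contains_blocked_word(text, blocked):
    """Whole-word check on the raw text."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return any(tok in blocked for tok in tokens)

def contains_blocked_word_normalized(text, blocked):
    """Same check, after removing punctuation placed between letters."""
    collapsed = re.sub(r"(?<=\w)[.\-*_](?=\w)", "", text.lower())
    tokens = re.findall(r"[a-z]+", collapsed)
    return any(tok in blocked for tok in tokens)

blocked = {"hoe"}  # assumed blocklist entry

for post in ["See you on Plymouth Hoe",
             "See you on Plymouth H.o.e",
             "See you on Plymouth 'oe"]:
    print(post,
          contains_blocked_word(post, blocked),
          contains_blocked_word_normalized(post, blocked))
```

Normalization closes the punctuation workaround, but it also flags more harmless posts about the landmark, and dropping the "h" still gets through. Neither version can tell the landmark from the insult, because that distinction lives in the context rather than in the spelling.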

Facebook soon apologized for the moderation error and stated it was “taking steps to rectify the error” and figure out what caused the mistaken moderation in the first place. Problem solved?

Not really.

The same problem popped up again, this time affecting a New York gardening group. WNY Gardeners, a group with more than 8,000 members, is the latest to be affected by Facebook’s “hoe” pruning. A member responded to the prompt “most loved & indispensable weeding tool” with “Push pull hoe!” Not long after that, the member was informed by Facebook that the comment violated the site’s policy on bullying and harassment.

Company Considerations:

How could blocklists and keyword searches be better utilized to detect and remove violations of site policies?
How much collateral damage from automated moderation should be considered acceptable? Is this an acceptable trade-off for lower moderation costs, which often depend on more automated moderation and fewer human moderators?
Can AI-based moderation more reliably detect actual violations (rather than innocuous uses of blocklisted terms) as the technology advances? What are the trade-offs with AI-based moderation tools as compared to simple blocklists?
What mitigation measures might be put in place to deal with a blocklist that catches words with different meanings depending on context?
Who should be in charge of reviewing a blocklist and how frequently should it be updated?

Issue Considerations:

Does prohibiting words like “hoe” make a significant dent in online harassment and abuse? Does the tech have the capability to “catch up” to (or surpass) the ability of humans to route around moderation efforts?
Should more resources go to staffing human moderators in order to prevent errors and/or allow for a more robust challenge process that allows content to remain “live” until the challenge process has concluded?
What ways might automation and human reviewers be used in combination to avoid the more egregious automated blocklist mistakes?

Resolution: Once again, Facebook has apologized for not recognizing the word “hoe” in contexts where it’s appropriate to use. But after two highly publicized incidents in less than a year, both involving the same word, Facebook has added human moderators to backstop automated calls on flagged terms like these in order to prevent unjustified removals of posts, accounts, or groups.
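A human backstop of this kind can be sketched as a triage step: the automated filter never removes anything itself, and a flagged post stays live while it waits in a review queue. This is an assumption-laden illustration of the idea raised in the considerations above. The ReviewQueue class, the moderate function, and the status strings are all hypothetical, and Facebook's actual workflow is not public.

```python
import re
from dataclasses import dataclass, field

@dataclass
class ReviewQueue:
    """Hypothetical holding area for posts awaiting a human decision."""
    pending: list = field(default_factory=list)

    def submit(self, post, terms):
        self.pending.append((post, terms))

def moderate(post, flagged_terms, queue):
    """Publish every post; send those with flagged words to human review."""
    words = set(re.findall(r"[a-z]+", post.lower()))
    hits = sorted(words & flagged_terms)
    if not hits:
        return "published"
    queue.submit(post, hits)
    return "published_pending_review"

queue = ReviewQueue()
print(moderate("Push pull hoe!", {"hoe"}, queue))
print(moderate("A good trowel is hard to beat", {"hoe"}, queue))
print(queue.pending)
```

The cost shifts from wrongly removed posts to reviewer workload: every blocklist hit, justified or not, now needs a person to look at it.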

Originally posted to the Trust & Safety Foundation website.

Filed Under: content moderation, hoe, scunthorpe problem
Companies: facebook

Content Moderation Case Study: Lyft Blocks Users From Using Their Real Names To Sign Up (2019)

from the scunthorpe-again? dept

Wed, Apr 14th 2021 03:33pm - Copia Institute

Summary: Users attempting to sign up for a new ride-sharing program ran into a problem from the earliest days of content moderation. The “Scunthorpe problem” dates back to 1996, when AOL refused to let residents of Scunthorpe, England register accounts with the online service. The service’s blocklist of “offensive” words picked out four of the first five letters of the town’s name and served up a blanket ban to residents.

Flash forward twenty-three years and services still aren’t much closer to solving this problem.

Users attempting to sign up for Lyft found themselves booted from the service for “violating community guidelines” simply for attempting to create accounts using their real names. Some of the users affected were Nicole Cumming, Cara Dick, Dick DeBartolo, and Candace Poon.

These users were asked to “update their names,” as though such a thing were even possible to do with a service that ties names to payment systems and internal efforts to ensure driver and passenger safety.

Decisions to be made by Lyft:

Should names triggering Community Guidelines violations be reviewed by human moderators, rather than automatically rejected?
Is the cross-verification process enough to deter pranksters and trolls from activating accounts with actually offensive names?

Questions and policy implications to consider:

Considering the identification system is backstopped by credit cards and payment services that require real names, does deploying a blocklist actually serve any useful purpose?
Given that potential users are likely to abandon a service that generates too much friction at sign up, does a blocklist like this do damage to company growth?
Does global growth create a larger problem by adding other languages and possible names that will trigger rejections of more potential users? Can this be mitigated by backstopping more automatic processes with human moderators?

Resolution: The users affected by Lyft’s blocklist were reinstated. Lyft apologized for the rejections, pointing a finger at automated moderation efforts designed to keep people from creating offensive content using nothing more than the First Name/Last Name fields.

Unfortunately, the problem still hasn’t been solved. Candace Poon, whose first attempt to sign up for Lyft was rejected, just ran into the same issue attempting to create an account for the new social media platform Clubhouse.

Originally posted to the Trust & Safety Foundation website.

Filed Under: content moderation, filtering, keywords, names, scunthorpe problem
Companies: lyft
