Samuel Abram (profile)

September 25, 2020 at 3:47 pm

"flesh color"

The thing is, not all flesh color is pasty white; there is also flesh color that is darker in complexion, and unfortunately, many places in Silicon Valley fail to take that into account. The reason why it’s relevant to this article is that darker skin tones don’t always get tested and lighter skin tones get over-tested, so you have that disparity and whereas over-testing results in many false positives, false negatives can permeate when it comes to people with darker skin tones because the lack of diversity in skin color in Silicon Valley’s firms mean that people with more melanin don’t get true positives, let alone false ones.

Anonymous Coward

September 25, 2020 at 4:05 pm

My question is: If they are using "AI", why does it never seem to get any better at its job?

I’ve watched people put video and images through Google’s API for their cloud machine learning wotsit, and it is frequently incredibly awful.

(I don’t know where the interface lives, or if it still does, (or one needs an account) but this
https://cloud.google.com/vision/docs/how-to and in particular the Safe Search bit. )

Anonymous Coward

September 25, 2020 at 8:25 pm

Re: Re:

Because "AI" isn’t some kind of magical label that you can slap onto something to make it better. As of now, anything labeled "AI" is either (1) one of several variations on optimization algorithms which are not explicitly coded or (2) labeled incorrectly for marketing reasons.

Assuming that it’s (1), it still runs into the same problems as any other optimization algorithm… access to additional training data does not guarantee improvement, and is routinely either wholly or partially detrimental.

This is exacerbated in many cases due to the sparsity of labeled inputs relative to all potential inputs, that labeled inputs are not representative (it’s often not even possible to define what such representative sets would look like with our current understanding of the field), and that significant portions of those labeled inputs are internally inconsistent due to disagreement among human moderators, changes in strategy over time, accidental separation from context, etc.

You will find in particular that nobody has yet found a way to generally recognize the contents of an image (though some progress has been made on recognizing specific types of images eg human faces, landmarks). What algorithms there are simply don’t "see" images in any way similar to how humans see images; most image algorithms still struggle to reliably ignore compression artifacts in otherwise identical images… something that many humans don’t even notice.

Anonymous Coward

September 26, 2020 at 5:55 pm

Re: Re: Re:

Labels: Yeah, that’s why "AI" is in quotes. AI is a field of study, not a product, and certainly not even a working model anywhere. Machine learning is central to AI, and still super loosey-goosey as to whether any of that works in any of the domains where people apparently really really want to use it. (Remember expert systems? I guess that term is like time-sharing is to the cloud.)

Anyway, my starting assumption is that AI and machine learning are indeed not what they are portrayed to be. I suppose i should have directly asked "if they are so bad at rating, classification, or identification, and do not seem to improve over years of input*, why are they being used at all?" (OK, the answer to that is the dosomething pressure and the fact that people like releasing pre-alpha crap into production – the good-enough philosophy. I should not have bothered writing anything i suppose.)

*Input: If there isn’t manual review more often, the negative feedback is never input to the system…

PaulT (profile)

September 26, 2020 at 12:26 am

Re: Re:

"My question is: If they are using "AI", why does it never seem to get any better at its job?"

It does. It’s just that identifying subjective content will never be perfect, and the best it can ever do is the same as a human being. Who will never be perfect at such a task, given that you can sit 100 human beings in a room and they will never agree on a subject. I’ll guarantee that if you did so, one of the 100 people would have flagged the above image.

The advantages of AI in this setting is speed and volume of processing. If you want accuracy surrounding subjective material, you want magic.

Anonymous Coward

September 26, 2020 at 5:41 pm

Re: Re: Re:

Well sure, no one will ever agree on subjective matters, but i do not see any improvement in the putative machine learning for identifying even nudity or "raciness". Fine, it will of course be based on someone’s operant definition of "racy" or whatever, but lol a black and white photo of a cartoonish costume head (among endless other things)? No, no improvements there.

Anonymous Coward

September 26, 2020 at 4:42 am

It might be easier to understand moderation if you think about something that is universally despised. Like politics, but less complicated. Say … SPAM.

Everyone knows what spam is. Everyone agrees that in a perfectly just world spammers would be slow-cooked while their skin was being removed by an acid mist–before their bones were ground up to make latrine bricks.

Gmail does an extremely good job of filtering out spam. And yet, and yet–who hasn’t (very occasionally) seen important email show up in their spam folder? And the spammers are still operating, so apparently enough spam is getting through the filters to make the abhorrent habit profitable.

How do you react?

Well, if you’re an insane egocentric idiot, you immediately go across the web, posting that Google has it in for your bank, or nonprofit org, or second-cousin-once-removed, because THEIR email was deprecate, whereas some other parties’ email did not get filtered. You get your congresscritter (whichever side of the aisle they lair and liar in) to fulminate and spray froth all over the Sacred Halls of Our Republic. And you wrap yourself in a putrescent cloak of victimhood.

If you are sane, or less stupid than yeast, or you have any consideration at all for the difficulties other people are having in their quest to make your online experience less painful, then you try a different approach. In fact, even if you’re a viscious spammer, you take the different approach!

You look carefully at the deprecated email, looking for words or word-parts that could appear (to a stupid computer, not that there is any other kind) to be commercial/promotional. You look at the email address and sending server and linked-to sites to see if they show patterns that are commonly associated with spam. You remove your second cousin, bank, and charitable org from the blacklist and add them to the whitelist. And, if you’re a spammer, you try to recraft your spam so as not to LOOK like spam to the stupid computer. YOU DO NOT TAKE IT PERSONALLY, BECAUSE THE STUPID COMPUTER IS NOT A PERSON; IT IS CONTROLLED BY AN ARITHMETIC EXPRESSION, NOT A SOUL.

And if life is still sometimes complicated, frustrating, and inexplicable–welcome to the human condition.

Samuel Abram (profile)

September 26, 2020 at 6:56 am

Re: Re:

YOU DO NOT TAKE IT PERSONALLY, BECAUSE THE STUPID COMPUTER IS NOT A PERSON; IT IS CONTROLLED BY AN ARITHMETIC EXPRESSION, NOT A SOUL.

It reminds me of what I asked my father when I was young:

Me: "Daddy, are computers perfect?"
My father: "Computers aren’t perfect, but they expect us to be perfect."

If anything, computers are only as good as what we make out of them.

PaulT (profile)

September 26, 2020 at 8:43 am

Re: Re: Re:

"My father: "Computers aren’t perfect, but they expect us to be perfect.""

The better way of explaining this is the old adage GIGO – Garbage In, Garbage Out. Computers do perfectly do what they’re told to do. But, a human operator needs to tell them what to do, and they are far from perfect. If someone gives them a bad instruction, be that the coder who created the programs they run, or a user not using the program correctly, they will perfectly follow the bad instruction.

Rocky

September 26, 2020 at 8:59 am

Re: Re: Re: Re:

"Stupid computer! It doesn’t do what I want, only what I tell it to do!!"

Anonymous Coward

September 26, 2020 at 12:14 pm

Re: Re: Re:

The best thing about computers is that they do exactly what you tell them to do. You can’t say that about a lot of people. The worst thing about computers is that they do exactly what you tell them to do. No matter how stupid that is.

SpaceLifeForm

September 26, 2020 at 12:36 pm

The Paradox of Insanity

Einstein:

The definition of insanity is doing the same thing over and over and expecting different results.

Programmer:

I keep running the same program over and over and I keep getting different results!

Tin-Foil-Hat

September 26, 2020 at 2:03 pm

There should be some rules

Youtube is notorious. The encourage users to create content and when they quit their day job to create content YouTube demonetizes them for some unknown reason. It’s difficult to be reinstated too.

There really should be some obligation. They want business owners to use the service but when the business’ communication is shut down the platform is 100% void of responsibility even though the business is harmed.

Youtube, Twitter and Facebook should be lower priority methods of communication if you care about consistency and reliability.

Stephen T. Stone (profile)

September 26, 2020 at 2:58 pm

Re:

They want business owners to use the service but when the business’ communication is shut down the platform is 100% void of responsibility even though the business is harmed.

Why should we hold YouTube responsible for the decision of a third party to rely on one service so heavily that getting the boot from said service can fuck up their entire business model?

Samuel Abram (profile)

September 26, 2020 at 4:11 pm

Re: There should be some rules

Had you said this on Twitter, I would screencap this comment (or tweet) and send it to the @badsec230takes account.

But since it’s on TechDirt, I’ll just show you this.

PaulT (profile)

September 27, 2020 at 11:30 pm

Re: There should be some rules

"The encourage users to create content and when they quit their day job to create content YouTube demonetizes them for some unknown reason."

Perhaps the problem isn’t YouTube, but the idiot who decided to base his entire business on a single supplier, then violated the T&Cs of that supplier’s contract?

"They want business owners to use the service but when the business’ communication is shut down the platform is 100% void of responsibility even though the business is harmed."

There’s not zero recourse. But, the user is not their customer, and if the user decides to violate YouTube’s policies in a way that puts off their paying customers (i.e. advertisers), YouTube do not have an obligation to throw free money at people who are losing it customers.

"Youtube, Twitter and Facebook should be lower priority methods of communication if you care about consistency and reliability."

Perhaps true. So why, in your example, is the user who decided to base their entire business on an unreliable and inconsistent platform not responsible for that decision?

BernardoVerda (profile)

September 26, 2020 at 4:20 pm

Twitter is "porn-friendly"?

That’s… odd.
I personally see very little, if any, porn on Twitter.
But I have actually turned off the so-called "sensitive content" filter — simply because it was blocking so much stuff, that wasn’t porn, and wasn’t sensitive (and wasn’t even NSFW).

Anonymous Coward

September 26, 2020 at 6:04 pm

Re: Re:

I would expect that one would have to know where to look, and once you found a few instances, Twitter would then recommend them to you in the future.

Anonymous Coward

September 28, 2020 at 8:22 am

Banned from FB for violating unknowable "community standards"

4 times, banned, for posting images that violated "community standards". Several of the images were also posted by political groups who weren’t banned. One was "Don’t wash your MAGA hat with your klan outfit". I cannot post any image which suggests the GOP are similar to Nazis. No swastikas, etc. Yet I’m banned now for 30 days. Nice, right?

Thursday
15:43	Content Moderation Case Study: Facebook Struggles To Correctly Moderate The Word 'Hoe' (2021) (21)
Wednesday
15:32	Content Moderation Case Study: Linkedin Blocks Access To Journalist Profiles In China (2021) (1)
Wednesday
16:12	Content Moderation Case Studies: Snapchat Disables GIPHY Integration After Racist 'Sticker' Is Discovered (2018) (11)
Thursday
15:30	Content Moderation Case Study: Tumblr's Approach To Adult Content (2013) (5)
Wednesday
15:41	Content Moderation Case Study: Twitter's Self-Deleting Tweets Feature Creates New Moderation Problems (2)
Wednesday
15:47	Content Moderation Case Studies: Coca Cola Realizes Custom Bottle Labels Involve Moderation Issues (2021) (14)
Wednesday
15:28	Content Moderation Case Study: Bing Search Results Erases Images Of 'Tank Man' On Anniversary Of Tiananmen Square Crackdown (2021) (33)
Wednesday
15:32	Content Moderation Case Study: Twitter Removes 'Verified' Badge In Response To Policy Violations (2017) (8)
Wednesday
15:36	Content Moderation Case Study: Spam "Hacks" in Among Us (2020) (4)
Wednesday
15:37	Content Moderation Case Study: YouTube Deals With Disturbing Content Disguised As Videos For Kids (2017) (11)
Thursday
15:48	Content Moderation Case Study: Twitter Temporarily Locks Account Of Indian Technology Minister For Copyright Violations (2021) (8)
Wednesday
15:45	Content Moderation Case Study: Spotify Comes Under Fire For Hosting Joe Rogan's Podcast (2020) (64)
Wednesday
15:48	Content Moderation Case Study: Twitter Experiences Problems Moderating Audio Tweets (2020) (6)
Thursday
15:48	Content Moderation Case Study: Dealing With 'Cheap Fake' Modified Political Videos (2020) (9)
Thursday
15:35	Content Moderation Case Study: Facebook Removes Image Of Two Men Kissing (2011) (13)
Thursday
15:23	Content Moderation Case Study: Instagram Takes Down Instagram Account Of Book About Instagram (2020) (90)
Wednesday
15:49	Content Moderation Case Study: YouTube Relocates Video Accused Of Inflated Views (2014) (2)
Wednesday
15:34	Content Moderation Case Study: Pretty Much Every Platform Overreacts To Content Removal Stimuli (2015) (23)
Friday
16:03	Content Moderation Case Study: Roblox Tries To Deal With Adult Content On A Platform Used By Many Kids (2020) (0)
Wednesday
15:43	Content Moderation Case Study: Twitter Suspends Users Who Tweet The Word 'Memphis' (2021) (10)
Friday
15:35	Content Moderation Case Study: Time Warner Cable Doesn't Want Anyone To See Critical Parody (2013) (14)
Wednesday
15:38	Content Moderation Case Studies: Twitter Clarifies Hacked Material Policy After Hunter Biden Controversy (2020) (9)
Friday
15:42	Content Moderation Case Study: Kik Tries To Get Abuse Under Control (2017) (1)
Wednesday
15:31	Content Moderation Case Study: Newsletter Platform Substack Lets Users Make Most Of The Moderation Calls (2020) (8)
Friday
15:40	Content Moderation Case Study: Knitting Community Ravelry Bans All Talk Supporting President Trump (2019) (29)
Wednesday
15:50	Content Moderation Case Study: YouTube's New Policy On Nazi Content Results In Removal Of Historical And Education Videos (2019) (5)
Friday
15:36	Content Moderation Case Study: Google Removes Popular App That Removed Chinese Apps From Users' Phones (2020) (28)
Wednesday
15:42	Content Moderation Case Studies: How To Moderate World Leaders Justifying Violence (2020) (5)
Wednesday
15:47	Content Moderation Case Study: Apple Blocks WordPress Updates In Dispute Over Non-Existent In-app Purchase (2020) (18)
Friday
15:47	Content Moderation Case Study: Google Refuses To Honor Questionable Requests For Removal Of 'Defamatory' Content (2019) (25)

Content Moderation Case Study: Twitter's Algorithm Misidentifies Harmless Tweet As 'Sensitive Content' (April 2018)

from the content-moderation-isn't-easy dept

Comments on “Content Moderation Case Study: Twitter's Algorithm Misidentifies Harmless Tweet As 'Sensitive Content' (April 2018)”

Leave a Reply to Stephen T. Stone Cancel reply

Comment Options:

What's this?

Techdirt Daily Newsletter

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Email This Story

Tools & Services

Company

Contact

More