China's Precision Censorship Machine Allows Some Controversial Keywords, But Blocks Combinations Of Them

from the politically-problematic-images-also-a-no-no dept

Mon, Apr 17th 2017 05:50pm - Glyn Moody

China’s censorship of the Internet is both impressively thorough, and yet surprisingly subtle at times. For example, we’ve already written about ways in which the boundary between censored and non-censored is often vague, which paradoxically encourages people to be even more cautious than they would be with well-defined limits. But hidden among all the uncertainty, are there perhaps some fixed rules about when posts will definitely get censored?

A team of researchers at the University of Toronto’s Citizen Lab decided to find out by investigating one of the topics considered most controversial by the Chinese authorities, the so-called “709 Crackdown.” This refers to a major government clampdown that began on July 9 in 2015, when more than 250 Chinese rights lawyers, law firm staff, activists, and their relatives were detained by public security agents across China. Internet users are understandably keen to discuss this important event, and many of those conversations take place on the main blog site in China, Weibo, and using the messaging service WeChat, which is even more popular. But as the researchers discovered, those online conversations were subject to subtle but consistent interference:

as our experiments show, a good portion of that discussion fails to reach Chinese users of WeChat and Weibo. Our research shows that certain combinations of keywords, when sent together in a text message, are censored. When sent alone, they are not. So, for example, if one were to text Mainland China or Wang Quanzhang’s Wife or Harassment on Relatives [all written in Chinese characters] individually, the messages would get through. Sent together, however, the message would be censored.

Moreover, for the first time the researchers discovered censorship not just of text, but of images too:

In addition to a large number of censored keyword combinations our tests unearthed, we also discovered 58 images related to the 709 Crackdown that were censored on WeChat Moments for accounts registered with a mainland China phone number. (For accounts registered with a non-mainland China phone number, on the other hand, the images and keyword combinations go through fine).

Neither of these observations is earth-shattering in itself, but they do add usefully to our knowledge of the intricate clockwork of China’s mighty censorship machine.

Follow me @glynmoody on Twitter or identi.ca, and +glynmoody on Google+

Comments on “China's Precision Censorship Machine Allows Some Controversial Keywords, But Blocks Combinations Of Them”

Anonymous Coward

April 18, 2017 at 10:56 am

“Microsoft notes FISA orders are on the rise. Of course, its reporting is limited to useless “bands,” so the only thing that can definitely be determined is Microsoft’s FISA interactions have at least doubled.”

From a different post about a different country and a different thing but why censor when you can just collect it all?

CypherDragon (profile)

April 18, 2017 at 7:52 pm

Data classification

Sounds like they are using some variant of a data leakage protection (DLP) product for the censoring. One of the key features with most DLP products is that you can set thresholds for what triggers the rule. Eg, I want to block anything with the words “TechDirt” “Censorship” “Moody” and “China” but only if it has all 4 of those words in it. Simple to do with a DLP policy. Alternately, I could have a list of keywords, and have it trigger the policy once it hits a certain count.

These systems are fairly robust, but they aren’t without their flaws. Also, the system will only be as good as the policy makers can target their policies.

Anonymous Coward

April 19, 2017 at 12:22 am

A system that blocks specific combinations of keywords… smells like My_Name_Here’s involvement.

Anonymous Coward

April 23, 2017 at 4:56 pm

Trump is thinking: “Why can’t we do this?”

Add Your Comment

Sunday
12:00	Funniest/Most Insightful Comments Of The Week At Techdirt (0)
Saturday
12:00	This Week In Techdirt History: July 5th - 11th (0)
Friday
19:39	Xbox Lays Off 20% Of Staff, Cut Studios, Largely Impacting Acquired Devs It Promised It Wouldn't Layoff (6)
15:50	How Google And AI Nearly Made A Seasoned Reporter Spiral (15)
13:18	Ctrl-Alt-Speech: Sell Me Lies, Sell Me Sweet Meta Lies (0)
11:06	FCC General Counsel Channels Founding Fathers To Falsely Claim First Amendment Allows Banning Porn (19)
11:01	Daily Deal: The All-in-One Adobe Creative Cloud Suite Course Bundle (0)
09:23	Adults Broke The Internet, And They're Trying To Fix It By Kicking Kids Off (17)
05:22	FTC Strikes Settlement With John Deere On 'Right To Repair' (6)
Thursday
21:54	Mom That Blamed Deaths Of 1 Year Old Twins On Vaccines Charged With Their Murder (30)

China's Precision Censorship Machine Allows Some Controversial Keywords, But Blocks Combinations Of Them

from the politically-problematic-images-also-a-no-no dept

Comments on “China's Precision Censorship Machine Allows Some Controversial Keywords, But Blocks Combinations Of Them”

Data classification

Add Your Comment Cancel reply

Comment Options:

What's this?

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Sunday

Saturday

Friday

Thursday

More

Tools & Services

Company

Contact

More

China's Precision Censorship Machine Allows Some Controversial Keywords, But Blocks Combinations Of Them

from the politically-problematic-images-also-a-no-no dept

Comments on “China's Precision Censorship Machine Allows Some Controversial Keywords, But Blocks Combinations Of Them”

Add Your Comment Cancel reply

Comment Options:

What's this?

Techdirt Daily Newsletter

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Sunday

Saturday

Friday

Thursday

More

Email This Story

Tools & Services

Company

Contact

More