DailyDirt: Correlation Is Not Causation

from the urls-we-dig-up dept

Wed, Jul 2nd 2014 05:00pm - Michael Ho

Big data is a term that’s been getting some buzz as the next thing that’s going to change everyone’s lives (for better or worse, depending on how you look at it). Having a lot of data doesn’t necessarily mean you also have a lot of useful knowledge. Garbage in, garbage out, so they say. And making correlations is easy compared to finding a direct causal relationship. However, that hasn’t stopped (so-called) journalists from writing misleading headlines. If you hate correlations being mistaken for causation, submit examples you’ve seen in the comments below. Here are just a few to start off.

If you’d like to read more awesome and interesting stuff, check out this unrelated (but not entirely random!) Techdirt post via StumbleUpon.

Comments on “DailyDirt: Correlation Is Not Causation”

Anonymous Coward

July 2, 2014 at 5:09 pm

“You may be interested to know that global warming, earthquakes, hurricanes, and other natural disasters are a direct effect of the shrinking numbers of Pirates since the 1800s.”

http://www.venganza.org/about/open-letter/

Anonymous Anonymous Coward

July 2, 2014 at 5:46 pm

Re: Re:

Post doesn’t count if you weren’t wearing your colander while writing it. Can you please supply evidence?

Anonymous Coward

July 3, 2014 at 5:01 am

Re: Re: Re:

Sorry – I was in full Pirate Regalia

Sheogorath (profile)

July 2, 2014 at 5:49 pm

I got one!

People claiming that the MMR causes Autism just because the vaccine is given around the time that late-oneset Autism first appears.

Chronno S. Trigger (profile)

July 2, 2014 at 7:20 pm

Go to news.google.com and click the health section. I guarantee you’ll find correlation articles. I just did and apparently dark chocolate (not milk chocolate) helps people with peripheral artery disease walk. Doesn’t matter that they only tested on 20 people or that the change was only 11%.

Anonymous Coward

July 2, 2014 at 10:22 pm

Re: Re:

If the study controls for reasonable factors and the 20 subjects were validly random, then it could be legit – at least as an initial study.

The more subjects a study needs to prove a point, the less you should trust the results. Psychiatric drug field studies routinely twiddle the numbers to get the results they want by combining samples from beneficial (lucky?) studies into cohorts that don’t exhibit any positive reponse to show that on average, patients from the two cohorts show a positive response!

Citation: http://www.youtube.com/watch?v=A3YB59EKMKw I think. I’m at work now so can’t confirm, but I’m pretty sure that’s the one.

Anonymous Coward

July 3, 2014 at 5:09 am

Re: Re: Re:

“The more subjects a study needs to prove a point, the less you should trust the results.”

“Needs to prove a point”, implies this is not science, but rather a marketing ploy.

In a well constructed experiment or “study”, as the sample size increases so does the precision.

Anonymous Coward

July 3, 2014 at 8:09 pm

Re: Re: Re: Re:

“Needs to prove a point”, implies this is not science, but rather a marketing ploy.

Well, yes.

In a well constructed experiment or “study”, as the sample size increases so does the precision.

Well, no. In a well constructed experiment, precision will remain constant regardless of the sample size, but the resolution of the findings may be different.

eg- assuming correct randomisation and good controls across all sample sizes, a study of 20 subjects with no negative outcomes means you can confidently state a rate of “less than 1 in 20”. Take it up to 20,000 subjects, you might find 100 negative outcomes relative to control, meaning you can refine your rate of

Anonymous Coward

July 3, 2014 at 9:23 pm

Re: Re: Re:² Re:

ate my comment!

meaning you can refine your rate of

Anonymous Coward

July 3, 2014 at 9:23 pm

Re: Re: Re:² Re:

take 2…

meaning you can refine your rate of less than 0.05 to less than 0.01 (or lower? My statistics-fu is weak)

Of course, neither study proves that the rate across the entire population isn’t really 0.5, but that’s what randomisation is supposed to (try to) address. Alternatively, even if the rate across the study population is accurate, it can be difficult to determine if a particular person might fit into that population or not.

Anonymous Coward

July 4, 2014 at 6:49 am

Re: Re: Re:² Re:

“precision will remain constant regardless of the sample size, but the resolution of the findings may be different”
– This is incorrect. You assume the sample size quantity exceeds the quantity of possible unique results. When the aforementioned is not the case, increased resolution would only provide more detail of an incomplete data set.

“assuming correct randomisation”
– This is an attempt at simplifying the problem, as clearly there is no such thing as “correct randomization”

Lawrence D’Oliveiro

July 3, 2014 at 12:29 am

So What If Correlation Is Not Causation?

How does that cause you to conclude anything?

Anonymous Coward

July 3, 2014 at 2:45 am

http://www.tylervigen.com/?new=TRUE

Anonymous Coward

July 3, 2014 at 5:14 pm

1) An increase of global surveillance since 9/11 by the NSA correlates with a decrease in terrorist attacks killing more than 2,500.
Conclusion: surveillance works, so we should do more.

2) An increase of global surveillance since 9/11 by the NSA correlates with an increase in global terrorist activity.
Conclusion: surveillance would work if we could do more.

Of course, #2’s predicate might actually involve legitimate causation…

Add Your Comment

Saturday
12:00	Game Jam Winner Spotlight: As I Lay Flying (0)
Friday
19:39	NVIDIA's DLSS 5 Demo Video Briefly Taken Down Because YouTube's Take Down Process Sucks (2)
15:07	Trump's Two-Faced AI Policy (1)
13:03	Trump Threatens CNN For Very Basic Reporting On His Shitty, Unpopular War (11)
11:06	AI And Cybersecurity: A Glass Half-Empty/Half-Full Proposition, Where The Glass Is Holding Nitroglycerin (18)
11:01	Daily Deal: Luminar Mobile for iOS And Android (0)
09:31	No Surprise Here: Inspection Reveals Dozens Of Violations In El Paso ICE Detention Center (8)
05:26	Court Blocks Republican Push To (Further) Dominate And Destroy Local Broadcast News (7)
Thursday
20:03	Court Dismisses Pepperdine's Nonsense Trademark Suit Against Netflix Over 'Running Point' (1)
15:30	Ctrl-Alt-Speech: Honey, I Shrunk the Kids' Internet (0)

DailyDirt: Correlation Is Not Causation

from the urls-we-dig-up dept

Comments on “DailyDirt: Correlation Is Not Causation”

Re: Re:

Re: Re: Re:

I got one!

Re: Re:

Re: Re: Re:

Re: Re: Re: Re:

Re: Re: Re:² Re:

Re: Re: Re:² Re:

Re: Re: Re:² Re:

So What If Correlation Is Not Causation?

Add Your Comment Cancel reply

Comment Options:

What's this?

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Saturday

Friday

Thursday

More

Tools & Services

Company

Contact

More

DailyDirt: Correlation Is Not Causation

from the urls-we-dig-up dept

Comments on “DailyDirt: Correlation Is Not Causation”

Add Your Comment Cancel reply

Comment Options:

What's this?

Techdirt Daily Newsletter

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Saturday

Friday

Thursday

More

Email This Story

Tools & Services

Company

Contact

More