Reducing Your Book Buying To Statistically Ridiculous Triviality

from the how-many-words-per-dollar-is-that? dept

Tue, Aug 30th 2005 10:31am - Mike Masnick

Earlier this year, Amazon.com got some press for revealing their “text stats” with “statistically improbably phrases,” listing out phrases that tend to only appear in that particular book. There were other stats as well — and all were about equally as useless. It appears that the Washington Post has just discovered these silly stats and has written up an amusing article noting some of the completely useless and trivial stats you can now compare different books over. They really seem to like the “words per dollar” feature, for instance. “But in its pure form, Text Stats is a triumph of trivialization…. Now you too can sound like a literary insider at Washington cocktail parties. You can throw around statistics and make clever conversation about the hard history books, the long-winded novels, even those thick, heavy, make-you-think philosophy tomes that contain really, really long words. And the beauty of it is, with Amazon’s “Search Inside” Text Stats and other features, you won’t even have to read them.”

Comments on “Reducing Your Book Buying To Statistically Ridiculous Triviality”

dorpus

August 30, 2005 at 2:10 pm

But can it do Type I Nested F-tests?

You can have covariates that appear insignificant on their own, but appear significant in the presence of another covariate. Or the opposite can occur. We have condition indexes, variance inflation factors, and type I F-tests to evaluate these phenomena.

It might be fun to perform a principal components analysis (PCA) on a book, to find eigenvectors of words that describe a typical page. What if every page in a book is merely a linear combination of eigenvectors? It would probably work really well for Techdirt, with its predictable anti-recording-industry postings, free market dogma, and anti-dorpus rants.

stochastix

August 31, 2005 at 3:00 am

SIPs and search

SIPs are useful for certain types of searches (especially technical stuff). A SIP is very rare in the universe of books… so if you can find a small set of books where the phrase occurs several times then there is a strong chance that these books are relevant to the topic of the SIP.

Anonymous Coward

August 31, 2005 at 9:10 am

how ironic

What phrase is so improbable as “statistically improbably”?

malhombre

August 31, 2005 at 9:25 am

Re: how ironic

Or “anti-dorpus rant”

Add Your Comment

Friday
19:39	Knox County, TN Rolls Back 'Roots' Book Ban After Backlash (2)
15:24	How AI Can Lead To False Arrests & Wrongful Convictions (7)
13:09	Ctrl-Alt-Speech: Deus vs. Machina (0)
11:15	Court Temporarily Freezes Trump's $1.776 Billion 'Anti-Weaponization' Slush Fund To Figure Out WTF Is Going On (20)
11:10	Daily Deal: MasterBundle For Web Designers (0)
09:21	City Lawmaker Responds To Flock Camera Ban By Demanding A Cell Phone Ban (5)
05:22	Trump FCC Proposes Vile New Trans Panic TV Warnings (35)
Thursday
19:53	Stop Killing Games Gets Its First American Legislative Effort Out Of Committee in California (8)
15:07	Violent Crime In The US Is At Record Lows, But The DOJ Is Eliminating The Funding That Helped Reduce Crime (8)
13:35	Enemies Are Exploiting Unregulated Data Broker Location Data To Target And Kill U.S. Troops (12)

Reducing Your Book Buying To Statistically Ridiculous Triviality

from the how-many-words-per-dollar-is-that? dept

Comments on “Reducing Your Book Buying To Statistically Ridiculous Triviality”

But can it do Type I Nested F-tests?

SIPs and search

how ironic

Re: how ironic

Add Your Comment Cancel reply

Comment Options:

What's this?

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Friday

Thursday

More

Tools & Services

Company

Contact

More

Reducing Your Book Buying To Statistically Ridiculous Triviality

from the how-many-words-per-dollar-is-that? dept

Comments on “Reducing Your Book Buying To Statistically Ridiculous Triviality”

Add Your Comment Cancel reply

Comment Options:

What's this?

Techdirt Daily Newsletter

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Friday

Thursday

More

Email This Story

Tools & Services

Company

Contact

More