Google Books Data Mining Reveals Mad Men's Big Historical Flaw: Business Lingo

from the keep-a-low-profile dept

The TV show Mad Men has quite a reputation for going to great lengths to be as authentic as possible. The clothes, the props, the scenarios are all supposedly thought out in great detail. While some who were actually in the business at the time quibble with certain aspects of the show, it cannot be denied that the show’s producers certainly go way beyond other period pieces to try to make keep everything accurate for the time period. However, it turns out that there’s one area where it appears the writers have completely flopped: period-specific language. On The Media ran an absolutely fascinating clip about a researcher who has shown how frequently Mad Men uses words or phrases that were not in popular usage at the time, but only came into the lexicon at a later date:


This is actually a cross-broadcast of another podcast, Lexicon Valley, and it’s covering the work of Ben Schmidt, who has produced a software algorithm that compares the Mad Men scripts… to a searchable database of language from Google’s book scanning project. Schmidt’s algorithm compares the language from the show with scanned books from the same period. Schmidt has a website, Prochronism, which covers his findings. I can’t quite explain why, but it’s really quite fascinating.

Schmidt has found that the show is pretty good about getting language about technology right (with one exception). It knows that there aren’t fax machines and computers and stuff. The one area where it gets things wrong, is with the phone. For example, using the phrase “on hold.” He notes that phones had hold buttons, but there wasn’t yet a concept of the state of being “on hold.” That showed up in the 70s.

What Schmidt has also found is that the show is absolutely terrible about getting “business” terms correct in a period specific way. That same post about “on hold” also chides the show for using “defining moment,” another phrase that showed up in the 70s, but was basically stuck in academia until the late 80s or early 90s when it became a popular phrase.


Honestly, Ben’s site is really fascinating. I could spend hours on it (and actually had to stop going through it post by post to finish this post). There are also discussions on phrases like “focus groups” and “leverage.” But one more awesome chart from Ben, discussing the use of both “moral high ground” and “consumerism,” both of which were barely in use until much later:


On the podcast, they discuss how part of the reason that the show gets the language about technology right, but not business, is because we know that technology rapidly evolves and we’re more attuned to it. But people don’t pay nearly as much attention to how business changes and especially how the language of business changes over time. I guess that’s true, though it doesn’t surprise me that “consumerism” and “moral high ground” are both more recent phenomena. “Defining moment” and “on hold” are a bit more surprising to me.

Either way, I also wanted to highlight something else about all of this that I find fascinating. For all the talk by some about just how evil Google’s book scanning project is, this kind of effort and research wouldn’t be possible without large scale scanning of books. While this particular example may appear (on its face) to be a frivolous (even if it’s fascinating) area of research, it does highlight just how collection of certain data can open up vast arrays of data that can be mined in useful ways. When people freak out about new technologies and services, they almost always focus on how it impacts the old offerings. So most of the talk was about book scanning and its impact on book sales. But what almost no one talks about is how it enables new things that simply weren’t possible before — such as being able to build an algorithm like the one Ben built. Those kinds of innovations — the unexpected “externalities” of projects like the Google book scanning project — shouldn’t be ignored, because there’s tremendous value that can come out of them.

Filed Under: , , , , , ,

Rate this comment as insightful
Rate this comment as funny
You have rated this comment as insightful
You have rated this comment as funny
Flag this comment as abusive/trolling/spam
You have flagged this comment
The first word has already been claimed
The last word has already been claimed
Insightful Lightbulb icon Funny Laughing icon Abusive/trolling/spam Flag icon Insightful badge Lightbulb icon Funny badge Laughing icon Comments icon

Comments on “Google Books Data Mining Reveals Mad Men's Big Historical Flaw: Business Lingo”

Subscribe: RSS Leave a comment
27 Comments
Chuck Norris' Enemy (deceased) (profile) says:

Re: Re:

Yes, because Lowery’s whole concept is based on the morality of file sharing. Of course, morality has nothing to do with economics…which is kinda the point made numerous times by Mike, et. al. on this blog. All the moral outrage from the content industries only brings terrible laws by the moral police (government) to the detriment of basic human rights. Moral outrage won’t bring any more money from fans…especially when fans (paying customers or not) are constantly called thieves and idiots by the morally superior.

Gwiz (profile) says:

Re: Re:

Bottom line: copying music is unethical.

Unethical to whom? Sorry, you are not *my* moral authority.

“Must someone, some unseen thing, declare what is right for it to be right? I believe that my own morality – which answers only to my heart – is more sure and true than the morality of those who do right only because they fear retribution.”
― Brandon Sanderson, The Way of Kings

Chosen Reject (profile) says:

Re: Re: Re:

If anything, copyright is immoral. Copyright doesn’t create a new right for the holder, it only takes away rights from everyone else in order to make that holder’s right exclusive. Everyone already has the right to copy and distribute any publicly available material. Copyright takes those rights away from everyone except the holder. And with increasing lengths of copyright, that means we are having our rights taken away but get nothing in return. That is immoral and unethical.

Go ahead Emily, keep on sharing. Stand up for your natural rights and tell the government-made rights to stuff it.

Anonymous Coward says:

Re: Re:

“Artists can make money on the road (or its variant ?Artists are rich?). The average income of a musician that files taxes is something like 35k a year w/o benefits.”

Since you asked for a response, how about you respond to the economic issue…

During this time period when the ‘average income of a musician’ was 35K a year (this is from your article, you find the reference and period), What was the average ‘record executive’s’ salary? How about the average ‘label executive’? How about the average **AA cut?

Please show us how well the musicians were doing compared to the ‘fat middlemen’ in this era so that we can properly judge today’s artists and middlemen…

hegemon13 says:

Language of Today

I personally don’t have a huge problem with this. The show is a work of fiction. It is a period piece, yes. It has to show enough authenticity to be convincing. It does NOT have to serve as precise reference material.

While it is a show about the past, it is also a show FOR the present. To a degree, it must be written in the language of today. To much adherence to past language would make the show indecipherable to a lot of people. If the language leaves you confused, that’s as much a breach of the fourth wall as an egregious anachronism.

The job of a fiction writer is to be convincing, not to be accurate.

The site is definitely an interesting analysis of language evolution. Comparing the real past to our fictional ideas of the past is a fascinating and worthwhile exercise. However, to use that analysis to chide a fictional show for being inaccurate is silly and pointless.

Brendy says:

Re: Language of Today

I think it’s important to capture the language of the time as well. I mean one of the points in the podcast was that they used the term “negro” instead of “African-American.” The show would have lost a bit of credit in my mind if they had swapped the proper word for today. I think they can get by and use the language of the past accurately and still have the actions of the character allow us to use context clues to figure out the meanings of phrases we don’t understand. Hell, I have watched British movies and no clue what they were saying, but I was still able to infer what was happening. So, I disagree with your main point.

hegemon13 says:

Re: Re: Language of Today

There are instances where that is true. But business lingo is something the average person has much less familiarity with, let alone obsolete business lingo. You point out that the use of “negro.” That stands out because it is a major cultural shift that most people are aware of. Business lingo…not so much.

Again, it’s about balance. The language must be genuine enough to be convincing. But as fiction, it does not bear a responsibility to be real-world accurate.

Therefore, accuracy that serves the story and the atmosphere is important. Essential even. Same goes for accuracy on items that are familiar to modern culture. But “accuracy” that gets in the way of telling the story does the viewer a disservice. It’s not a documentary. It’s fiction. Story and immersion are paramount. Accuracy is far down the list, as long as the “mistakes” are not something that stands out to the viewer.

Let me ask you this: prior to the podcast, did you ever think, “I’d enjoy Mad Men a lot more if the business lingo sounded more like stuff I’ve never heard and had to decipher.”

Haywood (profile) says:

“For all the talk by some about just how evil Google’s book scanning project is”

Maybe that should be was, as with all useful projects / technology, the gate keepers have broken it. It still works a little, but when you think of what might have been…..
Humanity as a whole could have benefited so much, at an admitted price to the gate keepers, but those 19th century folks need to move aside.

Beta (profile) says:

negative space

The one that grates on my ear is “need”. Today it’s horribly overused, and I think I first noticed it becoming ubiquitous in the ’90’s.

But what about the mastodons? The phrases that should be there but aren’t, and we don’t notice because they’re extinct today? The “-wise” suffix (used to turn nouns into adverbs, coinage-wise) was a bit of newspeak that was so common on Madison Avenue it was made fun of at the time. It should be easy enough for a writer to scan some old magazines for good examples, and they could serve a useful purpose in a script, underscoring a character’s immersion in vogue, or sad devotion to a movement that has no future. (Here’s hoping that the young audiences of 2060 don’t notice the lack of “war on terror” references.)

EF says:

Prochronisms is indeed a fascinating site. I watch Mad Men regularly and I enjoy the attention to detail in dress, decor, character, etc. I never considered the language of the series before until learning Prochronism. What enlightenment Prochronism offers. Very interesting. Small digression here, there was another series that I liked specifically for its apparent authenticity of language to the period and that was “Deadwood.” I wonder how far back the Prochronism effort could measure? Great point, Mike, on the wonderful “unexpected externalities” of the Google book scanning project.
P.S. hey Phil, shut your “whole”.

Gwiz (profile) says:

But people don’t pay nearly as much attention to how business changes and especially how the language of business changes over time.

I agree, but in one particular use of business language I did notice some of the changes – the reasons for layoffs:

– “middle management reduction”
– “downsizing”
– “re-engineering”
– “outsourcing”
– “cost cutting”

and lately – “the economy”

The Original Anonymous Coward (profile) says:

I'd like to see a similar study of "Deadwood"

I seem to remember that the “Deadwood” mini-series held some sort of record for the most cuss words used per minute, or something like that, and also that these words were not actually used back in the 1800’s. Words like f*ck, c*nt, etc…

Apparently the producers did their best to keep all the other parts of the episodes as close to that time period as possible, but unfortunately for them, the words that were shockers back in those days aren’t shocking anymore, so they had to “update” the cussing.

TtfnJohn (profile) says:

Re: I'd like to see a similar study of "Deadwood"

Those words were descriptive in those days as were words like tit. As you say, cuss words that shocked back then don’t raise an eyebrow anymore in most places.

Sort of like the shocked response of young male English speakers from North America when the nice daughter of the family at the B&B asking you when you want to knock her up in the morning. 😉 Translated that means “wake you up”.

TtfnJohn (profile) says:

Forever morphable English -- And it's free!!

It’s one of the things that fascinates me the most about English is how quickly words and phrases can change their meaning and, at times, how quickly that happens. while it’s something those of us brought up in the language are used to and take in stride. Newcomers to the language have a hell of a time with that.

I’d never thought of using books as a way of preserving the lexicon of the time but it makes perfect sense. For a story to sell well it needs to speak to English speakers in the lexicon being used at the time.

Even more to the popular songs of any given period which is why I’d love to see a similar study done on song lyrics from, oh, say, 1940 to 1980 to see who words and phrases changed in that period.

We’re so used to Shakespeare that we forget that he invented words as he went along if he couldn’t find one that fit. Which may be why 500 years later studying Shakespeare by reading his plays is hopeless while to see a performance we suddenly understand a whole lot of things we didn’t just be reading it.

I do wonder about the phrase “on hold” slowly becoming “on ignore” mostly when confronted by one of those damnable choice trees that take forever to get through if we wait that long.

And let’s not get into dialect in this language. 🙂

Anonymous Coward says:

Copying Not So Easy

Getting the language wrong was a fine example of copying not being so easy as many people seem to think it is. The writers of the show were attempting to copy the feel of the period, but they could not do it, despite trying really hard. Say they re-did the show and fixed that error. There would be bound to be other errors that they did not fix.

Copying is difficult. Innovators should stop worrying about someone else swooping in and copying what they are doing. Big companies, who are the threatening copiers that everybody is worried about, cannot copy successfully. They suffer from a “not invented here” syndrome. They never put in enough work on the details. They are arrogantly convinced that they have got something right, when there are still errors. Innovators should just concentrate on getting their own execution right and ignore the possibility of copiers. That is why the patent system is misguided. It prevents copying, at huge expense, when copying is a non-problem in practice.

Leave a Reply to Liz Cancel reply

Your email address will not be published. Required fields are marked *

Have a Techdirt Account? Sign in now. Want one? Register here

Comment Options:

Make this the or (get credits or sign in to see balance) what's this?

What's this?

Techdirt community members with Techdirt Credits can spotlight a comment as either the "First Word" or "Last Word" on a particular comment thread. Credits can be purchased at the Techdirt Insider Shop »

Follow Techdirt

Techdirt Daily Newsletter

Ctrl-Alt-Speech

A weekly news podcast from
Mike Masnick & Ben Whitelaw

Subscribe now to Ctrl-Alt-Speech »
Techdirt Deals
Techdirt Insider Discord
The latest chatter on the Techdirt Insider Discord channel...
Loading...