Future Boy

December 10, 2008 at 9:37 am

“I own” ownership seems like a dumb concept…

I’m amazed we humans got this far while carrying such incredible intellectual garbage around.

In my time, we have a resource based economy: almost everything we need is produced en masse via automation. The cost (in time and resources–not “money”) of most necessities is so close to zero that nobody worries about it.

Anonymous Coward

December 10, 2008 at 9:56 am

Libraries entered the data

The idea behind OCLC is that individual libraries actually enter the data. That way the data only has to be entered one time instead of every library repeating the effort. The work can be checked by other libraries, thereby giving every member library access to a catalog that is reasonably accurate. Everything depends on a cooperative spirit. That is what makes this information grab so odd to me. I just can’t see libraries going along with it.

Doug

December 10, 2008 at 11:42 am

Re: Libraries entered the data

Actually, this is not quite the case. I used to work at OCLC, and have friends and acquaintances who still work there. Many of them are doing nothing more than looking at stuff (books, maps, recordings, etc.) that library X sends them, determining the cataloging information (including supplying the Dewey Decimal number), and entering the data. For some, they specialize in materials written in certain, sometimes obscure, languages (such as Arabic, Welsh, or Ancient Greek). For others, they are more of a generalist, or specialize in other ways. But almost every one has a MLS or better, and they all determine the data and enter it. But this means that Future Boy’s idea that the cost in time/resources being close to zero is sometimes far off the mark, given that some of these folks doing the cataloging work are specialists who do hours of research just to figure out where an item goes in the great scheme of things. It is not just glancing at it for 30 seconds, saying that it will be with a catalog number of 521.075 instead of 520.684 (just pulling numbers out of the air). Instead, each item can potentially cost hundreds of dollars or more to catalog.

Now, regarding the ownership of the data, sidestepping the whole ownership argument for a second. I would say that if OCLC cataloged the item and entered the data, then the rights to records for the item should depend on the contract between the “library” and OCLC in effect at the time, for that item. But if the “library” entered it themselves, then the determination of the rights should belong to them rather than OCLC. But I would also agree that were it not from the fact that OCLC and the libraries spends money to catalog the items, it would be better to have the data in the public with the idea that you profit not from the data itself but instead profit with providing the service. And, there are ways to work around even that last bit.

As for OCLC’s unilateral change… yes, it probably will very much harm them in the short and long run. This is probably just the first echos heard here, and not the last echos heard anywhere to say the least.

Jesse

December 10, 2008 at 10:21 am

That bit of viral code seems interesting…if you duplicate the data, do you duplicate the ownership? Could the libraries just keep a copy of the data to maintain ownership then? Then the OCLC can do what it wants with its copy and individual libraries can do what they want…I don’t understand how this code changes anything.

Rob

December 10, 2008 at 10:49 am

Greed - yet again

The only rhyme or reason to any of this is the one thing that remains a constant in America – Greed. Something catches their eye and they think, “Hey, I can make a buck off this. It doesn’t even matter if I have a right to.” and off they go. They will not stop unless they are stopped. Libraries – take them to court and set a precedent so this junk stops.

The libraries are the ones that created the databases and populated them, this group is really only maintaining it and keeping it accessible. Are things a company makes now “owned” by the janitor?

hegemon13

December 10, 2008 at 11:19 am

Isn't this claiming ownership of facts?

The quantity of a particular book in a library’s collection is a fact. A list of those quantities is a list of facts. I can see how OCLC could claim ownership of their presentation of a library’s collection. However, isn’t claiming ownership of the actual data the same as the MLB claiming ownership of stats?

The whole concept of a database right is outrageous. Facts are facts; they aren’t created. To create another, similar example, should a library or video rental store be able to have a “collection right,” so that other libraries/stores could not have the same or very similar collection to theirs? Simply putting facts together in one place does not denote creation. The copyrightable parts of a database should be restricted solely to the table layout (iffy), interface, and reports, charts, and other presentations of the data.

twitter (user link)

December 10, 2008 at 3:14 pm

It's the CDDB all over again.

They don’t want to own the database, they want to control who can research. Protect your right to read by protecting your right to share.

Anon2

December 10, 2008 at 8:03 pm

bizarre

This all seems a bit bizarre to me, though Doug’s comment helps shed some light. Seems that there are several dimensions to the problem. On the one hand, it’s incredibly difficult for me to imagine any justification for claiming a proprietary interest in data that merely reflects facts such as which library holds copies of which work. To the extent that libraries are compiling the initial catalogs and simply exporting that data into a central, metadatabase, it doesn’t make sense even under the current IP legal structure. OTOH, if this organization is actually involved in creating a useful way to organize, index and search that data, and is providing the backbone to run that system, it is not simply compiling and re-transmitting data; it is contributing something creative and transformative to the process, and offering a new and valuable tool to researchers.

Leaving aside how you characterize the resources it invests in having staff whose expertise can assist libraries in properly cataloguing the works in their collection, I think there is some justification for it to expect some return on its investment. Some of that can, and probably should, come from the libraries, who are receiving a valuable service. But some ought to come from researchers (and more importantly in the case of most academic research, their institutions), who are utilizing the system to make their work more efficient and fruitful. Having still some dim recollection of my senior year in college, at a huge research university that even with its vast library did not hold the kinds of materials I needed to do my senior honors thesis, and the ridiculous amount of time I had to put in making phone calls, writing letters (no email back then), and ultimately having to travel to depositories I identified as likely prospects for the primary materials I needed, I would gladly have paid some small sum — or better, seen it included in my tuition or my annual student fees (whatever they were for) — rather than spend hundreds of hours just locating the materials I needed.

But nobody has to assert ownership in the raw data to accomplish this. If they’ve really built a better mousetrap, then I would think it a fantastic product to market to colleges, universities, private research institutions, corporations, pretty much anywhere real research is being done. And if I personally wanted to really track some stuff down for my own research, I would no doubt pay some reasonable fee for shorter-term access to the data, the same as my law firm does to access privately owned databases that do a heck of a better job indexing, cross indexing and providing other value-added aspects that make the task of doing legal research vastly more efficient and comprehensive (and ultimately, accurate) than the free databases out there, let alone the nightmare it used to be when we all still used difficult, dense and inevitably outdated digests to find the books we needed, and even more arcane sets of books to check whether the cases, statutes and regs we wanted to rely upon were still good law.

This seems to me to come down to the difference between bare compilation of facts and the manipulation and enhancement of those facts in ways that render the compilation transformative and useful.

John Jackson (user link)

December 11, 2008 at 8:44 am

Re: bizarre

Because it seems there are a lot of comments from non-librarians, I just wanted to give an example of the type of data we are talking about. It’s more than just a record of holdings. Catalogers create a huge amount of information for each item the library holds. For example, here is the record data for an edition of David Weinberger’s Everything is Miscellaneous:

And this doesn’t include everything, only the fields that the patrons see. Catalogers (of which I am not) know what each of these MARC numbers mean (i.e. 650 = subject headings). This record was first created by the Library of Congress, later touched by a vendor, then by my institution. OCLC never had a hand in it. So who owns the data? We all had a part in creating it.

Ben

December 11, 2008 at 6:26 am

And who compiled the data?

Let’s not forget that these library records were compiled first by the library’s employees, some of them as volunteers who did not think their charitable efforts were for corporate profit. Also, libraries are funded by taxpayers. So this landgrab is to get free information backed by local government funding without a contract.

This won’t get far if it goes to court.

Saturday
12:00	This Week In Techdirt History: May 10th - 16th (0)
Friday
19:39	Developer Promises To Keep Failed Online Game Servers Up: Art Deserves To Be Preserved (2)
15:24	Why The US Can't Adopt Ukraine's Innovative Approach To Unmanned Warfare Systems (14)
13:27	Let’s Help Children, Not Trial Lawyers (7)
11:03	Appeals Court Upholds Block Of ICE's BS 'Seven Day Notice' Detention Center Inspection Policy (3)
10:58	Daily Deal: Babbel Language Learning (All Languages) (0)
09:24	Trump's $10 Billion IRS Lawsuit May Become a $1.7 Billion Slush Fund for MAGA's Self-Proclaimed Victims (1)
05:30	Bari Weiss Let Benjamin Netanyahu Pick His Own Softball Interviewer (11)
Thursday
20:15	HHS Is A Chaos Engine: Marty Makary Out At FDA (8)
15:22	Congress Narrowed The GUARD Act, But Serious Problems Remain (1)

Landgrab For Ownership Of Library Catalog Data

from the not-good dept

Comments on “Landgrab For Ownership Of Library Catalog Data”

Libraries entered the data

Re: Libraries entered the data

Greed - yet again

Isn't this claiming ownership of facts?

It's the CDDB all over again.

bizarre

Re: bizarre

And who compiled the data?

Add Your Comment Cancel reply

Comment Options:

What's this?

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Saturday

Friday

Thursday

More

Tools & Services

Company

Contact

More

Landgrab For Ownership Of Library Catalog Data

from the not-good dept

Comments on “Landgrab For Ownership Of Library Catalog Data”

Add Your Comment Cancel reply

Comment Options:

What's this?

Techdirt Daily Newsletter

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Trending Posts

Saturday

Friday

Thursday

More

Email This Story

Tools & Services

Company

Contact

More