Open Access Faces Many Problems; Here's One That The Indispensable Internet Archive Is Helping To Solve

from the now-would-be-a-good-time-to-make-a-donation dept

Mon, Oct 5th 2020 03:45pm - Glyn Moody

As Techdirt has reported many times, open access is a self-evidently great idea, but one that is still beset with many problems. That’s not least because academic publishers are keen to remain in control of any transition to open access, and aim to maintain their extremely high profit margins whatever the publishing model. But there’s one problem for open access that ironically derives from its greatest strength — the fact that anyone can access journals at any time, for free. Because material is always available, librarians have tended not to worry about making some kind of backup. That’s not the case for traditional journals, where there is potentially a big problem if a subscription is cancelled. The end of a subscription often means that readers lose their existing access to journals. To address this, librarians have come up with a variety of ways to ensure “post-cancellation access“, explained well in a 2007 post on a blog about digital preservation, written by David Rosenthal. A recent article on the Internet Archive site provides some interesting statistics on the scale of the problem of creating permanent copies of open access titles:

Of the 14.8 million known open access articles published since 1996, the Internet Archive has archived, identified, and made available through the Wayback Machine 9.1 million of them… In the jargon of Open Access, we are counting only “gold” and “hybrid” articles which we expect to be available directly from the publisher, as opposed to preprints, such as in arxiv.org or institutional repositories. Another 3.2 million are believed to be preserved by one or more contracted preservation organizations, based on records kept by Keepers Registry… These copies are not intended to be accessible to anybody unless the publisher becomes inaccessible, in which case they are “triggered” and become accessible.

This leaves at least 2.4 million Open Access articles at risk of vanishing from the web… While many of these are still on publisher’s websites, these have proven difficult to archive.

That’s a pretty serious problem, and one which the Internet Archive is taking steps to address, for example by trawling through the petabytes of Web content that it has built up since 1996. There’s an editable catalog with an open API that aims to provide “Perpetual Access to Millions of Open Research Publications From Around The World”. Internet Archive has also created a full-text search index to over 25 million research articles and other scholarly documents.

Although few people are aware of this project, it is vital work. There is little point publishing open access titles, theoretically available to all, if their holdings simply disappear at some point in the future. The Internet Archive’s copies will ensure that doesn’t happen. They are yet another indication of the invaluable and unique role the site plays in the online world. Without it, we would already have lost so much of the amazing material that was once online, but which has since vanished except for the copies held by the Wayback Machine. Another good reason to support this incredible, free resource financially, and to help defend it from incredibly selfish attacks by publishers.

Comments on “Open Access Faces Many Problems; Here's One That The Indispensable Internet Archive Is Helping To Solve”

Pixelation

October 5, 2020 at 9:36 pm

What's in a name

When I hear Open Access, I don’t immediately know what that is. I wonder if a name change would make a difference? I guess, when I hear Open Access, I think, "Open Access to what?" Perhaps I’m being pedantic.

Brewster Kahle (user link)

October 6, 2020 at 8:33 am

Open Access institutions are building a new ecosystem

Thank you for the hat-tip to the Internet Archive and the project to support the commons– There are many of us supporting open access.

When materials are open access, then institutions can more easily cooperate because we do not need NDA’s, contracts, lawyers, firewalls, etc.

Open Access journal articles are cited more and apparently read more, so this is a great way to move forward. The Internet Archive can serve the role of backup, but also bulk access for researchers.

I am looking forward to the meta-science, the science of studying scholarly output, that can be more easily done because the materials are publicly accessible and in bulk.

-brewster

Samuel Abram (profile)

October 6, 2020 at 12:14 pm

Internet Archive under siege

The very facts that

the publishers are likely to win

is why I donate as much money as I can to them (as well as legally upload all I can as well). The Internet Archive is far too valuable a resource to perish.

jersey111

January 25, 2021 at 6:39 pm

Re: Internet Archive under siege

The lawsuit has no real actual merit, and the publishers know this. The Internet Archive is going to be fine, judging by recent events.

Crafty Coyote

October 6, 2020 at 3:27 pm

If copyright infringement is the same as stealing a car, then the Internet Archive is the dealership that has an infinite number of cars in its lot, and is currently asking for people to send in more and "steal" more.

Add Your Comment

Friday
19:39	Trump Fires Court-Appointed US Attorney One Hour After Appointment, Immediately Gets Sued (0)
15:14	DOJ Withdraws NY Times Subpoenas After Judge Notices It Never Bothered To Follow The Rules For Subpoenaing Reporters (2)
13:07	Ctrl-Alt-Speech: Live At TrustCon 2026 (0)
11:09	Election Commission Says Musk Likely Broke The Law By Paying Voters. Will Anyone Do Anything About It? (15)
11:04	Daily Deal: The Complete Raspberry Pi And Alexa A-Z Bundle (0)
09:25	Administration Accelerating Immigration Hearings To Ensure Migrants Miss New Court Dates (10)
05:22	Brendan Carr Lobs More Empty Threats At ABC For Not Airing Trump's Election Fraud Lies (7)
Thursday
20:05	Caleb Williams, George Gervin, An 'Iceman' Trademark And Insulated Boots...Oh My? (8)
15:40	“Digital Colonialism”: U.S. Demands To Access Africans’ Data Raise Privacy, Sovereignty Concerns (17)
13:20	The FTC’s National Nanny Returns: AI Edition (3)

Open Access Faces Many Problems; Here's One That The Indispensable Internet Archive Is Helping To Solve

from the now-would-be-a-good-time-to-make-a-donation dept

Comments on “Open Access Faces Many Problems; Here's One That The Indispensable Internet Archive Is Helping To Solve”

What's in a name

Open Access institutions are building a new ecosystem

Internet Archive under siege

Re: Internet Archive under siege

Add Your Comment Cancel reply

Comment Options:

What's this?

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Friday

Thursday

More

Tools & Services

Company

Contact

More

Open Access Faces Many Problems; Here's One That The Indispensable Internet Archive Is Helping To Solve

from the now-would-be-a-good-time-to-make-a-donation dept

Comments on “Open Access Faces Many Problems; Here's One That The Indispensable Internet Archive Is Helping To Solve”

Add Your Comment Cancel reply

Comment Options:

What's this?

Techdirt Daily Newsletter

Get all our posts in your inbox with the Techdirt Daily Newsletter!

The Techdirt Greenhouse

Friday

Thursday

More

Email This Story

Tools & Services

Company

Contact

More