In The Vacuum Of AI Legislation, Libraries Have The Playbook

from the always-listen-to-librarians dept

The White House AI framework made official what we already knew: this administration has no interest in regulating AI. Any legislation that contradicts the framework will be a dead end. In this regulatory vacuum, it is instructive to turn to norms developed by libraries and archives through their decades of experience working through the same core issues now animating the AI debate: understanding copyright law; providing machine access to data; contextualizing information; and adhering to responsible stewardship obligations to communities.

The Google Books Library Project is a case in point. In the mid-2000s, research libraries partnered with Google to digitize and preserve millions of volumes in their collections. To solve the problem of how to store and provide access to a massive number of scanned books, research libraries banded together to create HathiTrust, a secure, searchable repository that remains in use today. Of course, this didn’t happen without legal challenges. The Authors Guild separately sued Google and HathiTrust for copyright infringement in what came to be known as the “Google Books” cases. But those cases ultimately established the legal precedent that copying books to create a digital searchable database is fair use. That precedent is what makes research methods such as text and data mining both possible, through mass digitization, and lawful under fair use.

Building on Google Books and other litigation, libraries have put a stake in the ground when it comes to copyright law: training AI models on copyrighted works generally is fair use, a position articulated by the Library Copyright Alliance (LCA) in 2023 and updated in light of recent court decisions. In two of those decisions, Kadrey v. Meta and Bartz v. Anthropic, judges held that training AI models on copyrighted works is transformative and therefore fair use. It’s worth noting that both cases arose in a commercial context. A court would be at least as likely to rule in favor of AI uses in educational, research, and scholarly contexts, as those are favored uses under fair use.

Meanwhile, disagreements over AI safety, harm prevention, bias mitigation, and abuse have held up federal AI legislation in the US. But these are not new problems for libraries, which have developed norms to balance the collection and preservation of sensitive information in archives and special collections with the imperative to provide the broadest possible user access to digitized content. One example is the 2010 ARL principles to guide vendor/publisher relations in large scale digitization projects with special collections, which calls for libraries to make material available to the public while providing context to aid in the understanding of that material. Libraries have also developed frameworks for stewarding materials of vulnerable communities and historically marginalized groups, like the Library of Congress access policy on culturally sensitive materials relating to Indigenous peoples, which includes transparent procedures for controlled access and use of culturally sensitive materials.

Congress has also been legislating in the dark around issues like transparency and provenance in AI training, and many of the proposals we have seen so misunderstand these concepts that they threaten to bring the university-based research enterprise to a halt. Libraries already do what Congress is trying to mandate — authenticating, contextualizing, and documenting collections — but the legislation is too disconnected from this expertise, and as a result unworkable for the institutions that actually practice rigorous provenance.

As AI governance debates continue to stall on Capitol Hill, library norms offer a foundation for approaching AI training and research in a way that is responsible, steeped in library expertise, and advances the public interest.

With gratitude to Betsy Rosenblatt, Professor of Law, Case Western Reserve University Law School

Katherine Klosek is the Director of Information Policy and Federal Relations at the Association of Research Libraries.



Comments on “In The Vacuum Of AI Legislation, Libraries Have The Playbook”


Kyle K. Courtney says:

spot on!

This is a sharp analysis, Katherine. You’ve hit on the fundamental truth that while the policy world is currently scrambling to define “responsible AI,” libraries have been quietly refining the actual mechanics of ethical data stewardship (and fair use!) for decades. The HathiTrust and Google Books precedents aren’t just legal history; they are the functional blueprints for how we balance massive scale with institutional integrity.

I’m particularly glad you called out the disconnect in current legislative proposals regarding transparency and provenance. There is a real risk that Congress, in an attempt to rein in “Big Tech,” will inadvertently create a regulatory regime that is unworkable for the very research libraries that have been the most rigorous practitioners of these values.

My only caution is that as we lean into these library norms, we must ensure that library expertise guides the policy without letting the “playbook” be co-opted by commercial interests to justify licensing models that erode the permanent role of the library collection.

Grateful for your leadership on this at ARL!
