Internet Archive Fire Shows Vulnerability Of The World's Online Memory
from the taking-things-for-granted dept
The Internet Archive is a 501(c)(3) non-profit that was founded to build an Internet library. Its purposes include offering permanent access for researchers, historians, scholars, people with disabilities, and the general public to historical collections that exist in digital format.
Here's the amazing scale of the project today:
Founded in 1996 and located in San Francisco, the Archive has been receiving data donations from Alexa Internet and others. In late 1999, the organization started to grow to include more well-rounded collections. Now the Internet Archive includes: texts, audio, moving images, and software as well as archived web pages in our collections, and provides specialized services for adaptive reading and information access for the blind and other persons with disabilities.
The Internet Archive Wayback Machine contains almost 2 petabytes of data and is currently growing at a rate of 20 terabytes per month. This eclipses the amount of text contained in the world's largest libraries, including the Library of Congress.
The Internet Archive is the world's online memory, holding the only copies of many historic (and not-so-historic) Web pages that have long disappeared from the Web itself.
This morning at about 3:30 a.m. a fire started at the Internet Archive's San Francisco scanning center.
No one was hurt and no data was lost. Our main building was not affected except for damage to one electrical run. This power issue caused us to lose power to some servers for a while.
Some physical materials were in the scanning center because they were being digitized, but most were in a separate locked room or in our physical archive and were not lost. Of those materials we did unfortunately lose, about half had already been digitized. We are working with our library partners now to assess.
That loss is unfortunate, but imagine if the fire had been in the main server room holding the Internet Archive's 2 petabytes of data. Wisely, the project has placed copies at other locations:
We have copies of the data in the Internet Archive in multiple locations, so even if our main building had been involved in the fire we still would not have lost the amazing content we have all worked so hard to collect.
That's good to know, but it seems rather foolish for the world to depend on the Internet Archive always being able to keep all its copies up to date, especially as the quantity of data that it stores continues to rise. This digital library is so important in historical and cultural terms that surely it's time to start mirroring the Internet Archive in many locations around the world, with direct and sustained support from multiple governments. Those governments could also help provide the Internet Archive with a wider, more international range of content, making it an even more representative store of the world's digital activity.
Unfortunately, that's not likely to happen anytime soon, as people seem happy to take for granted the amazing work of Brewster Kahle and his team. The next best thing would be to donate so that they can continue with their indispensable project -- and perhaps create a few more backup copies.