<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/">
<channel>
<title>Techdirt. Stories filed under &quot;archive&quot;</title>
<description>Easily digestible tech news...</description>
<link>http://www.techdirt.com/</link>
<language>en-us</language>
<image><title>Techdirt. Stories filed under &quot;archive&quot;</title><url>http://www.techdirt.com/images/td-88x31.gif</url><link>http://www.techdirt.com/</link></image>
<item>
<pubDate>Mon, 27 Aug 2012 17:00:00 PDT</pubDate>
<title>DailyDirt: The Future Of Storage</title>
<dc:creator>Michael Ho</dc:creator>
<link>http://www.techdirt.com/articles/20100925/01465811164/dailydirt-future-storage.shtml</link>
<guid>http://www.techdirt.com/articles/20100925/01465811164/dailydirt-future-storage.shtml</guid>
<description><![CDATA[ Storing digital data can be <a href="https://www.techdirt.com/articles/20100907/10153310924/dailydirt-year-2525.shtml">unreliable</a> if you want it to last a really long time. But there are many ways to store vast amounts of data, and if you're not in a hurry to retrieve the data, it can be somewhat cheap to maintain an enormous amount of information nowadays. Here are just a few examples of storing LOTS of data in somewhat unconventional ways.
 
<ul>
<li> <a title="http://www.scientificamerican.com/article.cfm?id=highest-possible-resolution-color-images-achieved" href="http://bit.ly/ONvb8Y">Printing at about 100,000 dots per inch in full color has been achieved by researchers at the Agency for Science, Technology and Research (A*STAR) in Singapore.</a> As a proof of principle, a test image of <a href="http://en.wikipedia.org/wiki/Lenna">Lenna</a> was formed on a silicon wafer covered with a nanoscale metal coating. [<a href="http://www.scientificamerican.com/article.cfm?id=highest-possible-resolution-color-images-achieved">url</a>]</li>

<li> <a title="http://aws.typepad.com/aws/2012/08/amazon-glacier-offsite-archival-storage-for-one-penny-per-gb-per-month.html" href="http://bit.ly/PbtnIi">Amazon is starting to offer archival storage for just $0.01 per gigabyte per month.</a> This Glacier storage service is aimed at replacing old tape archives and geographically distinct facilities, but retrieving the data isn't so convenient: data retrieval requests can take hours (hence the name Glacier) and there's also a retrieval fee after you've accessed more than 5% of your data vault in a month. [<a href="http://aws.typepad.com/aws/2012/08/amazon-glacier-offsite-archival-storage-for-one-penny-per-gb-per-month.html">url</a>]</li>

<li> <a title="http://www.nature.com/news/dna-data-storage-breaks-records-1.11194" href="http://bit.ly/Puvbze">Geneticists in Boston have stored a 5.27-megabit book (containing 53,246 words, 11 JPG image files and a JavaScript program) on DNA.</a> This is the highest density of non-biological data ever encoded by DNA, and a gram of material could store 455 exabytes of data. [<a href="http://www.nature.com/news/dna-data-storage-breaks-records-1.11194">url</a>]</li>

</ul>


If you'd like to read more awesome and interesting stuff, check out this unrelated (but not entirely random!) <a title="http://www.stumbleupon.com/to/stumble/stumblethru:www.techdirt.com" href="http://bit.ly/fagV8c">Techdirt post</a>.<br /><br /><a href="http://www.techdirt.com/articles/20100925/01465811164/dailydirt-future-storage.shtml">Permalink</a> | <a href="http://www.techdirt.com/articles/20100925/01465811164/dailydirt-future-storage.shtml#comments">Comments</a> | <a href="http://www.techdirt.com/articles/20100925/01465811164/dailydirt-future-storage.shtml?op=sharethis">Email This Story</a><br />
 ]]></description>
<slash:department>urls-we-dig-up</slash:department>
<wfw:commentRss>http://www.techdirt.com/comment_rss.php?sid=20100925/01465811164</wfw:commentRss>
</item>
<item>
<pubDate>Mon, 20 Aug 2012 17:00:00 PDT</pubDate>
<title>DailyDirt: In The Year 2525...</title>
<dc:creator>Michael Ho</dc:creator>
<link>http://www.techdirt.com/articles/20100907/10153310924/dailydirt-year-2525.shtml</link>
<guid>http://www.techdirt.com/articles/20100907/10153310924/dailydirt-year-2525.shtml</guid>
<description><![CDATA[ Computers have made it easier and easier for people to create incredible amounts of data -- passing the <a href="http://www.emc.com/collateral/analyst-reports/idc-extracting-value-from-chaos-ar.pdf">zettabyte barrier</a> in 2010. The growth of data is actually surpassing our ability to store it, and it's a bit concerning that our ability to store digital information for long periods of time isn't too reliable (just try to access stuff on a 5.25" floppy). Here are just a few interesting links on preserving information for thousands of years.

<ul>

<li> <a title="http://news.sciencemag.org/sciencenow/2012/07/a-million-year-hard-disk.html?" href="http://bit.ly/NF0Qx6">Digitally-stored information about nuclear waste needs to be accessible many thousands of years from now.</a> Engraving the info on sapphire discs with platinum is just one proposed solution that could work for future archaeologists -- but in what language should it be written? [<a href="http://news.sciencemag.org/sciencenow/2012/07/a-million-year-hard-disk.html?">url</a>]</li>

<li> <a title="http://millenniata.com/m-disc/" href="http://bit.ly/PqbaXW">If you thought your burned CDs/DVDs lasted forever, think again.</a> But if you still want to store your data on plastic discs, there's a company (Millenniata) that sells an optical disc engraving technology for writing CDs/DVDs that work with standard CD/DVD readers -- and that claims to last for about 1,000 years (or at least hundreds of years). [<a href="http://millenniata.com/m-disc/">url</a>]</li>

<li> <a title="http://news.nationalgeographic.com/news/2012/06/120614-neanderthal-cave-paintings-spain-science-pike/" href="http://bit.ly/RhTanZ">Neanderthals were making cave paintings over 40,800 years ago in Spain.</a> How much of our art will survive the next 40,000 years? [<a href="http://news.nationalgeographic.com/news/2012/06/120614-neanderthal-cave-paintings-spain-science-pike/">url</a>]</li>

</ul>


If you'd like to read more awesome and interesting stuff, check out this unrelated (but not entirely random!) <a title="http://www.stumbleupon.com/to/stumble/stumblethru:www.techdirt.com" href="http://bit.ly/fagV8c">Techdirt post</a>.<br /><br /><a href="http://www.techdirt.com/articles/20100907/10153310924/dailydirt-year-2525.shtml">Permalink</a> | <a href="http://www.techdirt.com/articles/20100907/10153310924/dailydirt-year-2525.shtml#comments">Comments</a> | <a href="http://www.techdirt.com/articles/20100907/10153310924/dailydirt-year-2525.shtml?op=sharethis">Email This Story</a><br />
 ]]></description>
<slash:department>urls-we-dig-up</slash:department>
<wfw:commentRss>http://www.techdirt.com/comment_rss.php?sid=20100907/10153310924</wfw:commentRss>
</item>
<item>
<pubDate>Tue, 7 Aug 2012 20:06:00 PDT</pubDate>
<title>Internet Archive Enables Over 1,000,000 Torrents Of Books, Music And Movies</title>
<dc:creator>Mike Masnick</dc:creator>
<link>http://www.techdirt.com/blog/innovation/articles/20120807/13525019957/internet-archive-enables-over-1000000-torrents-books-music-movies.shtml</link>
<guid>http://www.techdirt.com/blog/innovation/articles/20120807/13525019957/internet-archive-enables-over-1000000-torrents-books-music-movies.shtml</guid>
<description><![CDATA[ We sometimes hear people say that BitTorrent as a technology is only good for infringement.  We know that's not true, but then people will point to examples of how frequently it's used for infringement.  Of course, that's meaningless when you look at both the larger picture and the nature of trends.  When new distribution technologies are introduced, it's not surprising that they're used that way because there's so little legitimate activity on the system.  But that changes over time.  Remember, when the VCR first came about, nearly all activity on it was described as "infringing" by some, because there was no legitimate content being offered.  However, obviously, over time that changed and more and more legitimate content was offered.
<br /><br />
Over the years, we've certainly seen an increase in content being offered via BitTorrent, but today there's a big addition: the Internet Archive has <a href="http://blog.archive.org/2012/08/07/over-1000000-torrents-of-downloadable-books-music-and-movies/" target="_blank">enabled over one million torrents of books, music and movies from its collection</a>.
<blockquote><i>
The Internet Archive is now offering <a href="http://archive.org/details/bittorrent">over 1,000,000 torrents</a> including our <a href="http://archive.org/details/etree">live music concerts</a>, the <a href="http://archive.org/details/prelinger">Prelinger movie collection</a>, the <a href="http://archive.org/details/librivoxaudio">librivox audio book collection</a>, <a href="http://archive.org/details/feature_films">feature films</a>, <a href="http://archive.org/details/oldtimeradio">old time radio</a>, <a href="https://archive.org/details/toronto">lots and lots of books</a>, and all new uploads from our patrons <a href="http://archive.org/details/opensource_audio">into</a> <a href="http://archive.org/details/opensource_movies">Community</a> <a href="http://archive.org/details/opensource">collections</a>&nbsp;(with more to follow).
</i></blockquote>
And while some ignorant organizations may declare that the Internet Archive is a <a href="http://www.techdirt.com/articles/20110620/01370314750/universal-music-goes-to-war-against-popular-hip-hop-sites-blogs.shtml">"rogue site,"</a> I think most people recognize that it's a wonderful repository of all sorts of legal content, much of which is now available using the rather efficient distribution technology BitTorrent.<br /><br /><a href="http://www.techdirt.com/blog/innovation/articles/20120807/13525019957/internet-archive-enables-over-1000000-torrents-books-music-movies.shtml">Permalink</a> | <a href="http://www.techdirt.com/blog/innovation/articles/20120807/13525019957/internet-archive-enables-over-1000000-torrents-books-music-movies.shtml#comments">Comments</a> | <a href="http://www.techdirt.com/blog/innovation/articles/20120807/13525019957/internet-archive-enables-over-1000000-torrents-books-music-movies.shtml?op=sharethis">Email This Story</a><br />
 ]]></description>
<slash:department>torrent-away</slash:department>
<wfw:commentRss>http://www.techdirt.com/comment_rss.php?sid=20120807/13525019957</wfw:commentRss>
</item>
<item>
<pubDate>Fri, 29 Oct 2010 07:46:40 PDT</pubDate>
<title>Archive Of Geocities Released As A 1TB Torrent</title>
<dc:creator>Mike Masnick</dc:creator>
<link>http://www.techdirt.com/articles/20101029/03055711647/archive-of-geocities-released-as-a-1tb-torrent.shtml</link>
<guid>http://www.techdirt.com/articles/20101029/03055711647/archive-of-geocities-released-as-a-1tb-torrent.shtml</guid>
<description><![CDATA[ In early 2009, Yahoo announced that it was going to <a href="http://www.techdirt.com/articles/20090423/1201564621.shtml">put Geocities out of its misery</a> and finally shut down the site entirely, even as it was still getting 11 million unique visitors per month.  Soon after the announcement, we had heard about some projects to try to archive the entire site (with some claims that it couldn't be done in time).  The actual shut down occurred almost <a href="http://www.techdirt.com/articles/20091026/1104366677.shtml">exactly a year ago</a>, and yet a group calling itself The Archive Team is apparently <a href="http://www.techradar.com/news/internet/geocities-to-be-released-as-a-torrent-904057" target="_blank">releasing its entire Geocities archive</a>, blinking flashing "under construction signs" and all, as a nearly 1 TB torrent.  They don't think they got everything, but do believe they archived "a significant percentage" of the site.
<br /><br />
It's worth reading the blog post by the folks who did this <a href="http://ascii.textfiles.com/archives/2720" target="_blank">explaining why they did it</a>, noting how little people realized that this was basically erasing digital history and culture:
<blockquote><i>
What we were facing, you see, was the wholesale destruction of the still-rare combination of words digital heritage, the erasing and silencing of hundreds of thousands of voices, voices that representing the dawn of what one might call "regular people" joining the World Wide Web. A unique moment in human history, preserved for many years and spontaneously combusting due to a few marks in a ledger, the decision of who-knows for who-knows-what.
</i></blockquote>
This is interesting on a number of levels.  It is fascinating how little most people seemed to care about the loss of Geocities.  Yes, it was quite an abandoned digital hangout for much of the past decade, but the group is right that it represented an important watershed in recognizing that <i>anyone</i> could be a content creator.  I first learned to make websites via a Geocities account (before it was even called Geocities, mind you).  And yet, Yahoo just dumped it.
<br /><br />
Of course, someone <i>could</i> make the argument that this archive is copyright infringement.  I doubt anyone will, but it is an interesting question.  The archiving is an important point in preserving digital history, and yet it's also a moment of massive copyright infringement -- technically speaking.  This is the sort of bizarrely bad result you get in a world where copyright is automatically given to any content at the moment of creation.  Most of the people creating Geocities pages would have no reason (or desire) to copyright what they created, and yet they all got it by default.
<br /><br />
Hopefully, no one decides to pursue the copyright issue in any serious manner.  In the meantime, we'll leave you with the parting words of The Archive Team:
<blockquote><i>
While it's quite clear this sort of cavalier attitude to digital history will continue, the hope is that this torrent will bring some attention to both the worth of these archives and the ease at which it can be lost -- and found again.
</i></blockquote><br /><br /><a href="http://www.techdirt.com/articles/20101029/03055711647/archive-of-geocities-released-as-a-1tb-torrent.shtml">Permalink</a> | <a href="http://www.techdirt.com/articles/20101029/03055711647/archive-of-geocities-released-as-a-1tb-torrent.shtml#comments">Comments</a> | <a href="http://www.techdirt.com/articles/20101029/03055711647/archive-of-geocities-released-as-a-1tb-torrent.shtml?op=sharethis">Email This Story</a><br />
 ]]></description>
<slash:department>archiving-history</slash:department>
<wfw:commentRss>http://www.techdirt.com/comment_rss.php?sid=20101029/03055711647</wfw:commentRss>
</item>
<item>
<pubDate>Fri, 7 May 2010 02:17:25 PDT</pubDate>
<title>Library of Congress Responds To Privacy Gripes By Making Twitter Archive Less Useful</title>
<dc:creator>Karl Bode</dc:creator>
<link>http://www.techdirt.com/articles/20100503/1024339281.shtml</link>
<guid>http://www.techdirt.com/articles/20100503/1024339281.shtml</guid>
<description><![CDATA[ <p>We <a href="http://www.techdirt.com/articles/20100414/1346519014.shtml">recently noted</a> how the Library Of Congress and Twitter made a big deal of the fact that the LOC would now be archiving Twitter conversations. The idea is that researchers may find a mountain of largely-incoherent blathering about the <em>Twilight</em> films useful in providing context as they try to piece together events of the twenty-first century. Late last week a little more detail of the archiving process leaked out, the LOC saying that in response to privacy complaints they wouldn't store deleted tweets, and they'd also be placing all tweets under embargo for a period of six months (for whatever that's supposed to accomplish). For good measure, they're also apparently <a href="http://www.nytimes.com/2010/05/02/business/02digi.html">locking down the Twitter archive to &quot;qualified researchers&quot; and considering further restrictions</a>:</p><blockquote>&quot;<em>The library will embargo messages for six months after their original  transmission. If that is not enough to put privacy issues to rest, she  said, &ldquo;We may have to filter certain things or wait longer to make them  available.&rdquo; The library plans to dole out its access to its Twitter  archive only to those whom Ms. Anderson called &quot;qualified researchers</em>.&quot;</blockquote><p>The historical dangers of &quot;filtering certain things&quot; aside, what's the point of these restrictions? The entire archive is already being indexed by Google in real time, <a href="http://www.google.com/search?hl=en&#038;tbo=1&#038;tbs=mbl:1&#038;esrch=RTReplay&#038;q=obama">with no delays or restrictions</a> -- and other copies are being doled out to companies like Microsoft and Yahoo. In an age where phone companies are feeding every single byte of data in  real time to the NSA with questionable oversight -- worrying too much  about the storage of your clever Twitter barbs seems to be <a href="http://www.google.com/search?hl=en&#038;tbo=1&#038;esrch=RTReplay&#038;tbs=mbl%3A1&#038;q=missing+the+forest+for+the+trees&#038;aq=f&#038;aqi=g1g-m1&#038;aql=&#038;oq=&#038;gs_rfai=">missing   the forest for the trees</a>. The Library of Congress appears to have responded to these largely-senseless privacy concerns (you are communicating using a <strong>public</strong> service) by making the government's Twitter archive more annoying to use. Though hey, if the apocalypse manages to decimate every other copy of the Twitter archive -- you can sleep well knowing that you'll still be able to dig through <a href="http://twitter.com/ogochocinco">OchoCinco's insights</a> at The Library Of Congress with a laminated community college ID.   </p><br /><br /><a href="http://www.techdirt.com/articles/20100503/1024339281.shtml">Permalink</a> | <a href="http://www.techdirt.com/articles/20100503/1024339281.shtml#comments">Comments</a> | <a href="http://www.techdirt.com/articles/20100503/1024339281.shtml?op=sharethis">Email This Story</a><br />
 ]]></description>
<slash:department>qualifed-researcher-of-inane-arcana</slash:department>
<wfw:commentRss>http://www.techdirt.com/comment_rss.php?sid=20100503/1024339281</wfw:commentRss>
</item>
</channel>
</rss>