Anonymous Coward

My thought is that this looks more like a fishing expedition, trying to find alternate versions, previous versions, or even comments that might have been inserted during the review process.

It sets a very ugly precedent.

Anonymous Coward

February 17, 2011 at 3:29 am

Re: Re:

Ugly for people trying to hide something.

Anonymous Coward

February 17, 2011 at 4:03 am

Re: Re: Re:

No, ugly for anyone working on a document in it’s development stages. It will limit people’s desire to make comments, to offer ideas, or to disagree with anything in the document because they will not want their objection to be part of the metadata.

It is a huge negative to the entire process.

Anonymous Coward

February 17, 2011 at 4:52 am

Re: Re: Re: Re:

If people are afraid to do what is right and have no courage to voice those disagreements then I see no problem with them being silent, this is above all very true for the government.

Besides they can always comment anonymously using internal pseudonyms or other internal references.

Anonymous Coward

February 17, 2011 at 5:09 am

Re: Re: Re:² Re:

But it’s not fair for the government!

Anonymous Coward

February 17, 2011 at 8:56 am

Re: Re: Re:³ Re:

Have faith. Transparency is fair.

Michael (profile)

February 17, 2011 at 5:12 am

Re: Re: Re: Re:

“It is a huge negative to the entire process”

What is a huge negative to the process? Making the process itself transparent? Are they really throwing out comments in these documents that are not appropriate for public consumption?

Anonymous Coward

February 17, 2011 at 5:37 am

Re: Re: Re: Re:

Tough shit. Have some cojones. If your objections are good and they are discovered in the metadata, you might save yourself from a prison sentence or a the very least being associated with a crappy law.

Anonymous Coward

February 17, 2011 at 8:17 am

Re: Re: Re:² Re:

Apparently you have never been in the room when legal theories and possibly courses of action are discussed. It’s sort of like the comments here, all over the road.

Requiring full disclosure on every step and every part of the document, no matter what the end result is very likely to stop people from making suggestions, or at least stop them from using official channels to do so.

It’s the same logic as Piracy. Block protocol X, and they try a new one with more covering and more privacy. If you buy that logic, you know exactly what the government people are going to do in the future: Use undocumented ways of discussing and working on product so their intermediate workflow stuff cannot be easily obtained.

velox

February 17, 2011 at 8:51 am

Re: Re:

“this looks more like a fishing expedition”

The discovery phase of any lawsuit these days is almost always a “fishing expedition”.
That’s what lawyers do.

If you don’t like it, stop using software that hides a bunch of metadata.
For some reason, software engineers have this desire to store a bunch of metadata inside working files, and it rarely provides serves a purpose or benefit to the enduser.
MS Office is one of the worst, but it’s a widespread issue.

Rekrul

February 17, 2011 at 3:35 am

As for the unsearchable format, the judge slammed ICE for clearly going out of its way to make the document “more difficult or burdensome for the requesting party to use,” in violation of standard discover rules.

My first thought is that they simply scanned the documents as images and then compiled those images into a PDF. I see this done all the time. It’s faster than using OCR and then cleaning up the hundreds of mistakes that the software makes, however, images aren’t searchable.

John Duncan Yoyo

February 17, 2011 at 5:08 am

Re: Re:

Presumably ICE has an electronic version of this document somewhere in the system. Giving the rescan into a PDF would have had the same effect as a printout.

I would love all laws to come with meta data. This pork is identified with this congressman.

abc gum

February 17, 2011 at 6:09 am

ice ice baby

Shon Gale (profile)

February 17, 2011 at 6:11 am

Adobe will love that! Make it into a pdf to make it harder to use!

Borax Bob

February 17, 2011 at 7:38 am

Maybe Not Intentional

Obviously I have no idea if this is true or not, but where I work, probably over 50% of the people, who have a PDF “printer” installed on their computer of one sort or another, will still print out something, and then walk over and scan it. Even large documents. Heck, They have a 80 page change to a 400 page policy they put out a year ago; the original policy was searchable, the changes to it aren’t and the scan isn’t even all that good. It looks like someone wanted to highlight parts of the changes, so they printed it out, used a highlighter to do the highlighting, and then scanned it. I guess it makes sense then that the page numbers don’t match, the “Changes” appear to actually have been made to the entire original document, which is distributed electronically, why only the changes were put out, I have no idea. I was assured the changes were temporary, but it’s been over a year….
My point is, it might be (or, since it’s the Federal Government, it probably is) just plain incompetence on someone’s part.

Rabbit80

February 17, 2011 at 7:48 am

DMS

I work with in a scanning bureau and with various document management systems. We get sent tons of paperwork (mostly typed) for scanning which is presumably no longer available in its original (ie Word etc) form.

Some of the document management software we use will not produce searchable PDFs. The images are stored as single page TIFF with an accompanying XML or TXT file – not much use to anyone in that format! To make the PDFs searchable, they have to be run through a separate OCR process after exporting.

This software is also capable of redacting the image files before exporting them.

Its pretty easy to see why non-searchable PDFs may have been given.

Rabbit80

February 17, 2011 at 7:52 am

Re: DMS

I forgot to mention. We do not make PDFs searchable when scanning unless we are specifically asked to do so. It is a time consuming process (2 seconds per page. We can scan 80000+ pages per day) that can cause the processing PCs to crash frequently.

Anonymous Coward

February 17, 2011 at 8:02 am

Usually pretty easy to make a PDF searchable as long as it is in the form of an electronic file (unless the conversion gives you the dreaded “this document contains non-renderable text”). Redactions also create problems.

If a PDF is printed to paper, conversion via OCR is a nightmare.

I presume the FOIA requestor will hereafter make sure to request electronic copies of the originals so that metadata is preserved, but making them searchable is problematic as noted above. This would have at least one benefit. Printouts cannot be tossed over the transom as a compendium of several thousand pages that do not clearly demarcate where one file ends and another begins.

mischab1

February 17, 2011 at 10:01 am

If it’s anything like my company, half the meta data is meaningless anyway. Someone wants to copy the formatting style of a document so they copy an existing file and replace all the text with their own. (On a completely different subject.) But they don’t know anything about meta data so the file still lists as Author the person who clicked File -> New 5+ years ago.

Wednesday
15:39	Universal Music's Copyright Claim: 99 Problems And Fair Use Ain't One (12)
13:35	Techdirt Podcast Episode 388: Copyright Conundrum (1)
12:05	Biden Signs TikTok Ban Bill; Expect A Lawsuit By The Time You Finish Reading This Article (50)
10:50	DeSantis Signs Law Limiting Book Challenges After The Shitty People He Encouraged To Be Shitty Proved To Be Even Shittier Than He Thought They'd Be (34)
10:45	Daily Deal: The Premium Python Programming PCEP Certification Prep Bundle (0)
09:31	FTC Bans Non-Competes, Sparks Instant Lawsuit: The War For Worker Freedom (21)
05:31	Grindr Hit By UK Lawsuit For Reckless Sale Of Sensitive User Data (1)
Tuesday
20:00	David Chang Issues C&Ds Over 'Chile Crunch' Products, Then Apologizes And Promises To Stop (3)
15:34	Because It's Done Such A Great Job Policing Illegal Drugs, The DEA Decides It's Time To Start Engaging In Legal Drug Hysteria (24)
13:38	When You Need To Post A Lengthy Legal Disclaimer With Your Parody Song, You Know Copyright Is Broken (26)

Court Says Metadata Should Be Released Under Freedom Of Information Act Request

from the commence-metadata-scrubbing dept

Comments on “Court Says Metadata Should Be Released Under Freedom Of Information Act Request”

Re: Re:

Re: Re: Re:

Re: Re: Re: Re:

Re: Re: Re:² Re:

Re: Re: Re:³ Re:

Re: Re: Re: Re:

Re: Re: Re: Re:

Re: Re: Re:² Re:

Re: Re:

Re: Re:

Maybe Not Intentional

DMS

Re: DMS

Add Your Comment Cancel reply

Comment Options:

What's this?

The Techdirt Greenhouse

Trending Posts

Wednesday

Tuesday

More

Tools & Services

Company

Contact

More

Court Says Metadata Should Be Released Under Freedom Of Information Act Request

from the commence-metadata-scrubbing dept

Comments on “Court Says Metadata Should Be Released Under Freedom Of Information Act Request”

Add Your Comment Cancel reply

Comment Options:

What's this?

Techdirt Daily Newsletter

The Techdirt Greenhouse

Trending Posts

Wednesday

Tuesday

More

Email This Story

Tools & Services

Company

Contact

More