Online Facial Recognition Service Caught Cruising Through Graveyards To Fill Its Database

from the look-we-only-did-it-during-business-hours dept

Not literally, of course. Let’s get that out of the way. The company was not sending people out to dig up bodies to take photos to add to its facial recognition database. I mean, how would that even work. Not only that, but very few desiccated corpses utilize subscription-based facial recognition services. It’s not a workable business model.

Neither is this, though. This really shouldn’t be happening. Here’s a far more realistic depiction of what happened, as reported by Lydia Morrish for Wired.

Ancestry.com isn’t the only site that Scarlett checks regularly. In February 2022, the facial recognition search engine PimEyes surfaced non-consensual explicit photos of her at age 19, reigniting decades-old trauma. She attempted to get the pictures removed from the platform, which uses images scraped from the internet to create biometric “faceprints” of individuals. Since then, she’s been monitoring the site to make sure the images don’t return.

In January, she noticed that PimEyes was returning pictures of children that looked like they came from Ancestry.com URLs. As an experiment, she searched for a grayscale version of one of her own baby photos. It came up with a picture of her own mother, as an infant, in the arms of her grandparents—taken, she thought, from an old family photo that her mother had posted on Ancestry. Searching deeper, Scarlett found other images of her relatives, also apparently sourced from the site. They included a black-and-white photo of her great-great-great-grandmother from the 1800s, and a picture of Scarlett’s own sister, who died at age 30 in 2018. The images seemed to come from her digital memorial, Ancestry, and Find a Grave, a cemetery directory owned by Ancestry.

That would be the unfortunate findings of software engineer Cher Scarlett who, as the Wired article points out, has had previous traumatic run-ins with the Poland-based PimEyes. (PimEyes has offered a pretty defensive response to that reporting on its blog.)

PimEyes is, at best, controversial. It provides a for-pay service that allows users to perform reverse images on photos of themselves to discover where else these photos may have been posted on the internet. PimEyes is basically Clearview, except for regular people (although cops use it too).

Unlike Clearview, PimEyes positions itself as a service that only allows users to search for images of themselves, rather than just upload a photo of anyone and send PimEyes digging through its database of info scraped from publicly accessible websites. Obviously, internet sleuths seeking to identify people who participated in the January 6th insurrection weren’t uploading photos of themselves. The potential for abuse (which has been previously realized) is always present. And PimEyes’ landing page doesn’t exactly deter people from assuming this is a reverse image search they can engage by uploading any photo.

This particular scraping is the sort of thing you don’t expect, even from controversial services that make webscraping part of the business model. However, PimEyes says the scraping of Ancestry — which violated Ancestry’s terms of service — was a mistake.

Giorgi Gobronidze, PimEyes’ director, tells WIRED: “PimEyes only crawls websites who officially allow us to do so. It was … very unpleasant news that our crawlers have somehow broken the rule.” PimEyes is now blocking Ancestry’s domain and indexes related to it are being erased, he says.

It never should have happened in the first place. This may have been a mistake, but mistakes like this matter when your company has already generated a lifetime of privacy and security concerns. Sure, people can opt out. But only the ones who are still living.

Filed Under: , ,
Companies: ancestry.com, pimeyes

Rate this comment as insightful
Rate this comment as funny
You have rated this comment as insightful
You have rated this comment as funny
Flag this comment as abusive/trolling/spam
You have flagged this comment
The first word has already been claimed
The last word has already been claimed
Insightful Lightbulb icon Funny Laughing icon Abusive/trolling/spam Flag icon Insightful badge Lightbulb icon Funny badge Laughing icon Comments icon

Comments on “Online Facial Recognition Service Caught Cruising Through Graveyards To Fill Its Database”

Subscribe: RSS Leave a comment
16 Comments
Anathema Device (profile) says:

Serious question – is this vastly different in effect from what Google Image Search does?

And while it may be creepy, does it matter as a privacy rights thingie that dead people’s images are scraped?

Archaeologists routinely disinter corpses that were lovingly and religiously (according to the culture) buried, and there is no wholesale outcry about that violation (outside of Indigenous activists who are rightly angry about the way the material is kept in museums.)

Are you worried about it being a slippery slope to something worse, or am I missing the point somewhere?

That Anonymous Coward (profile) says:

Re:

If a website has a robots.txt that tells google to not index the site, google doesn’t index the site.

I am doubtful that Ancestry has terms that are very welcoming to those who just want to scrape a bunch of images and data to profit from.

If you are standing in front of your window naked and George Google walks by he can take a picture.

If you are standing in front of your window, with the shades drawn, and Pime Ye slips a straightened coat hanger inbetween the weather stripping to move the shade so he can take a picture its very different.

Imagine an archaeologist walking into the Smithsonian, walking into an exhibit space & taking credit for someone elses hard work and effort so he can get a grant to discover someone elses display.

Head Kangaroo (profile) says:

Re: Re: ToS

This is not a google or other search engine indexing question. The sites have to be indexed. But, according to their ToS, the content can only be used with their services. So, if Pinkeyes is actually scraping content from those sites, they are violating the ToS.

Both Ancestry and its subsidiary, FindaGrave, are public facing sites. But, the content is owned by their users and is protected by the ToS. Scraped photos may also be covered under copywriter protection, as well as, authored documents in jpg or png format.

I would think Ancestry would take action on the scraping. I informed them today. We will see what they do.

Head Kangaroo (profile) says:

Re: Incorrect

Ancestry and FindaGrave are genealogical sites. Photos of living people have to be by their permission. 99.999% of the photos are of non living people. The ToS of those sites restrict photo use to within the services of those sites.

Photos on those sites are not just of people, there are also numerous documents in photo format.

If in fact Pinkeyes is scraping those two sites, I am pretty sure that those sites will fix that vulnerability and take action. I informed Ancestry today concerning what is going on and they are looking into it.

Anonymous Coward says:

A database containing facial information of the public would only be of value to a facial recognition effort for a limited duration, right?

Ones face does seem to change a bit with age, I guess this database is going to be huge. Who is paying for all that silliness? You are of course.

Am I going to need facial recognition insurance? What will happen to the average citizen when this system falsely accuses them of criminal activity? It’s not like the average citizen has access to resources needed in their defense. Lately the crazies have stepped up attacks upon the public defenders office. Guess it’s off to jail then – gotta get those low income workers somehow.

Head Kangaroo (profile) says:

Re: Yes and No

I believe that you are correct that current photos of yourself would only be relevant and of a limited duration. Having looked at the Pinkeyes site, there is no way their opt out function is useful, and you also have to pay for it.

Assuming that Pinkeyes is scrapping photos on Ancestry and FindaGrave, there is a lot of content on those two sites that are in photo format but are not photographs. There are millions of documents in jpg and png format resident on those sites, some may be sensitive, i.e., birth, death, marriage and adoption records. All this stuff is IAW their ToS not to be used outside of their services. So, Pinkeyes is violating the ToS on both sites.

That Anonymous Coward (profile) says:

“very unpleasant news that our crawlers have somehow broken the rule”

So you’re incompetent or a really shitty liar?
You found a source of images that you could tap and up until you got caught it was a great plan.

You can claim you purged the offending images you gathered, but you couldn’t give your crawler instructions it would obey, you have no credibility.

Head Kangaroo (profile) says:

Pinkeyes Response

Pinkeyes seems to deny the Techdirt article in this response to me.

Dear user,

We do not possess any of your personal data. PimEyes is a search engine that can detect images of your likeness on the open web. We do not have any photo material in our database, nor do we own the copyright to it. Like any other search engine, such as Google, Bing, etc., PimEyes assists users in accessing information that is available online from already accessible sources. PimEyes searches only those web sites that officially allow data scraping and never searches for social media or other user-restricted online sources. Therefore, PimEyes cannot be legally responsible for the content published by administrators of other web sources, nor can we be responsible for the actions of third parties who might theoretically misuse our service. To prevent this, our company has elaborated a complex set of terms of service and privacy policies that directly correspond to existing privacy regulations; a violation of our terms of service is therefore also a violation of existing privacy regulations. If you have any further questions, please contact office@pimeyes.com as this is primarily a support channel.

Best Regards, PimEyes

Add Your Comment

Your email address will not be published. Required fields are marked *

Have a Techdirt Account? Sign in now. Want one? Register here

Comment Options:

Make this the or (get credits or sign in to see balance) what's this?

What's this?

Techdirt community members with Techdirt Credits can spotlight a comment as either the "First Word" or "Last Word" on a particular comment thread. Credits can be purchased at the Techdirt Insider Shop »

Follow Techdirt

Techdirt Daily Newsletter

Subscribe to Our Newsletter

Get all our posts in your inbox with the Techdirt Daily Newsletter!

We don’t spam. Read our privacy policy for more info.

Ctrl-Alt-Speech

A weekly news podcast from
Mike Masnick & Ben Whitelaw

Subscribe now to Ctrl-Alt-Speech »
Techdirt Deals
Techdirt Insider Discord
The latest chatter on the Techdirt Insider Discord channel...
Loading...