Study: It’s Comically Easy To Identify ‘Anonymized’ Users In The ‘Metaverse’ With A Tiny Bit Of Motion Data

from the "anonymized"-doesn't-mean-anonymous dept

We’ve noted for a very long while how most of the explanations that corporations use to insist that your privacy is protected are effectively worthless.

For example, corporations will routinely inform you that it’s no big deal that they’re over-collecting and selling access to your browsing or location data to any idiot with a nickel because that data is “anonymized,” protecting your identity. In reality, that term means nothing, and study after study have shown it’s easy to identify you with only a few snippets of additional information.

With that in mind, a new study about user privacy in the virtual reality and augmented reality era (full study here) tracked 50,000 users in VR and found some interesting data. Most notably, that it takes incredibly little actual data collected from device microphones, cameras, and other tech to accurately identify a user’s real-world identity.

Like, very little:

The research analyzed more than 2.5 million VR data recordings (fully anonymized) from more than 50,000 players of the popular Beat Saber app and found that individual users could be uniquely identified with more than 94% accuracy using only 100 seconds of motion data.

Even more surprising was that half of all users could be uniquely identified with only 2 seconds of motion data. Achieving this level of accuracy required innovative AI techniques, but again, the data used was extremely sparse — just three spatial points for each user tracked over time.

Researchers found that the data they leave behind in virtual reality is more useful than a fingerprint to identify individuals. It also provides significantly more data to monetize, including a user’s height, handedness, gender, potential disability, strength, personal tics, etc.

Combine this data with the profiles already commonly being built at major companies and ad brokers, and you could see how this might be a bit of an issue in a country that’s literally too corrupt to pass even a basic privacy law for the internet era (there was just too much money to be made, sorry).

There have been so many studies at this point (including other previous studies of user VR data) showcasing how “anonymization” is a gibberish term. Yet the next time there’s a hack, breach, or huge batch of public data left unsecured in an Amazon cloud bucket, notice how quickly the term is immediately utilized as a catch all defense for sloppy privacy and security practices.

Filed Under: , , , , , , ,
Companies: facebook, meta

Rate this comment as insightful
Rate this comment as funny
You have rated this comment as insightful
You have rated this comment as funny
Flag this comment as abusive/trolling/spam
You have flagged this comment
The first word has already been claimed
The last word has already been claimed
Insightful Lightbulb icon Funny Laughing icon Abusive/trolling/spam Flag icon Insightful badge Lightbulb icon Funny badge Laughing icon Comments icon

Comments on “Study: It’s Comically Easy To Identify ‘Anonymized’ Users In The ‘Metaverse’ With A Tiny Bit Of Motion Data”

Subscribe: RSS Leave a comment
25 Comments
Anonymous Coward says:

Re: Re:

““I know you’ve been waiting for this. Everyone has been waiting for this,” said a visually improved avatar version of Zuckerberg in Tuesday’s presentation. “But seriously, legs are hard, which is why other virtual reality systems don’t have them either.” ”

https://www.vox.com/recode/2022/10/11/23399439/metaverse-mark-zuckerberg-connect-avatar-legs-meta-microsoft-apple-vr-ar

HotHead says:

Re: Assuming that your question is serious

  1. Just because an app chooses not to display legs doesn’t mean it can’t collect data about your leg movements.
  2. Legs are not the only part of your body which can move and thus generate movement data.
  3. You don’t need to map leg data to actual leg movements. You just need enough leg data to dinstinguish one person’s leg data from another’s. If you tell me your height using a fictional unit of measurement, I don’t need to convert your height to a standard unit of measurement to distinguish you from other people as long as I have other people’s heights in the fictional unit of measurement.
Anonymous Coward says:

Re:

From the paper:

We define a “session” as a continuously-recorded sequence of replays from a single user where no more than 10 minutes have elapsed between each replay. Our dataset contains an average of 13 such sessions per user. For each user, we reserve 70% of the sessions for training, 10% for validation, and 20% for testing, with a minimum of 1 session per set. As such, our models always perform true cross-session user identification rather than merely learning session-specific features, such as the exact position of a user within their room.

“Real-world” identity in this case means the username that generated the Beat Saber replay. I don’t think it would be possible to cross-reference this model with motion-tracking captured by non-VR applications, since the motion features used basically only exist when playing the game.

Anonymous Coward says:

Privacy fear-mongers are forever trying to force their ludicrous concerns onto everyone else, and, thankfully so far, they have been largely failing. For the vast majority of people, “privacy” is of negative benefit, preventing online interaction that would benefit users by improving their experience with things useful to them.

Anonymous Coward says:

Re: Re:

I am otherwise automatically sent to moderation because the site host hates my viewpoints, and I don’t feel like waiting for hours or days to participate in the conversation. I’m well-known here to the people who dislike me, and am usually recognized by my use of the term “woke ideology”. If the host would stop trying to harass me into leaving, I would be happy to comment as signed-in.

Anonymous Coward says:

Re: Re: Re:

Hyman.

You keep spouting Nazi ideology and you can’t even wonder WHY you’re being flagged, warned, told to leave and moderated?

You are not silenced EVERYWHERE. You are silenced HERE. Mike won’t be so petty as to drop a SLAPP on you or worse, even though you do deserve it.

And you keep continuing to BE A NAZI< even though by your own admission, you’re probably a Jew, or Polish, and your parents or grandparents have lived through World War 2 and the ACTUAL NAZI REGIME.

It just means that you are a shame to them. And worse.

HotHead says:

Re: Re: Re:

I am otherwise automatically sent to moderation because the site host hates my viewpoints, and I don’t feel like waiting for hours or days to participate in the conversation.

Uh huh. You are trying to achieve a direct benefit from privacy. Only you and the people you like deserve privacy. Everyone else who wants privacy must be a whining baby.

PaulT (profile) says:

Re: Re:

More accurately – it’s been trained on data that was online during the training period. That doesn’t mean results are accurate (it can return results based on the training, but they might not be accurate depending on the query), or out of date (something changed after the training period).

If an inaccurate result is returned, it doesn’t mean it’s “making it up”, it means it’s returning an inaccurate result based on the dataset it’s using. Which just means that you have to deal with machine learning as suggestion not a fact, which is good advice no matter what you’re trying to search for.

Nemo_bis (profile) says:

So virtual reality has something to offer after all

All this focus on the “metaverse” always sounded idiotic to me. Decades-old reheated soup. However:

Researchers found that the data they leave behind in virtual reality is more useful than a fingerprint to identify individuals. It also provides significantly more data to monetize, including a user’s height, handedness, gender, potential disability, strength, personal tics, etc.

Oh, so maybe Zuck does know what he’s doing. When everyone pointed out Facebook is a massive threat to privacy, a mass surveillance mechanism and a way to sell personal data to advertisers, he went “Oh, all that sounds great! Is there any way to do it even more?”. And the answer apparently was virtual reality.

Add Your Comment

Your email address will not be published. Required fields are marked *

Have a Techdirt Account? Sign in now. Want one? Register here

Comment Options:

Make this the or (get credits or sign in to see balance) what's this?

What's this?

Techdirt community members with Techdirt Credits can spotlight a comment as either the "First Word" or "Last Word" on a particular comment thread. Credits can be purchased at the Techdirt Insider Shop »

Follow Techdirt

Techdirt Daily Newsletter

Subscribe to Our Newsletter

Get all our posts in your inbox with the Techdirt Daily Newsletter!

We don’t spam. Read our privacy policy for more info.

Ctrl-Alt-Speech

A weekly news podcast from
Mike Masnick & Ben Whitelaw

Subscribe now to Ctrl-Alt-Speech »
Techdirt Deals
Techdirt Insider Discord
The latest chatter on the Techdirt Insider Discord channel...
Loading...