Don't Let This Get Lost In The Shuffle: The Data Transfer Project Is Expanding, And Could Help Create Real Competition Online

from the this-is-important dept

While lots of people are angling to break up the big internet companies in the belief that will lead to more competition, we've long argued that such a plan is unlikely to work. Instead, if you truly want more competition you need to end the ability of these companies to lock up your data. Instead, we need to allow third parties access so that the data is not stuck in silos, but where users themselves both have control and alternative options that they can easily move to.

That's why we were quite interested a year ago when Google, Facebook, Microsoft and Twitter officially announced the Data Transfer Project (which initially began as a Google project, but expanded to those other providers a year ago). The idea was that the companies would make it ridiculously easy to let users automatically transfer their own data (via their own control) to a different platform. While some of the platforms had previously allowed users to "download" all their data, this project was designed to be much more: to make switching from one platform to another much, much easier -- effectively ending the siloing of data and (worse) the lock-in effects that help create barriers to competition. As we noted last year:

But the really important thing that this may lead to is not so much about transferring your data between one of the giant platforms, but hopefully in opening up new businesses which would allow you to retain much greater control over your data, while limiting how much the platforms themselves keep. This is something we've talked about in the past concerning the true power of data portability. Rather than having it tied up in silos connected to the services you use, wouldn't it be much better if I could keep a "data bank" of my data in a place that is secure -- and where if and when I want to I can allow various services to access that data in order to provide the services I want?

In other words, for many years I've complained about how we've lost the promise of cloud computing in just building up giant silos of data connected to the various online services. If we can separate out the data layer from the service layer, then we can get tremendous benefits, including (1) more end-user control over their own data (2) more competitive services and (3) less power to dominate everything by the biggest platforms. Indeed, we could even start to move towards a world of protocols instead of platforms.

So it's good news to see the latest announcement about the project is that it's expanding once again. While the headlines are that Apple has joined the program (to round out the biggest internet companies) it's also notable that two other very interesting, but much smaller, players are joining as well: the federated Mastodon project and Tim Berners-Lee's Solid, which is an attempt to build the kind of "protocols, not platforms" approach that we keep advocating for.

There are still many open questions about how well all of this will work -- but if you believe in true competition among internet services this is the project to pay attention to it, as it has the highest likelihood of actually creating such competition. Plans to "break up" big tech just creates a few more data silos and effectively locks in some pre-selected (slightly smaller) giants, thanks to network effects. What the Data Transfer Project does is flip the equation. It makes it so that more competition can thrive without taking away the network effects that make the internet so powerful. It's the most interesting, and most compelling approach to generating actual competition among internet services.

I still hope that the project goes even further in knocking down silos and opening up for competition, but it's already quite encouraging. Of course, it got almost no attention at all because anti-trust is sexy, whereas companies opening themselves up to competition through technological means is apparently boring.

Filed Under: competition, data portability, data transfer project, lock-in, mastodon, solid
Companies: apple, facebook, google, microsoft, solid

Reader Comments

The First Word

That's why we were quite interested a year ago when Google, Facebook, Microsoft and Twitter officially announced the Data Transfer Project (which initially began as a Google project, but expanded to those other providers a year ago). The idea was that the companies would make it ridiculously easy to let users automatically transfer their own data (via their own control) to a different platform.

Have you actually looked at Facebook's "ridiculously easy" data? I downloaded mine a few months ago, and looking at it from the perspective of a programmer, it's garbage. It's exactly what I would do if I wanted to set up a system specifically designed to look like openness to an unskilled outside observer (such as a politician or regulator) while being worthless for the purpose of actually enabling data transfer to a competitor.

The devil, it has been said, is in the details, and when you look at the details of the data Facebook gives you, (and what they don't give you), you definitely see a diabolical entity emerge. The most important subtle little problem is that there are no unique identifiers.

For example, in your Friends data, it gives you the name of each Friend, and a few bits of data they've shared, but no username or other token that identifies them specifically. Then in your Comments data, it says which post you commented on, and the name of the person who posted it... but without a unique identifier you have no way of knowing if this Bob Smith is the same Bob Smith in your Friends list or someone else who happens to have that name.

You may say "well sure, but how likely are you to have two friends by the same name, or go commenting on someone's post with the same name as one of your friends?" And you'd probably be right... but that's exactly what makes this such a subtly evil problem. Because it looks just fine to any individual user, but if you try to use the data for its primary intended purpose--to facilitate competition by enabling people to move to a competing system--the lack of unique identifiers makes it impossible to reconstruct the social graph. If I'm running the MasonBook network and I import data from Dave, Fred, and Janet, and all of them have a friend named Bob Smith, I have no way to determine if they're all friends with the same person or not.

Facebook's "participation" in the Data Transfer Project is nothing but transparency theater, to borrow a concept from the world of security. It's just more of the same from a company that's never bothered to even pretend they're not being evil.

—Mason Wheeler

Subscribe: RSS

View by: Time | Thread

  1. identicon
    Anonymous Coward, 2 Aug 2019 @ 5:43pm

    Re: Re: Exactly HOW am I better off with MORE corps having my de


    There is no "retracting" of any data. Only a promise against reusing it.

    You can't provably remove the data from FaceBook's, or anyone else's, server. A legal injunction would be your best option, but they can just as easily move the data and it's processing out of any jurisdiction to render such an injunction worthless literally at the push of a button.

    The only real option you have is to simply not play the game. I.e. Never give them the data in the first place. Of course there are so many idiots in the US and elsewhere who don't care about uploading data that doesn't belong to them, that giving no data at all is currently an impossibility.

    Maybe making it a requirement during a data breach to divulge the identities of the uploaders so that affected parties can sue them for reparations might fix this. (Companies can avoid liability by claiming the individual uploaders who gave them the exposed data. It's about the only way you'd get such a law passed Congress.) At the very least, a few big lawsuits under such a law would give those who don't care about the privacy of others pause when hitting that submit button.

    All the Data Transfer Project does is cover up the real issue. It's not about who has your data, it's about them having it and abusing it in the first place. All the cover up does is redistribute the data amongst the abusers, attempts to legitimize the abuse, and profits gained from said abuse, with the excuse of "well it's OK as long as X doesn't do it, right?", and lowers the barrier to entry. Allowing old victims to gain new abusers, and old abusers access to new victims.

Add Your Comment

Have a Techdirt Account? Sign in now. Want one? Register here

Subscribe to the Techdirt Daily newsletter

Comment Options:

  • Use markdown. Use plain text.
  • Remember name/email/url (set a cookie)

Follow Techdirt
Special Affiliate Offer

Report this ad  |  Hide Techdirt ads
Essential Reading
Techdirt Deals
Report this ad  |  Hide Techdirt ads
Techdirt Insider Chat
Report this ad  |  Hide Techdirt ads
Recent Stories
Report this ad  |  Hide Techdirt ads


Email This

This feature is only available to registered users. Register or sign in to use it.