Don't Let This Get Lost In The Shuffle: The Data Transfer Project Is Expanding, And Could Help Create Real Competition Online

from the this-is-important dept

While lots of people are angling to break up the big internet companies in the belief that will lead to more competition, we've long argued that such a plan is unlikely to work. Instead, if you truly want more competition you need to end the ability of these companies to lock up your data. Instead, we need to allow third parties access so that the data is not stuck in silos, but where users themselves both have control and alternative options that they can easily move to.

That's why we were quite interested a year ago when Google, Facebook, Microsoft and Twitter officially announced the Data Transfer Project (which initially began as a Google project, but expanded to those other providers a year ago). The idea was that the companies would make it ridiculously easy to let users automatically transfer their own data (via their own control) to a different platform. While some of the platforms had previously allowed users to "download" all their data, this project was designed to be much more: to make switching from one platform to another much, much easier -- effectively ending the siloing of data and (worse) the lock-in effects that help create barriers to competition. As we noted last year:

But the really important thing that this may lead to is not so much about transferring your data between one of the giant platforms, but hopefully in opening up new businesses which would allow you to retain much greater control over your data, while limiting how much the platforms themselves keep. This is something we've talked about in the past concerning the true power of data portability. Rather than having it tied up in silos connected to the services you use, wouldn't it be much better if I could keep a "data bank" of my data in a place that is secure -- and where if and when I want to I can allow various services to access that data in order to provide the services I want?

In other words, for many years I've complained about how we've lost the promise of cloud computing in just building up giant silos of data connected to the various online services. If we can separate out the data layer from the service layer, then we can get tremendous benefits, including (1) more end-user control over their own data (2) more competitive services and (3) less power to dominate everything by the biggest platforms. Indeed, we could even start to move towards a world of protocols instead of platforms.

So it's good news to see the latest announcement about the project is that it's expanding once again. While the headlines are that Apple has joined the program (to round out the biggest internet companies) it's also notable that two other very interesting, but much smaller, players are joining as well: the federated Mastodon project and Tim Berners-Lee's Solid, which is an attempt to build the kind of "protocols, not platforms" approach that we keep advocating for.

There are still many open questions about how well all of this will work -- but if you believe in true competition among internet services this is the project to pay attention to it, as it has the highest likelihood of actually creating such competition. Plans to "break up" big tech just creates a few more data silos and effectively locks in some pre-selected (slightly smaller) giants, thanks to network effects. What the Data Transfer Project does is flip the equation. It makes it so that more competition can thrive without taking away the network effects that make the internet so powerful. It's the most interesting, and most compelling approach to generating actual competition among internet services.

I still hope that the project goes even further in knocking down silos and opening up for competition, but it's already quite encouraging. Of course, it got almost no attention at all because anti-trust is sexy, whereas companies opening themselves up to competition through technological means is apparently boring.

Filed Under: competition, data portability, data transfer project, lock-in, mastodon, solid
Companies: apple, facebook, google, microsoft, solid


Reader Comments

The First Word

That's why we were quite interested a year ago when Google, Facebook, Microsoft and Twitter officially announced the Data Transfer Project (which initially began as a Google project, but expanded to those other providers a year ago). The idea was that the companies would make it ridiculously easy to let users automatically transfer their own data (via their own control) to a different platform.

Have you actually looked at Facebook's "ridiculously easy" data? I downloaded mine a few months ago, and looking at it from the perspective of a programmer, it's garbage. It's exactly what I would do if I wanted to set up a system specifically designed to look like openness to an unskilled outside observer (such as a politician or regulator) while being worthless for the purpose of actually enabling data transfer to a competitor.

The devil, it has been said, is in the details, and when you look at the details of the data Facebook gives you, (and what they don't give you), you definitely see a diabolical entity emerge. The most important subtle little problem is that there are no unique identifiers.

For example, in your Friends data, it gives you the name of each Friend, and a few bits of data they've shared, but no username or other token that identifies them specifically. Then in your Comments data, it says which post you commented on, and the name of the person who posted it... but without a unique identifier you have no way of knowing if this Bob Smith is the same Bob Smith in your Friends list or someone else who happens to have that name.

You may say "well sure, but how likely are you to have two friends by the same name, or go commenting on someone's post with the same name as one of your friends?" And you'd probably be right... but that's exactly what makes this such a subtly evil problem. Because it looks just fine to any individual user, but if you try to use the data for its primary intended purpose--to facilitate competition by enabling people to move to a competing system--the lack of unique identifiers makes it impossible to reconstruct the social graph. If I'm running the MasonBook network and I import data from Dave, Fred, and Janet, and all of them have a friend named Bob Smith, I have no way to determine if they're all friends with the same person or not.

Facebook's "participation" in the Data Transfer Project is nothing but transparency theater, to borrow a concept from the world of security. It's just more of the same from a company that's never bothered to even pretend they're not being evil.

—Mason Wheeler

Subscribe: RSS

View by: Time | Thread


  1. icon
    Scary Devil Monastery (profile), 2 Aug 2019 @ 5:31am

    Re: Re: Exactly HOW am I better off with MORE corps having my de

    "Think of it this way: instead of data being locked up with the service, you'd have your own personal data broker who would hold your private/public data. You would then grant various services the right to access that data on a limited basis."

    Nice if it worked but fails on a few practical considerations:

    1) Data storage and administration isn't free. So we're looking a solution which one way or the other requires the user to pay.

    2) For said data to make any sense what so ever means that the platform to which you extend "gathering" rights will have to reveal the questions it asks and the answers it receives to a 3rd party commercial entity. Something which may in many jurisdictions mean a tripartite contract must be established for every platform the user wants to access.

    3) The user in heaviest need of this type of service is unfortunately often the user who doesn't really care that they've casually clicked their way past any expectation to privacy and data ownership with twenty different platforms.

    4) The idea that you can retain your data while letting anyone else view it is absurd to begin with. You should assume that if a given piece of information is shown to anyone else that information will be public property tomorrow. The saying that "Three may keep a secret if two of them are dead" is as true today as it was when old Benjie first coined the phrase.


Add Your Comment

Have a Techdirt Account? Sign in now. Want one? Register here



Subscribe to the Techdirt Daily newsletter




Comment Options:

  • Use markdown. Use plain text.
  • Remember name/email/url (set a cookie)

Follow Techdirt
Techdirt Gear
Show Now: Takedown
Advertisement
Report this ad  |  Hide Techdirt ads
Essential Reading
Techdirt Deals
Report this ad  |  Hide Techdirt ads
Techdirt Insider Chat
Advertisement
Report this ad  |  Hide Techdirt ads
Recent Stories
Advertisement
Report this ad  |  Hide Techdirt ads

Close

Email This

This feature is only available to registered users. Register or sign in to use it.