• ballmerpeaking
    link
    fedilink
    arrow-up
    6
    ·
    edit-2
    2 days ago

    De-identified data is an oxymoron. Basically any dataset that’s in any way interesting is identifiable.

    • sus
      link
      fedilink
      arrow-up
      4
      ·
      edit-2
      2 days ago

      no it’s not. If you reduce the information in the datapoints until none of them are unique, then it is very obviously impossible to uniquely identify someone from them. And when you have millions of users the data can definitely still be kept interesting

      (though there’s pretty big pitfalls here, as their report seems to leave open the possibility of not doing it correctly)