Once upon a time, it may have been sufficient from a privacy point of view to concern ourselves only with “public use versions” of data sets, where information such as names, telephone numbers, social security numbers, personal addresses, and certain types of medical information or information in public arrest records were redacted.

But we don’t just live any more in a world of data sets containing personally identifiable information (PII) and personal health information (PHI) – we live in a world of data sets where pieces of information that themselves are concededly “non-PII” and “non-PHI” can be combined together in a myriad of ways using advanced analytical techniques. Through the use of data mining and data linkage, including tapping into available external sources of data, in the world we live in now “de-identified” persons can be “re-identified,” and anonymous data can be de-anonymized.