Most companies and law firms are in the dark about their data — they’ve been collecting it for years, since the advent of computers, and don’t have a clue what they’re holding on to. Most of this data is redundant, obsolete, or trivial digital data they continue to retain even though the information has no business or legal value.

While some of the data is well-organized in structured databases like Oracle or SQL, the vast majority of the accumulated data results from interactions between people and is referred to as unstructured data because it is data that cannot be easily categorized, analyzed or stored in formalized repositories.  It is found in such places as email, websites, instant messages, file shares, mobile applications and more. In the legal world, such unstructured data includes client matters, case files, court filings, deposition transcripts, personnel records, contracts and more.