• Home
  • News
  • Firms & Lawyers
  • Courts
  • Judges
  • Surveys/lists
  • Columns
  • Verdicts
  • Public Notices
  • Advertise
  • Subscribe

Home > The 'Streetlight Effect' on E-Discovery

Font Size: increase font decrease font

Law Technology News

Previous

  • 1
  • 2
  • 3

Next

The 'Streetlight Effect' on E-Discovery

December 10, 2012

  •    
  •    
  •    
  •      
 

C. The number and nature of documents or communications in the system or index which are not searchable as a consequence of the system or index being unable to extract its full text or metadata.

D. Any limitation in the system or index, or in the search syntax to be employed, tending to limit or impair the effectiveness of keyword, Boolean or proximity search in identifying documents or communications that a reasonable person would understand to be responsive to the search.

A court will permit "discovery about discovery" like this when a party demonstrates why an inadequate index is a genuine problem. So, let's explore the rationale behind each inquiry:

A. Tokenization Rules. When machines search collections of documents for keywords, they rarely search the documents for matches; instead, they consult an index of words extracted from the documents. Machines cannot read, so the characters in the documents are identified as "words" because their appearance meets certain rules in a process called "tokenization." Tokenization rules aren't uniform across systems or software. Many indices simply don't index short words (e.g., acronyms). None index single letters or numbers.

Tokenization rules also govern such things as the handling of punctuated terms (as in a compound word like "wind-driven"), case (will a search for "roof" also find "Roof?"), diacriticals (will a search for Rene also find René?) and numbers (will a search for "Clause 4.3" work?). Most people simply assume these searches will work. Yet, in many search tools and archives, they don't work as expected, or don't work at all, unless steps are taken to ensure that they will work.

B. Stop Words. Some common "stop words" or "noise words" are simply excluded from an index when it's compiled. Searches for stop words fail because the words never appear in the index. Stop words aren't always trivial omissions. For example, "all" and "city" were stop words; so, a search for "All City" will fail to turn up documents containing the company's own name! Words like side, down, part, problem, necessary, general, goods, needing, opening, possible, well, years and state are examples of common stop words. Computer systems typically employ dozens or hundreds of stop words when they compile indices.

Because users aren't warned that searches containing stop words fail, they mistakenly assume that there are no responsive documents when there may be thousands. A search for "All City" would miss millions of documents at All City Indemnity (though it's folly to search a company's files for the company's name).

C. Non-searchable Documents. A great many documents are not amenable to text search without special handling. Common examples of non-searchable documents are faxes and scans, as well as .tiff images and some Adobe PDF documents. While no system will be flawless in this regard, it's important to determine how much of a collection isn't text-searchable, what's not searchable and whether the portions of the collection that aren't searchable are of particular importance to the case.

If All City's adjusters attached scanned receipts and bids to email messages, the attachments aren't keyword searchable absent optical character recognition.

Other documents may be inherently text-searchable but not made a part of the index because they're password-protected (i.e., encrypted) or otherwise encoded or compressed in ways that frustrate indexing of their contents. Important documents are often password-protected.

Continue reading

Previous

  • 1
  • 2
  • 3

Next



Subscribe to Law Technology News

You must be signed in to comment on an article

Find similar content

Most viewed stories

    
  1. New District Judge Takes Firm Line on Attorney Conduct
    •      
  2. Workplace Bullying: Managing the Organizational Playground
    •      
  3. Bernstein Upholds $78.4 Mil. Verdict in Phila. Med Mal Case
    •      
  4. Third Circuit Rejects NLRB Recess Appointment
    •      
  5. Judges Want Master to Develop Record in Retirement Age Case
    •      
lawjobs.com

TOP JOBS

MORE JOBS

POST A JOB

From the Law.com Network

Taking the Reins of Legal Department Operations

In-House Law: Now in 3-D!

Simpson Helps Yahoo, Tumblr Connect for $1 Billion Deal

Kasowitz Benson Launches in Los Angeles

Contrite Companies Can Win Forgiveness in Bribery Cases
  •      
    • Subscription Required

Plaintiffs Want to See Toyota's 'Crown Jewels'
  •      
    • Subscription Required

Collaboration Is Key to Defending Cyberattacks

Stanford Law Builds on Role as Legal Tech Incubator

Prolific ADA Plaintiff Faces Nemesis in Harassment Suit

Ullyot Exit Closes Chapter for Facebook

Rothstein Bankruptcy Trustee Files New Reorganization Plan
  •      
    • Subscription Required

Fla. Bar Wants Disbarment for Former Judge
  •      
    • Subscription Required

Appellate Division To Roll Out Electronic Case Filing System

Court Limits Liability for Injury Or Death of One Invited To Help
  •      
    • Subscription Required

The Affordable State-Specific Practice Solution
Available in NY, NJ, PA and CT editions - research, draft and prepare even the most complex cases with ease.

Judge Declines to Block Act-of-War Defense in 9/11 Case
  •      
    • Subscription Required

Panel Finds 'Excessive' City Fine for Poaching Antenna From Trash
  •      
    • Subscription Required

Lawsuit Testing Federal Porn Regulation Allowed to Survive

Ex-College QB Can Press Claim Over EA's Video Game
  •      
    • Subscription Required

Law Schools Are Looking Beyond LSATs, Says Mich. Dean

Is Freezing Your Eggs the Solution?

Water Warriors: Local Governments Bring Pollution Suits
  •      
    • Subscription Required

Sanction Reversed; Filing of Sexually Explicit Chat OKd
  •      
    • Subscription Required

Brooks Looks To Political Ally For Criminal Defense

Attorney Fee Hearing in Waffle House Sex Case Heats Up
  •      
    • Subscription Required

Corporate Bribery Case Part Of National Trend
  •      
    • Subscription Required

Court Continues To Grant Lawyers Fraud Immunity
  •      
    • Subscription Required

  • About |
  • ALM Properties |
  • ALM Reprints |
  • Customer Support |
  • Privacy Policy |
  • Terms & Conditions |
  • ALM User License Agreement
ALM Media