Though still trying to adapt to a world of social media and emojis, modern e-discovery technology has progressed by leaps and bounds. Document review, for instance, has reached a level of automation that was once unthinkable. With technology-assisted review (TAR) and new machine learning techniques, even the review of small foreign-language document sets that contain few relevant documents can be automated.

To be sure, foreign-language documents do not present much of a challenge to e-discovery technology. To review documents in different languages, e-discovery platforms must first be able to render each language's writing system, which may not put spaces between words. But most of the time, this can be solved by adding support for the relevant Unicode scripts and implementing a tokenization system to mark the boundaries between words.
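
To make the tokenization step concrete, here is a minimal sketch of one classic approach, dictionary-based forward maximum matching, applied to text written without spaces. It is an illustration only: the function name `segment` and the three-word vocabulary are hypothetical, and nothing here is meant to describe how any particular e-discovery platform works. Production systems typically rely on much larger lexicons or statistical models.

```python
def segment(text, vocabulary):
    """Split unspaced text into tokens by forward maximum matching.

    At each position, the longest vocabulary entry that matches is taken
    as the next token. Characters that match no entry are emitted as
    single-character tokens so that no text is lost.
    """
    # Longest entry in the (hypothetical) vocabulary; at least 1 so the
    # scan always advances, even with an empty vocabulary.
    max_len = max(1, max((len(word) for word in vocabulary), default=1))

    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then progressively shorter ones.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in vocabulary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Toy vocabulary, assumed for illustration only.
vocabulary = {"電子", "証拠", "開示"}
print(segment("電子証拠開示", vocabulary))
```

Once the text is broken into tokens this way, it can be indexed and searched like a language that separates words with spaces. The greedy longest-match rule is simple but can choose the wrong boundary when dictionary entries overlap, which is one reason real tokenizers tend to be more elaborate.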