You Too Can Be a Programmer: How Generative AI Can Upskill Any E-Discovery Professional to Write Code
"In this article, we’ll show you how to use these tools to write your own computer program," write Jay Leib and Ye Chen of Reveal.
November 14, 2024 at 11:00 AM
Since we began our careers in e-discovery in 1997, there’s always been a divide between project managers, who understand the business needs, and programmers, who build the technical solutions. Today, that gap is closing, thanks to generative AI tools like ChatGPT and Claude. These tools empower e-discovery professionals to take on tasks that previously required coding knowledge—whether it’s building a simple program, massaging a load file, or interacting with databases via APIs.
In the world of comics, Green Lantern wields a power ring that channels his creativity to conjure any object he can imagine, giving him an advantage in battling villains. His real strength and differentiator is his problem-solving ability, not just the power ring itself. Today’s e-discovery professionals can similarly use their creativity combined with generative AI to build impactful solutions. Think of generative AI tools as your own power ring, giving you the ability to create without needing a computer science degree.
In this article, we’ll show you how to use these tools to write your own computer program. This is just the start of your coding journey, assisted by the power of generative AI.
Generative AI: Your New Superpower
Generative AI, like ChatGPT and Claude, has the power to write entire programs from just a simple prompt. It can help you develop useful scripts, write SQL queries to interact with your databases, or even create full-blown applications—all without deep technical knowledge. Imagine needing to write an application to repeatedly transform different types of load files; you can start building that application today. This kind of capability bridges the gap between your e-discovery expertise and deep technical knowledge.
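As a rough illustration of the load-file scenario, here is a minimal sketch that converts a Concordance-style DAT load file into ordinary comma-separated text using only Python's built-in csv module. It assumes the common default delimiters (ASCII 20 as the field separator and þ as the text qualifier). The function name `dat_to_csv` and the sample field names are made up for this example, and your own load files may use different delimiters or encodings.

```python
import csv
import io

# Assumed Concordance-style defaults; check your own load file before relying on them.
FIELD_SEPARATOR = "\x14"   # ASCII 20
TEXT_QUALIFIER = "\u00fe"  # the "þ" character


def dat_to_csv(dat_text):
    """Convert DAT-style load file text into comma-separated text."""
    reader = csv.reader(
        io.StringIO(dat_text),
        delimiter=FIELD_SEPARATOR,
        quotechar=TEXT_QUALIFIER,
    )
    output = io.StringIO()
    writer = csv.writer(output, lineterminator="\n")
    for row in reader:
        # The csv writer adds quotes only where a value needs them.
        writer.writerow(row)
    return output.getvalue()


if __name__ == "__main__":
    # Tiny made-up sample: a header row and one document row.
    sample = (
        "þDOCIDþ\x14þTITLEþ\x14þDATEþ\n"
        "þDOC001þ\x14þLease, signedþ\x14þ2024-01-15þ\n"
    )
    print(dat_to_csv(sample))
```

A prompt along the lines of "convert this DAT load file to CSV" might produce something similar. The point is that the whole transformation fits in a couple dozen lines.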
Key Terms to Know to Build Our App
Before we get started, here’s a glossary of key terms you’ll come across:
- Prompt: A prompt is a question or command given to an AI model to shape its response—whether a simple query or a specific coding instruction. Clear prompts = better results.
- Programming Language: Programming languages are the core tools for creating software. Pick one based on your project. Here’s a quick rundown:
  - Python: Simple and versatile, ideal for data tasks and automation.
  - C#: Powerful for Windows applications.
  - Swift: Built for fast, efficient iOS and Mac apps.
- IDE (Integrated Development Environment): An IDE is where you write, test, and debug code. New to coding? Try VS Code. For iOS or Mac, use Xcode.
- GitHub Repositories (Repos): GitHub, a Microsoft platform, lets you save, share, and collaborate on code. A repository, or “repo,” is a dedicated project space where you store your code and files, allowing for version tracking and teamwork with other developers.
- API (Application Programming Interface): APIs let programs talk to each other, often used to connect with databases or services.
Let’s Build Our First Application: The Data Organizer
Let’s start with a practical example. Imagine you have a list of documents with metadata contained within a text file, and you want to reorganize it by date. Using ChatGPT or Claude, you can create a simple Python script that allows you to import a file, sort the data, and display the results in a user-friendly console interface.
Prompt 1 (Python):
“Write a Python program that reads a text file containing document titles and dates, sorts the documents by date, and displays the sorted results in a nice, clear format in the console. Keep the program simple, using only basic Python libraries. Include helpful comments for each step.”
This prompt instructs the AI to produce a Python program that is clean, straightforward, and ready for direct use. The code should be formatted for a seamless copy-paste into an IDE or an online notebook such as Google Colab, where it can run as soon as your text file is available to it. The design is intended to keep things intuitive and efficient, making it a good fit as a quick solution for sorting document data or as a foundation for more complex workflows.
You can view the code generated by GPT-4 in the following GitHub repo file:
https://github.com/LTProgrammer/eDiscovery/blob/main/reorganize_docs_by_date.py
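If you’d like a feel for what a response to Prompt 1 might look like before opening the repo, here is a minimal sketch. It is not the code in the repo above, and an LLM may well structure its answer differently. It assumes one document per line in the form "Title, YYYY-MM-DD", and the file name `documents.txt` and all function names are hypothetical.

```python
from datetime import datetime
from pathlib import Path

# Assumed input layout: one document per line, "Title, YYYY-MM-DD".
DATE_FORMAT = "%Y-%m-%d"
INPUT_FILE = Path("documents.txt")  # hypothetical file name


def parse_line(line):
    """Split a line into (title, date) at the last comma."""
    title, date_text = line.rsplit(",", 1)
    return title.strip(), datetime.strptime(date_text.strip(), DATE_FORMAT).date()


def sort_documents(lines):
    """Parse non-blank lines and return (title, date) pairs, oldest first."""
    documents = [parse_line(line) for line in lines if line.strip()]
    return sorted(documents, key=lambda doc: doc[1])


def format_documents(documents):
    """Build an aligned two-column listing for the console."""
    width = max((len(title) for title, _ in documents), default=0)
    width = max(width, len("Title"))
    rows = [f"{'Title':<{width}}  Date", f"{'-' * width}  ----------"]
    rows += [f"{title:<{width}}  {date.isoformat()}" for title, date in documents]
    return "\n".join(rows)


if __name__ == "__main__":
    if INPUT_FILE.exists():
        lines = INPUT_FILE.read_text(encoding="utf-8").splitlines()
    else:
        # Made-up sample data so the script runs without any input file.
        lines = [
            "Settlement Agreement, 2023-11-02",
            "Board Minutes, 2021-06-17",
            "Lease, Amended, 2022-03-09",
        ]
    print(format_documents(sort_documents(lines)))
```

Splitting at the last comma lets titles contain commas of their own, which is the kind of detail worth mentioning in your prompt if your data needs it.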
If you’d like to understand each part of the code more deeply, you can ask the LLM to add a description for every line, explaining exactly what it does. Simply prompt it to write comments as detailed as you need, giving you full control over the level of insight.
Let’s Enhance Our Code
Prompt 2 (Python):
“Update the code with comments on every line explaining what it does to a beginner programmer.”
Voila! The code is now fully annotated with clear, beginner-friendly comments on each line. As someone who’s led development teams for years, I find this nothing short of amazing.
You can view the updated code generated by GPT-4 in the following GitHub repo file:
https://github.com/LTProgrammer/eDiscovery/blob/main/reorganize_docs_by_date_v2.py
Where to Build and Run Your Application
You can run this code in any IDE that supports Python, like Microsoft’s VS Code, or directly in an online Python environment such as Replit or Google Colab—no installation needed if you go the online route. If you encounter issues, just copy and paste any error messages into the LLM with a prompt like:
“Please see the errors from our code; can you update it to fix these errors so it runs?”
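To make this concrete, one error a date-sorting script could plausibly hit is a `ValueError` when a date in the file doesn’t match the expected layout. The sketch below shows the kind of fix an LLM might suggest: try several layouts in turn and return nothing if none match. The list of layouts and the function name `parse_date` are assumptions for illustration, and the slash layout assumes US-style month-first dates.

```python
from datetime import datetime

# Hypothetical list of date layouts to try, in order.
# "%m/%d/%Y" assumes month-first (US-style) dates.
KNOWN_FORMATS = ("%Y-%m-%d", "%m/%d/%Y", "%B %d, %Y")


def parse_date(text):
    """Return a date for the first matching layout, or None if none match."""
    for layout in KNOWN_FORMATS:
        try:
            return datetime.strptime(text.strip(), layout).date()
        except ValueError:
            continue  # Not this layout; try the next one.
    return None


if __name__ == "__main__":
    for raw in ["2024-01-15", "01/15/2024", "January 15, 2024", "sometime in 2024"]:
        print(f"{raw!r} -> {parse_date(raw)}")
```

Returning `None` instead of crashing lets the rest of the program decide what to do with unreadable dates, such as listing them separately for manual review.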
Want to add more options, like sorting by author or subject as well as by date? Update your prompt to request those features, and the LLM can take it from there.
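As a hedged sketch of what those extra sort options could look like, the example below sorts a small set of made-up records by a chosen field. The record layout, the field names, and the function name `sort_by` are all assumptions for illustration, and dates are assumed to be stored as YYYY-MM-DD text so that they sort correctly as plain strings.

```python
# Made-up records; the field names are assumptions for illustration.
DOCUMENTS = [
    {"title": "Lease", "author": "Rivera", "subject": "Real estate", "date": "2022-03-09"},
    {"title": "Board Minutes", "author": "Adams", "subject": "Governance", "date": "2021-06-17"},
    {"title": "Settlement Agreement", "author": "Okafor", "subject": "Litigation", "date": "2023-11-02"},
]

SORTABLE_FIELDS = ("date", "author", "subject")


def sort_by(documents, field):
    """Return a new list sorted by the chosen field, ignoring letter case."""
    if field not in SORTABLE_FIELDS:
        raise ValueError(f"Unsupported sort field: {field}")
    return sorted(documents, key=lambda doc: doc[field].lower())


if __name__ == "__main__":
    for field in SORTABLE_FIELDS:
        titles = [doc["title"] for doc in sort_by(DOCUMENTS, field)]
        print(f"By {field}: {titles}")
```

Rejecting unknown field names up front gives a clear message instead of a confusing error deeper in the program.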
Congratulations, You’re a Programmer!
You’ve just created your first e-discovery program from scratch! Like Green Lantern, you’ve harnessed your creativity as a superpower, with generative AI as your guiding tool. Now, with these skills, you can build solutions, automate tasks, and open doors to endless possibilities. Remember, the only limit is your imagination, so keep experimenting, creating, and pushing boundaries.
Want to develop a desktop or iPhone app to share with friends or colleagues? Just prompt the LLM to write it in Swift or C#. Looking for a touch of elegance? Ask it to “create an interface in the style of an Apple app.”
The possibilities are yours to explore.
Save Your Code on GitHub for the E-Discovery Community
As you start building these small programs, we encourage you to save your work on GitHub. Here, you can share your scripts, collaborate with others, and see what others are building in the e-discovery community.
The sample code above is published in the following public GitHub repository:
https://github.com/LTProgrammer/eDiscovery
https://github.com/LTProgrammer/eDiscovery/blob/main/reorganize_docs_by_date.py
https://github.com/LTProgrammer/eDiscovery/blob/main/reorganize_docs_by_date_v2.py
Jay Leib and Ye Chen are industry veterans in e-discovery and technology innovation, having worked together in the field for 25 years. Jay is the chief strategy and innovation officer at Reveal, a leading AI company in e-discovery. Ye is the AI solution architect at Reveal, specializing in the integration of advanced machine learning techniques to streamline and enhance legal workflows. To reach the authors, or if you’d like to contribute to the e-discovery public GitHub repository, email: [email protected].
NOT FOR REPRINT
© 2024 ALM Global, LLC, All Rights Reserved. Request academic re-use from www.copyright.com. All other uses, submit a request to [email protected]. For more information visit Asset & Logo Licensing.