Tracking what data in used in a machine learning data set can be a daunting task, but as regulations and public scrutiny intensifies, lawyers say it could be a useful tool to stay compliant.

Last week Facebook Inc. joined other developers in the quest to better trace the data used in data sets. In a research paper titled “Radioactive Data: Tracing Through Training” Facebook announced a new method to trace images used in data set for training software.