Scrubbing and Cleaning of Sensitive Data Workshop
Kelly Bakulski, Jonathan Reader, and Nicolas May will present the workshop at the data science annual symposium
November 10, 2020
2:45 am - 4:15 am
Online in Zoom
Sponsored by: Michigan Institute for Data Science (MIDAS)
Contact Information: Kristin Burgard, burgardk@umich.edu
More Information & Registration
Before analysis, data must be retrieved, scrubbed of identifiable information, cleaned (e.g., address missing data, reshape appropriately), and delivered. Using biomedical and transportation datasets as examples of how this generalizable process works, this workshop will walk attendees through a real-world pipeline used to process and deliver datasets. Documentation and code will be made available through GitLab to allow for coding along with the demonstration. As a result of this workshop, attendees will leave with a practical template for implementing their own a data science pipeline.