Real-world data is rarely clean. It often contains more information than is needed, columns may be mislabeled, and values may be missing or in the wrong format. This project requires you to use most of what you have learned to fix these problems in a real-world data set.
See the assignment details in the Mini-Project-Part-1 and Mini-Project-Part-2 subfolders.
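
As a rough illustration of the kinds of problems described above, the sketch below cleans a tiny made-up CSV using only the Python standard library. Everything in it is an assumption for illustration: the column names, the sample rows, the `RENAME` and `KEEP` tables, and the choice to read `04/05/2021` as month/day/year. It is not the project's data set or the expected solution, and the assignment may call for different tools.

```python
import csv
import io
from datetime import datetime

# Hypothetical raw data showing the four problems: an unneeded column
# ("internal_id"), a mislabeled column ("Nmae"), a missing value (Bob's age),
# and dates in inconsistent formats.
RAW_CSV = """internal_id,Nmae,signup_date,age
17,Alice,2021-03-04,34
18,Bob,04/05/2021,
19,Carol,2021-06-07,29
"""

RENAME = {"Nmae": "name"}               # mislabeled header -> intended header
KEEP = ["name", "signup_date", "age"]   # columns to keep; all others are dropped
DATE_FORMATS = ["%Y-%m-%d", "%m/%d/%Y"]  # assumed formats, tried in order


def parse_date(text):
    """Return the date as an ISO string, or None if no assumed format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    return None


def clean_rows(raw_text):
    """Rename columns, drop extras, and normalize dates and missing ages."""
    reader = csv.DictReader(io.StringIO(raw_text))
    cleaned = []
    for row in reader:
        # Fix mislabeled column names.
        row = {RENAME.get(key, key): value for key, value in row.items()}
        # Keep only the columns that are needed.
        row = {key: row[key] for key in KEEP}
        # Put every date into one format.
        row["signup_date"] = parse_date(row["signup_date"])
        # Represent a missing age as None instead of an empty string.
        row["age"] = int(row["age"]) if row["age"].strip() else None
        cleaned.append(row)
    return cleaned


rows = clean_rows(RAW_CSV)
for row in rows:
    print(row)
```

Marking missing values as `None` rather than guessing a replacement keeps the cleaning step honest; whether to fill, drop, or flag such rows depends on the assignment.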