Refer to the folder files_for_activities
for this exercise. You can continue using the same jupyter notebook that you craeted in class (where you worked on file1 and file2). Please save your work (jupyter notebook) as we will build on the same activities later.
- Load data (
file3.xlsx
andfile4.xlsx
) in a new Jupyter notebook. You might face the error saying that optional dependency xlrd is missing. In that case, they should install it usingpip
. If you don't get the error, move to the next step. 😄 - Print data columns for both files.
- Check the names and order of columns in the files, and compare them with the "data" DataFrame created in class.
- Change the names of required columns in the new dataframes read before concatenating the files with data.
- Change data columns from uppercase to lowercase.
We will merge the dataframes in the next exercise