Diploma in Data Science and Machine Learning

Data Exploration and Data Cleaning Assignment (in Spanish)

We worked with the Kaggle competition dataset for estimating property sale prices in Melbourne, Australia, specifically the reduced version produced by DanB. In the first part, we performed exploratory data analysis, detected outliers, selected features, and created new features by merging with an Airbnb dataset. In the second part, we encoded the variables, tried different imputation methods for the columns with missing values, and applied PCA to reduce the dimensionality.
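
The second part (encoding, imputation, PCA) can be sketched roughly as follows. This is a minimal illustration, not the code from the assignment: it assumes pandas and scikit-learn, uses a tiny made-up table in place of the real data, and the column names, the median imputation strategy, and the choice of two components are all assumptions made for the example.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.decomposition import PCA
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Toy stand-in for the housing data; all values here are made up.
toy = pd.DataFrame({
    "Rooms": [2, 3, 4, 3, 2, 5],
    "BuildingArea": [79.0, np.nan, 142.0, 110.0, np.nan, 210.0],
    "YearBuilt": [1900.0, 1970.0, np.nan, 2005.0, 1985.0, 2014.0],
    "Type": ["h", "u", "h", "t", "u", "h"],
})

numeric_cols = ["Rooms", "BuildingArea", "YearBuilt"]  # assumed numeric columns
categorical_cols = ["Type"]                            # assumed categorical column

# Numeric columns: fill missing values with the column median, then standardize
# so that PCA is not dominated by the columns with the largest scale.
numeric_steps = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
])

# Categorical columns: one-hot encode. sparse_threshold=0.0 makes the
# ColumnTransformer always return a dense array, which PCA can consume directly.
preprocess = ColumnTransformer(
    [
        ("num", numeric_steps, numeric_cols),
        ("cat", OneHotEncoder(handle_unknown="ignore"), categorical_cols),
    ],
    sparse_threshold=0.0,
)

# Chain preprocessing and PCA so the same steps apply to any new data.
model = Pipeline([
    ("preprocess", preprocess),
    ("pca", PCA(n_components=2)),
])

components = model.fit_transform(toy)
explained = model.named_steps["pca"].explained_variance_ratio_

print(components.shape)
print(explained)
```

Swapping `SimpleImputer` for another imputer (for example `KNNImputer`) inside `numeric_steps` is one way to compare imputation methods while keeping the rest of the pipeline unchanged.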