Kaggle-House-Prices-Advanced-Regression-Techniques

I went through all the fundamental steps you need to preprocess a large dataset and then used the Linear Regression model.

Deep Neural Networks were used to beat the Mean Absoulte Error of the baseline model.

The dataset can be downloaded here.

Use the Pandas Profiling notebook only if you want to learn it, else use the "01_Linear_Regression.ipynb" file.

This notebook is divided into 5 portions:

`1. Pandas Profiling :`

I used the built in Pandas Profiling to generate a profiling report in Colab Notebook.

`2. Feature Selection :`

Feature selection was done based on missing values, feature correlation and Backward Elimination. All these methods are described briefly.

`3. Data Preprocessing :`

Missing values were filled using mean and categorical columns were coded using cat.code.

`4. Visualiztion :`

Just a trivial visualization of the value distribution among all the columns.

`5. Modeling :`

A Linear Regression model was used to fit the preprocessed data and then then I used Mean Absolute Error and Mean Squared Error as evaluation methods.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Deep Neural Networks		Deep Neural Networks
01_Linear_Regression.ipynb		01_Linear_Regression.ipynb
02_Linear_Regression_With_Pandas_Profiling.ipynb		02_Linear_Regression_With_Pandas_Profiling.ipynb
PreprocessedDataset.csv		PreprocessedDataset.csv
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Kaggle-House-Prices-Advanced-Regression-Techniques

`1. Pandas Profiling :`

`2. Feature Selection :`

`3. Data Preprocessing :`

`4. Visualiztion :`

`5. Modeling :`

Spoiler Alert: Mean Absolute Error stood at 21k against an average Sale Price value of 180k.

About

Uh oh!

Releases

Packages

Languages

Abuzariii/Kaggle-House-Prices-Advanced-Regression-Techniques

Folders and files

Latest commit

History

Repository files navigation

Kaggle-House-Prices-Advanced-Regression-Techniques

1. Pandas Profiling :

2. Feature Selection :

3. Data Preprocessing :

4. Visualiztion :

5. Modeling :

Spoiler Alert: Mean Absolute Error stood at 21k against an average Sale Price value of 180k.

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`1. Pandas Profiling :`

`2. Feature Selection :`

`3. Data Preprocessing :`

`4. Visualiztion :`

`5. Modeling :`

Packages