This project predicts the final sale price of a house. The data comes from Kaggle and consists of 1460 observations with 81 variables: the predictors describe various features of the house, and the single output variable is 'Sale Price'. Data cleaning includes introducing a new class for missing categorical values and imputing missing numerical values with the mean. Plots such as scatter plots, violin plots, box plots, and bar graphs are used to explore the relationships between 'Sale Price' and the predictors. Linear Regression, Ridge Regression, and Lasso Regression are fitted to identify the predictors whose positive or negative coefficients influence the final sale price. Cross-validation is used to compare the algorithms by RMSE (root mean squared error), with the lowest score indicating the best one. A regression plot and residual plots visualize the model's performance on the test data.
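The cleaning and model-comparison steps described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the repository's code: the toy numbers are made up, every function name is hypothetical, and a single-predictor closed-form ridge fit stands in for the multi-feature models the project trains.

```python
import math
from statistics import mean


def impute_numeric(values):
    """Replace missing (None) numeric entries with the mean of the observed ones."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def impute_categorical(values, label="Missing"):
    """Treat a missing category as a class of its own (label is an assumption)."""
    return [label if v is None else v for v in values]


def rmse(y_true, y_pred):
    """Root mean squared error between observed and predicted values."""
    return math.sqrt(mean((t - p) ** 2 for t, p in zip(y_true, y_pred)))


def fit_ridge_1d(x, y, alpha):
    """Closed-form ridge fit for one predictor; the intercept is not penalised."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / (sxx + alpha)
    return slope, y_bar - slope * x_bar


def cv_rmse(x, y, alpha, k=5):
    """Average held-out RMSE over k folds (every k-th row forms one fold)."""
    n = len(x)
    scores = []
    for fold in range(k):
        test_idx = set(range(fold, n, k))
        train_idx = [i for i in range(n) if i not in test_idx]
        slope, intercept = fit_ridge_1d(
            [x[i] for i in train_idx], [y[i] for i in train_idx], alpha
        )
        preds = [slope * x[i] + intercept for i in sorted(test_idx)]
        scores.append(rmse([y[i] for i in sorted(test_idx)], preds))
    return mean(scores)


# Made-up toy data, not taken from the Kaggle data set.
area = impute_numeric([1500.0, None, 2100.0, 1200.0, 1800.0, 2400.0])
quality = impute_categorical(["Gd", None, "Ex", "TA", None, "Gd"])
price = [171000.0, 195000.0, 232000.0, 139000.0, 203000.0, 258000.0]

# Pick the regularisation strength with the lowest cross-validated RMSE.
candidate_alphas = [0.0, 1.0, 10.0]
best_alpha = min(candidate_alphas, key=lambda a: cv_rmse(area, price, a, k=3))
print("best alpha:", best_alpha)
```

Scoring each candidate on held-out folds rather than on the training rows is what makes the RMSE values comparable across algorithms and penalty strengths.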
Explored the relationships between various features and the sale price of a house using exploratory data analysis and statistical analysis. Applied ML algorithms such as Multiple Linear Regression, Ridge Regression and Lasso Regression in combination with cross validation. Performed parameter tuning, compared the test scores and s…
NikhilaThota/CapstoneProject_House_Prices_Prediction
Topics
python
machine-learning
correlation
linear-regression
cross-validation
data-visualization
data-extraction
data-analysis
regularization
standardization
datawrangling
predictive-modeling
ridge-regression
data-exploration
k-fold
lasso-regression
encoding-library
parameter-tuning
root-mean-squared-error-metric
regression-analysis