GitHub - Data5ci3nc3/Walmart_sales: Build a machine learning model able to predict the weekly sales in Walmart stores with the best possible precision.

Walmart sales project

Build a machine learning model able to predict the weekly sales in Walmart stores with the best possible precision.

Conduct exploratory data analysis (EDA) and make all necessary preprocessings to prepare data for machine learning
Train a linear regression model (baseline)
Avoid overfitting by training a regularized regression model

In Machine learning, depending on the problem to be solved, basic models can sometimes outperform more complex ones.
Training a model with a small amount of data will produce misleading scores. It is not beacause our model gives a R2 score of 98 % at the first try that we should think that the model is the best model ever. We should investigate and try to understand what is going on under the hood. The three models designed for the purpose of this project perform more or less correctly despite the fact that they have been trained with limited data. Nevertheless, training these models with more data would have helped them generalize better - prevent overfitting - and make better predictions.
Machine learning being an iterative process and no model being a priori guaranteed to work better, we can experiment with different models to see if they produce better scores. It is better to run the model and record its scores rather than making assumptions about a particular model's performance.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
01-Walmart_completed.ipynb		01-Walmart_completed.ipynb
README.md		README.md