This code runs with the libraries bundled in the default Anaconda distribution. The main modules used are:
- Pandas
- NumPy
- scikit-learn (imported as `sklearn`)
- Matplotlib
- Collections
- Seaborn
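If you are not using Anaconda, a quick way to confirm that the modules listed above are available is a small check like the one below. This is only a minimal sketch: the names `REQUIRED_MODULES` and `find_missing` are made up for illustration and are not part of the notebooks.

```python
import importlib.util

# Import names for the modules listed above.
# Assumption: scikit-learn is checked under its import name "sklearn".
REQUIRED_MODULES = ["pandas", "numpy", "sklearn", "matplotlib", "collections", "seaborn"]


def find_missing(modules):
    """Return the names in `modules` that cannot be found in the current environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are available.")
```

Any name it reports as missing can then be installed with your package manager of choice before opening the notebooks.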
This project was completed as part of the Data Scientist Nanodegree offered by Udacity.
After exploring Stack Overflow's developer survey dataset, I wanted to answer the following questions:
- What methods are being suggested to help people dive into this field?
- What are the salary trends?
- Which mode of formal education plays a vital role?
- Did bootcamps have any impact on developers' careers?
This repository contains two Python notebooks:
- DiveIntoTheField - Answers the four questions above (EDA)
- Salary_Prediction - Predicts salary based on the given features
The main findings of the analysis can be found in the post here.
The optimization approach I followed is described in the post here.
Feel free to use the above code as you would like!