The Sparks Foundation Data Science and Business Analytics internship tasks repository.
TASK 1: Prediction of Student Scores
Predict a student's score from the number of hours studied. A linear regression model is fitted on the independent variable (Hours) to predict the dependent variable (Scores), and is evaluated with goodness-of-fit measures: R² and mean squared error (MSE). The fitted model is then used to answer the question: what is the predicted score if a student studies for 9.25 hrs/day?
Data can be found at: https://raw.githubusercontent.com/AdiPersonalWorks/Random/master/student_scores%20-%20student_scores.csv
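The regression and its evaluation can be sketched with the standard library alone. This is a minimal illustration, not the notebook's actual code: the function names `fit_line`, `evaluate`, and `load_scores` are hypothetical, the local file name in the usage comment is a placeholder, and the CSV column names `Hours` and `Scores` are assumed from the variable names in the task description.

```python
import csv


def fit_line(hours, scores):
    """Ordinary least squares for one predictor: returns (slope, intercept)."""
    n = len(hours)
    mean_x = sum(hours) / n
    mean_y = sum(scores) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(hours, scores))
    sxx = sum((x - mean_x) ** 2 for x in hours)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def evaluate(hours, scores, slope, intercept):
    """Return (R^2, MSE) of the fitted line on the given data."""
    n = len(scores)
    mean_y = sum(scores) / n
    predicted = [intercept + slope * x for x in hours]
    ss_res = sum((y - p) ** 2 for y, p in zip(scores, predicted))
    ss_tot = sum((y - mean_y) ** 2 for y in scores)
    return 1 - ss_res / ss_tot, ss_res / n


def load_scores(path):
    """Read a local copy of the CSV; column names are an assumption."""
    with open(path, newline="") as f:
        rows = list(csv.DictReader(f))
    return [float(r["Hours"]) for r in rows], [float(r["Scores"]) for r in rows]


# Usage (assumes the CSV was downloaded as "student_scores.csv"):
# hours, scores = load_scores("student_scores.csv")
# slope, intercept = fit_line(hours, scores)
# r2, mse = evaluate(hours, scores, slope, intercept)
# print(intercept + slope * 9.25, r2, mse)
```

A library such as scikit-learn would do the same job in fewer lines; the hand-written version is used here only to make the R² and MSE definitions explicit.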
TASK 2: Prediction using Unsupervised ML
Problem Statement: From the given ‘Iris’ dataset, predict the optimum number of clusters and represent it visually.
Data can be found at: https://drive.google.com/file/d/11Iq7YvbWZbt8VXjfm06brx66b10YiwK-/view
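One common way to choose the number of clusters is the elbow method: run k-means for several values of k and look for the point where the within-cluster sum of squares (WCSS) stops dropping sharply. The sketch below assumes that approach; the task statement does not name a method. The helper names (`sq_dist`, `init_centroids`, `kmeans_wcss`, `elbow_curve`) are hypothetical, and the deterministic farthest-point initialisation is a simplification chosen for reproducibility rather than the usual random or k-means++ start.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def init_centroids(points, k):
    """Farthest-point initialisation (an assumption, chosen to be deterministic)."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Pick the point whose nearest existing centroid is farthest away.
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )
    return centroids


def kmeans_wcss(points, k, max_iter=100):
    """Run Lloyd's algorithm and return the within-cluster sum of squares."""
    centroids = init_centroids(points, k)
    for _ in range(max_iter):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Recompute each centroid as its cluster mean; keep it if the cluster is empty.
        new_centroids = [
            tuple(sum(col) / len(cl) for col in zip(*cl)) if cl else centroids[i]
            for i, cl in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return sum(min(sq_dist(p, c) for c in centroids) for p in points)


def elbow_curve(points, max_k):
    """WCSS for k = 1..max_k; the 'elbow' of this curve suggests a value of k."""
    return [kmeans_wcss(points, k) for k in range(1, max_k + 1)]
```

For the Iris data, `points` would be a list of tuples holding the four numeric measurements of each flower. Plotting the returned values against k with any plotting library gives the visual representation the task asks for.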
TASK 3: Exploratory Data Analysis - Retail
● Perform ‘Exploratory Data Analysis’ on dataset ‘SampleSuperstore’
● As a business manager, try to find out the weak areas where you can work to make more profit.
● What business problems can you derive by exploring the data?
Dataset: https://drive.google.com/file/d/1lV7is1B566UQPYzzY8R2ZmOritTW299S/view
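A first step toward spotting weak areas is to total profit per group and list the least profitable groups first. The sketch below is one possible starting point, not the analysis itself: the function names are hypothetical, and the column names `Profit` and `Sub-Category` are assumptions about the SampleSuperstore file that should be checked against its header row.

```python
import csv
from collections import defaultdict


def load_rows(path):
    """Read a local copy of the dataset into a list of dicts."""
    with open(path, newline="") as f:
        return list(csv.DictReader(f))


def profit_by(rows, key):
    """Total profit per value of `key`, sorted from least to most profitable."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += float(row["Profit"])  # "Profit" column is an assumption
    return sorted(totals.items(), key=lambda item: item[1])


# Usage (assumes the file was downloaded as "SampleSuperstore.csv"):
# rows = load_rows("SampleSuperstore.csv")
# for name, total in profit_by(rows, "Sub-Category")[:5]:
#     print(name, round(total, 2))
```

Groups with a negative total are natural candidates for a closer look, for example by repeating the breakdown on another column.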
TASK 4: Exploratory Data Analysis - Terrorism
● Perform ‘Exploratory Data Analysis’ on dataset ‘Global Terrorism’.
● As a security/defense analyst, try to find out the hot zone of terrorism.
● What security issues and insights can you derive from EDA?
Dataset: https://bit.ly/2TK5Xn5
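A simple way to locate hot zones is to count incidents per region or country and rank the results. The sketch below assumes each row of the dataset is one incident; the function names are hypothetical, and both the column name `region_txt` and the `latin-1` file encoding are assumptions to verify against the downloaded file.

```python
import csv
from collections import Counter


def load_rows(path, encoding="latin-1"):
    """Read a local copy of the dataset; the encoding is an assumption."""
    with open(path, newline="", encoding=encoding) as f:
        return list(csv.DictReader(f))


def hot_zones(rows, key, top=10):
    """Most frequent values of `key`, as (value, incident_count) pairs."""
    return Counter(row[key] for row in rows if row.get(key)).most_common(top)


# Usage (assumes the file was downloaded as "global_terrorism.csv"):
# rows = load_rows("global_terrorism.csv")
# for region, count in hot_zones(rows, "region_txt"):
#     print(region, count)
```

The same function can be pointed at a country, city, or year column to narrow a regional hot zone down further.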