Skip to content

pagalboy00/OIBSIP

Repository files navigation

OIBSIP - Oasis Infobyte Intern Program

OASIS INFOBYTE offers a variety of Data Science,Machine learning and AI services. Got the chance for 1 month intern of Data Science. There are 5 task given to complete. After completing 3 tasks, if everything looks fine, the intern will be certified. Glad to be a part of OASIS INFOBYTE intern program.

Task1 - Iris Flower classification

The aim of the iris flower classification is to predict flowers based on their specific features by help of machine learning models.

• The project code completely done using Python

• This project dataset contains iris dataset iris.csv link :

  https://www.kaggle.com/datasets/saurabh00007/iriscsv

• Required packages are for this project is pandas, numpy, matplotlib, seaborn, pickle, sklearn, Logistic Regression, Decision Tree, K-Neighbors, Naïve Bayes, SVC.

• Model trained and tested with supportive models like Decision Tree, K-Neighbors, Naïve Bayes, SVC

Task2 - Unemployment Prediction

Unemployment Analysis with machine learning .Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19.

• The project code completely done using Python

• Required packages installed, that are pandas, numpy, plotly, seaborn, matplotlib, calendar, datetime.

• This project dataset contains Unemployment in India dataset Unemployment in India.csv and Unemployment_Rate_upto_11_2020.csv link :

 https://www.kaggle.com/datasets/gokulrajkmv/unemployment-in-india

Task3 - Car Price Prediction

To be able to predict used cars market value can help both buyers and sellers. There are lots of individuals who are interested in the used car market at some points in their life because they wanted to sell their car or buy a used car. In this process, it’s a big corner to pay too much or sell less then it’s market value.

• The project code completely done using Python

• Required packages installed, that are pandas, sklearn, keras_tuner, seaborn, tensorflow, tabulate, matplotlib, LinearRegression, RandomForestClassifier, metrics.

• This project dataset contains CarPrice dataset CarPrice.csv link :

  https://github.com/amankharwal/Website-data/blob/master/CarPrice.csv

Task4 - Email Spam Detection

One of the primary methods for spam mail detection is email filtering. It involves categorize incoming emails into spam and non-spam. Machine learning algorithms can be trained to filter out spam mails based on their content and metadata.

• The project code completely done using Python

• Required packages installed, that are pandas, re, nltk, sklearn, seaborn, matplotlib, missingno, wordcloud, collections,Logistic Regression, Decision Tree,RandomForestClassifier, K-Neighbors, Naïve Bayes, SVC.

• This project dataset contains spam dataset spam.csv link :

  https://www.kaggle.com/datasets/uciml/sms-spam-collection-dataset

Task5 - Sales Prediction

Predicting the sales of a store

• The project code completely done using Python

• Required packages installed, that are pandas, numpy, seaborn, matplotlib, sklearn, LinearRegression.

• This project dataset contains Sales dataset Advertising.csv link :

  https://www.kaggle.com/datasets/bumba5341/advertisingcsv