Skip to content

uminskib/Car_price_prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Car price prediction - data science project

Introduction

The repository contains a data science project based on a database that comes from the Kaggle The database, called Car Features and MSRP, contains 11914 car models sold in the U.S. between 1990 and 2018.Each car is described by 16 variables. The main goal of the study is to create a model that best predicts car prices based on the information collected. The structure of the project is as follows:

  1. Introduction
  2. Development environment preparation
  3. Data structure and data cleaning
  4. Exploratory and Data analysis
  5. Feature selection
  6. Modelling
  7. Conclusion

Project created to test and solidify skills in using pandas, scikit-learn and seaborn tools. It constitutes a project for portfolio.

Technologies

  • Python 3.9.2
  • pandas 1.1.3
  • numpy 1.19.1
  • scikit-learn 0.24.1
  • seaborn 0.11.0
  • matplotlib 3.3.3

Rest of required packages in requirements.txt file

Setup

To run the project:

  1. Download the entire repository and unzip it.
  2. Install the required packages included in the requirements.txt file:
    • Launch any command-line interface (e.g. Anaconda Prompt).
    • If you have one, set up a custom virtual environment for the program in which the project will run.
    • Set the destination path to the folder with the project: "cd 'your destination path to project'"
    • Type "pip install -r requirements.txt".
  3. Start Jupyter Notebook or LupyterLab and select the file Car_price_notebook.ipynb
  4. Run all cells

Application developed and tested in Jupyter Notebook.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published