The primary objective of this project is to develop a pricing model that can effectively predict the price of used cars and can help the business devise profitable strategies using differential pricing.
- 🤓 Description
- 💻 Dataset Overview
- 🛠️ Feature Engineering
- 📊 Exploratory Data Analysis
- 🏗️ Model Building
- ✨ Recommendations
- 📗 Notebooks
- 📧 Contact Information
There is a huge demand for used cars in the Indian Market today. As sales of new cars have slowed down in the recent past, the pre-owned car market has continued to grow over the past years and is larger than the new car market now. Cars4U is a budding tech start-up that aims to find footholes in this market.
In 2018-19, while new car sales were recorded at 3.6 million units, around 4 million second-hand cars were bought and sold. There is a slowdown in new car sales and that could mean that the demand is shifting towards the pre-owned market. In fact, some car sellers replace their old cars with pre-owned cars instead of buying new ones. Unlike new cars, where price and supply are fairly deterministic and managed by OEMs (Original Equipment Manufacturer / except for dealership level discounts which come into play only in the last stage of the customer journey), used cars are very different beasts with huge uncertainty in both pricing and supply. Keeping this in mind, the pricing scheme of these used cars becomes important in order to grow in the market.
As a data analyst at Cars4U, you have to come up with a pricing model that can effectively predict the price of used cars and can help the business in devising profitable strategies using differential pricing. For example, if the business knows the market price, it will never sell anything below it.
Objectives
- Explore and visualize the dataset.
- Build a linear regression model to predict the prices of used cars.
- Generate a set of insights and recommendations that will help the business.
The dataset source file can found through the following link:
The used cars database contains 14 variables. The data dictionary below explains each variable:
Data Dictionary
S.No.
: Serial NumberName
: Name of the car which includes Brand name and Model nameLocation
: The location in which the car is being sold or is available for purchase CitiesYear
: Manufacturing year of the carKilometers_driven
: The total kilometers driven in the car by the previous owner(s) in KM.Fuel_Type
: The type of fuel used by the car. (Petrol, Diesel, Electric, CNG, LPG)Transmission
: The type of transmission used by the car. (Automatic / Manual)Owner
: Type of ownershipMileage
: The standard mileage offered by the car company in kmpl or km/kgEngine
: The displacement volume of the engine in CC.Power
: The maximum power of the engine in bhp.Seats
: The number of seats in the car.New_Price
: The price of a new car of the same model in INR Lakhs.(1 Lakh = 100, 000)Price
: The price of the used car in INR Lakhs (1 Lakh = 100, 000)
There was a significant amount of data pre-processing required prior data visualization. These steps can be seen in the following section.
The step by step data cleaning and wrangling can be observed in this section
The Univariate and Bivariate analysis can be seen here.
The data model preparation and linear regression steps can be seen here.
- Engine Size: Consider offering a range of engine sizes to cater to different customer preferences. Smaller engines are typically more fuel-efficient, so emphasize their benefits for cost-conscious buyers. For those looking for more power, highlight the advantages of larger engines in terms of performance.
- Car Category: Focus on popular car categories in your region. If compact cars are in demand, ensure you have a diverse selection of models in that category. Promote the benefits of each category, such as fuel efficiency for compact cars and spaciousness for SUVs.
- Region: Tailor your inventory and marketing to match regional preferences. For example, if SUVs are popular in suburban areas, stock a variety of SUV models and emphasize their suitability for family and outdoor activities.
- Fuel Type: Offer cars with different fuel types, including gasoline, diesel, and hybrid options. Highlight the cost savings and environmental benefits of fuel-efficient and hybrid vehicles, especially in regions where eco-friendliness is a priority.
- Mileage: Clearly communicate the mileage and maintenance history of each vehicle. Lower mileage vehicles can be priced higher, so ensure prospective buyers have access to this information. Offer special promotions or warranties for low-mileage cars.
The Notebook for the "Data Exploration" can be accessed below:
The Notebook for the "Feature Engineering" can be accessed below:
The Notebook for the "Exploratory Data Analysis" can be accessed below:
The Notebook for the "Model Building" can be accessed below:
- Email: sean_dhanasar@msn.com
- LinkedIn: Sean Dhanasar