This project is an exploratory data analysis (EDA) of Airbnb listings in Texas. The primary objective is to identify key factors influencing Airbnb pricing and understand variations in pricing across different cities in Texas. The analysis focuses on variables such as room type, house type, location, property characteristics, and ratings.
- Price Influencers: Examine how factors like room type, house type, location, and property characteristics influence Airbnb prices.
- City-wise Pricing: Investigate price variations across different cities to identify contributing factors.
- Top Hosts: Identify top hosts based on ratings.
- Popular House Types: Determine the most popular house types in the dataset.
The dataset used for this analysis is sourced from Kaggle and consists of 14,861 observations and 18 variables. After data cleaning, it comprises 13,246 observations and 21 variables.
- Addressed missing values and removed outliers.
- Created new variables to capture specific details like location, house type, ratings, bedrooms, and baths.
- Used summary statistics and data visualization techniques to explore patterns, outliers, and relationships.
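The cleaning steps above can be illustrated with a minimal base R sketch. It is not the project's actual script: the toy data, the column names (`name`, `price`), the 1.5 × IQR outlier rule, and the idea of parsing house type and city out of a "<type> in <city>" listing title are all assumptions made for illustration.

```r
# Toy stand-in for the raw listings; column names and values are hypothetical.
listings <- data.frame(
  name  = c("Home in Austin", "Condo in Dallas", "Rental unit in Houston",
            "Home in Austin", "Home in Bee Cave", "Condo in Austin"),
  price = c(120, 95, NA, 150, 5000, 110),
  stringsAsFactors = FALSE
)

# 1. Drop rows with a missing price.
cleaned <- listings[!is.na(listings$price), ]

# 2. Remove price outliers with the 1.5 * IQR rule (an assumed choice;
#    the source does not say which outlier rule was used).
q     <- unname(quantile(cleaned$price, c(0.25, 0.75)))
fence <- 1.5 * (q[2] - q[1])
cleaned <- cleaned[cleaned$price >= q[1] - fence &
                   cleaned$price <= q[2] + fence, ]

# 3. Derive new variables from the listing title, assumed to follow
#    the pattern "<house type> in <city>".
cleaned$house_type <- sub(" in .*$", "", cleaned$name)
cleaned$city       <- sub("^.* in ", "", cleaned$name)

print(cleaned)
```

Creating derived columns in this way is one route by which a cleaned dataset can end up with more variables than the raw one, even as filtering reduces the number of observations.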
- Room Type Influence: Different room types exhibit varying price trends, with "Entire home/apt" listings generally being the most expensive.
- Location Impact: Prices vary significantly across locations: Bee Cave has the highest prices, while Austin is budget-friendly for shared-room bookings.
- Bedrooms and Baths: Listings with 3-5.5 bathrooms and 4-8 bedrooms tend to have higher prices.
- Ratings and Reviews: Neither ratings nor the number of reviews shows a strong linear correlation with price, though some patterns are noticeable.
- Neighborhood Effect: Neighborhoods significantly influence listing prices, with ZIP code 78712 having the highest mean price and 78719 the lowest.
- Popular House Types: "Home" is the most popular house type, followed by "Rental Units" and "Condos."
- Top Hosts: The top 10 hosts are based in Austin and have perfect ratings.
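Comparisons like the ones behind these findings can be sketched in base R as a grouped mean and a correlation check. The data frame below is a made-up toy example, and the column names (`room_type`, `price`, `rating`, `reviews`) are hypothetical; its numbers are not results from the project.

```r
# Hypothetical toy data; column names and values are illustrative only.
listings <- data.frame(
  room_type = c("Entire home/apt", "Private room", "Shared room",
                "Entire home/apt", "Private room", "Entire home/apt"),
  price     = c(200, 80, 40, 260, 90, 180),
  rating    = c(4.8, 4.9, 4.5, 4.7, 5.0, 4.6),
  reviews   = c(35, 120, 8, 60, 15, 200),
  stringsAsFactors = FALSE
)

# Mean price per room type, sorted from most to least expensive.
mean_by_room <- aggregate(price ~ room_type, data = listings, FUN = mean)
mean_by_room <- mean_by_room[order(-mean_by_room$price), ]
print(mean_by_room)

# Pearson correlations of price with rating and review count.
price_cor <- cor(listings[, c("price", "rating", "reviews")])["price", ]
print(round(price_cor, 2))
```

The same `aggregate()` call with a city or neighborhood column in place of `room_type` would give the per-location means. Pearson correlation only captures linear association, so a weak coefficient does not rule out the non-linear patterns mentioned above.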
Room type, house type, location, neighborhood, and the numbers of bedrooms and bathrooms are significant determinants of Airbnb prices in Texas. Ratings and reviews have some influence on pricing, but the correlation is not strong. Future research could expand the dataset to include more cities and longer time frames.
- data/: Contains the dataset used for analysis.
- scripts/: Includes R scripts for data cleaning, EDA, and visualization.
- output/: Contains the results of the analysis, including visualizations and summary statistics.
- Niharika Patil
- Pratiksha Gadhe
- Yashi Agarwal