- Marijose Cavazos
- Javier Robles
- Roberto Barron
Main question: Which US State has a steeper tendency of hybrid and electric cars usage to be able to answer the question: What state would be more convinient to build an electric car dealership?
Is there a correlation between electric car buying and demographic characteristics of the population suchs as:
- education level
- mean income
- age?
Is there a correlation between electric car buying and political factors such as:
- fuel state tax
- political preference
- state incentives?
Is there a correlation between electric car buying and charging stations availability?
Exhaustive research and analysis about electric and hybrid cars usage along the US comparing to different variables. We decided to use different variables and analyze them to see which variable had a correlation >0.60 and use those variables only to get the final results and be able to answer our first question. We had so many options to study and concluded to only study 5 of them to reduce project development time. The variables to be studied to start with were: education, political (taxes, democrat or republican, and state incentives), age by state, income per capita,
- Demographic information: US CENSUS
- Vehicle Counts by State: Vehicle Registration Counts by State
- Charging Stations and law incentives: US Department of Energy API
- Gas prices: U.S Energy Information Administration
- Gas tax by state: IFEN tax by state
- Presidential Elections since 1976: Harvard Dataverse
- Grouped Electric and Hybrid Electric vehicles into "Alternative vehicles"
- Grouped Diesel and Gasoline vehicles into "Fuel vehicles"
- Measured state proportion of Alternative vehicles over Total vehicles.
% of Alternative Vehicles Rate was used to find correlation with other factors.
We used our variables as different perspectives that we could use to measure the probability of our new location being succesfull or not. We used different variables to make our analysis well-rounded.
- The correlation between'Unfinished High School (%)' and alternative car usage is 0.32 with a p-value of 0.02368063575363978
- The correlation between'Finished High School (%)' and alternative car usage is -0.32 with a p-value of 0.02368063575363965
- The correlation between'Finished Grad School (%)' and alternative car usage is 0.73 with a p-value of 9.178719718199633e-10
- The correlation between'Finished Post-grad School (%)' and alternative car usage is 0.74 with a p-value of 4.689613484272619e-10
These results tell us that:
- States where a bigger portion of the population achieved a Graduate or Post-Graduate Degree are more likely to have higher rates of Alternative Vehicles, making this a good predictor. We will be using these factors in our final results.
- We found that the rates of people that only obtained High School degrees or less has a low correlation with a state having more alternative vehicles. We won't be using these factors in our final results
The correlation between Income Per Capita and Alternative Rate(%) is 0.73 with a p-value of 1.580506370637526e-09
F_onewayResult(statistic=8.11512860974485, pvalue=0.000920529748181304)
The state with the highest income per capita is District of Columbia ($63793.00)
- The correlation between'21-30 year olds' and alternative car usage is 0.31 with a p-value of 0.02508172069375031
- The correlation between '31-50 year olds' and alternative car usage is 0.69 with a p-value of 1.642038801736538e-08
- The correlation between '51-70 year olds' and alternative car usage is -0.04 with a p-value of 0.8025663158509312
- The correlation between '71+ year olds' and alternative car usage is -0.30 with a p-value of 0.02997162985336855
Data limitations due to the privacy of dataset. There is no public data about who exactly is the owner of each car (no way to know the vehicle type by age). Though, we made a generalized analysis and had good findings despite the data limitations.
ANOVA isn't exact due to the lack of data but there was 85% certainty of difference between the groups.
F_onewayResult(statistic=1.7810678055878766, pvalue=0.1520002662561829)
Constant upward trend on Gasoline Prices and this could be accelerating adoption of alternative vehicles.
We can’t find a dataset with historical prices by state so we crossed Alternative rate vs Gas Tax by state but we found that are not correlated.
- The correlation between Gasoline Tax / gallon and Alternative Rate(%) is 0.36 with a p-value of 0.010517503577449759
To find if there is correlation between Political preferences and % Alternative Rate we follow the next steps:
- Taking an historical dataset (1976-2020) of Presidential elections it can be determined the “political preference” of each state according to the % of wins of specific party.
- States with democrat preference has a higher mean but we ran a T-test and found that statistically there was a difference in the means. Ttest_indResult(statistic=-5.036026493630137, pvalue=4.853783005218271e-05)
- Ran a correlation between states with democrat preference an % Alternative rate.
The correlation between Democrat Wins(%) and Alternative Rate(%) is 0.64 with a p-value of 4.1007651001537584e-07
Findings: states with democrat preferences and % alternative rate have a high level of predictivity.
Crrelation analysis to found if there is correlation between states with higher incentives to own an alternative vehicle and we found those variables are highly predictivity among them.
The correlation between Total Incentives and Alternative Rate(%) is 0.65 with a p-value of 2.8704453970170873e-07
The correlation between Stations per vehicle and Alternative Rate(%) is 0.82 with a p-value of 1.7167868528303151e-13
We first did the correlations to see which variables were found along with a higher percent of alternative vehicle usage. After we decided which variables to use as predictors (considered variables that had a correlation coefficient of >.60 only) we did a point based data frame in which we could summerize the ones that had more points by weighing the different variables and suming them to see which state would be the best decision to build a new alternative vehicle store. Our final findings were:
- The highest correlated factors with % alternative rate in our analysis was % of grad and post grad school, a highest income per capita and Highest % of population of 31-50.
Using the found correlations, we weighted an average to rate the states (100 points max)
From the data shown aboveand cosidering only the tendency of hybrids and electric car usage, we can conclude that the best state to open a new electric-car factory and/or warehouse would be 1.- DC 2.- Massachusetts 3.- Washington because they're the states with a steeper tendency. Though this results are not considering building or maintenance costs.



















