Skip to content

abhishekmanglaa/linear-regression-bikeshare

Repository files navigation

Linear Regression Bike Share

Link to the Markdown file

Installation

Using tidyverse and the strength of ggplot2.

library(tidyverse)
library(readr)

You can also knit the file to a html document

Usage

install.packages('readr')
install.packages('tidyverse')

About R Markdown - https://r4ds.had.co.nz/r-markdown.html#r-markdown-basics

Data

Bikeshare systems are popular in major cities around the world and are increasingly viewed as an important mechanism to reduce auto traffic, improve air quality, reduce use of fossil fuels, and improve the health of the population. It is particularly important to be able to predict the use level of the bikesharing system in order to plan maintenance, distribution of the bikes, and make decisions on whether and where to add more bike stations. This data is from Washington, DC’s Capital bikeshare system from the years 2011-2012. Riders can rent bikes from one location and return to a different location in the system. There are two types of users: CASUAL users, who rent the bike for a one-time fee, and REGISTERED users, who pay a yearly membership fee in exchange for unlimited bike rental. The data in the accompanying file bikeshare.csv is publicly available at the UC Irvine Machine Learning Repository: http://archive.ics.uci.edu/ml/datasets/Bike+Sharing+Dataset The file contains daily information on the number of trips taken. The variables in the data set are:

  1. MONTH

  2. HOLIDAY: whether the day is a US holiday or not

  3. WEEKDAY: if day is neither weekend nor holiday WEEKDAY is YES, otherwise WEEKDAY is NO.

  4. WEATHERSIT: The values are (1) Clear/Few clouds (2) Misty (3) Light snow or light rain (4) Heavy rain, snow, or thunderstorms

  5. TEMP Normalized temperature in Celsius. The values are derived as !"!#$% , where 𝑡*+, = −8 and 𝑡*01 = +39

  6. ATEMP: Normalized “feels like” temperature in Celsius. The values are derived as !"!#$% , where 𝑡*+, = −16 and 𝑡*01 = +50 !#&'"!#$%

  7. HUMIDITY: Normalized humidity on a scale of 0 to 1.

  8. WINDSPEED: Normalized wind speed on a scale of 0 to 1.

  9. CASUAL: count of bikes rented by casual bikeshare users.

  10. REGISTERED: count of registered users.

  11. COUNT: count of total rental bikes including both casual and registered.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT