Jupyter Notebooks with R scripts for developing and analysing multiple linear regression models.

johnmalcolm/Regression-Modelling-with-R

Applied Regression Modelling

This repository contains Jupyter Notebooks with R scripts inspired by the book Applied Regression Modelling by Iain Pardoe.

The book focuses mostly on the mathematical foundations of these models rather than on their implementation in R or Python. In this repository I have created Jupyter Notebooks with R code that give examples of how to build multiple linear regression models, interpret the results, check model assumptions, apply transformations and identify influential points. These essential data science tasks are worked through alongside the mathematical formulas in the notebooks, to show the underlying maths of each step.
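As a rough idea of what this workflow looks like in R, here is a minimal sketch. It is not taken from the notebooks: it assumes the `mtcars` dataset that ships with base R (not one of the repository's datasets), and the choice of predictors is purely illustrative.

```r
# Illustrative only: mtcars is bundled with base R and is an assumption here,
# not one of the datasets used in the notebooks.
data(mtcars)

# Fit a multiple linear regression of fuel efficiency on weight and horsepower.
fit <- lm(mpg ~ wt + hp, data = mtcars)

# Coefficient estimates, standard errors, t-statistics and R-squared.
print(summary(fit))

# 95% confidence intervals for the regression coefficients.
print(confint(fit, level = 0.95))

# 95% prediction interval for a single hypothetical new car.
new_car <- data.frame(wt = 3, hp = 110)
pred <- predict(fit, newdata = new_car, interval = "prediction", level = 0.95)
print(pred)

# Refit with a log-transformed response, one of several possible transformations.
fit_log <- lm(log(mpg) ~ wt + hp, data = mtcars)
print(summary(fit_log)$adj.r.squared)
```

The two models have different response scales, so their R-squared values are not directly comparable; residual plots are usually the better guide when choosing a transformation.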

Notebooks

  1. Model Comparisons, Assumptions & Predictions in Cars City Miles Efficiency
  2. The value of visually examining the data rather than only comparing model parameters
  3. Compare Models, Check Assumptions & F-Test with Mortality & Air Pollution
  4. RSS, F-Test & ANOVA in predicting box office success
  5. Illustrating unimportant predictors with shipping labour hours dataset
  6. Transformations into Quadratic and Square Root Models
  7. Prediction Intervals and transformations for home tax dataset
  8. Transformations for GDP~Internet Model
  9. Analysis of interactions in multivariate analysis
  10. Removal of interaction terms
  11. Confounding levels in qualitative factors
  12. Diagnostic Plots, Leverage, Cook's Distance & Outliers
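
Several of the notebooks listed above compare nested models with an F-test and examine leverage and Cook's distance. The sketch below shows one way these pieces fit together in base R. Again, `mtcars` and the chosen predictors are assumptions for illustration, not the notebooks' own data or models.

```r
# Illustrative only: mtcars and these predictors are assumptions,
# not the datasets or models used in the notebooks.
data(mtcars)

# A reduced model and a complete model that nests it.
reduced  <- lm(mpg ~ wt, data = mtcars)
complete <- lm(mpg ~ wt + hp + qsec, data = mtcars)

# Nested-model F-test: do hp and qsec jointly improve the fit?
comparison <- anova(reduced, complete)
print(comparison)

# The same F-statistic computed by hand from the residual sums of squares.
rss_reduced  <- sum(resid(reduced)^2)
rss_complete <- sum(resid(complete)^2)
df_num <- df.residual(reduced) - df.residual(complete)
f_stat <- ((rss_reduced - rss_complete) / df_num) /
  (rss_complete / df.residual(complete))
print(f_stat)

# Leverage (hat values) and Cook's distance for the complete model.
lev   <- hatvalues(complete)
cooks <- cooks.distance(complete)

# The three observations with the largest Cook's distance.
print(head(sort(cooks, decreasing = TRUE), 3))
```

The hand-computed `f_stat` should match the F value reported by `anova()`, which is a useful check on the formula. In an interactive session, `plot(complete)` draws the standard diagnostic plots for the same model.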

The Jupyter Server

I am running a Jupyter Notebook server on my AWS EC2 instance, accessible at https://stats.fieldmap.me/. If you would like access to the server, please contact hi@johnmalcolmdesign.com. Datasets are hosted on S3.
