Linear Regression

Overview

Linear regression is a method to find the straight line that best fits a set of data points. It helps us understand how one variable (dependent variable) changes as another variable (independent variable) changes. By estimating the slope and intercept of the line, we can make predictions and analyze the relationship between the variables.

Formula

$\hat{y} = b_{0} + b_{1} x$

$\hat{y}$ represents the predicted value of the dependent variable, $x$ represents the independent variable, and $b_{0}$ and $b_{1}$ represent the estimated intercept and slope coefficients, respectively.

$b_{1} = \frac{\sum_{i = 1}^{n} (x_{i} - \bar{x}) (y_{i} - \bar{y})}{\sum_{i = 1}^{n} (x_{i} - \bar{x})^{2}}$

$b_{0} = \bar{y} - b_{1} \bar{x}$

Step-by-Step Implementation

A drugs dataset (Kaggle) was used with the columns as,

age
sex
bmi
children
smoker
region
charges (y – dependent variable)

The dataset can be used to classify what were the medical costs billed by health insurance for a particular person. There are no classes in this since this is a regression method which means the output is a continuous value. (Learn more about regression)

See implementation in Jupyter Notebook

References

To learn more about Artificial Intelligence concepts, see Artificial Intelligence, Machine Learning, and Deep Learning..
Learn ML with Google Machine Learning Crash Course.

Home
Machine Learning
- Supervised Learning
- Unsupervised Learning
Deep Learning
Recommender Systems

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Linear Regression

Overview

Formula

Step-by-Step Implementation

References

Clone this wiki locally