In this project, I used Bayesian regression in the form of Gaussian Process Regression to model the Kaggle Energy Efficiency dataset.
The input feature vector contains 8 traits of simulated buildings, and the output is a two-element vector containing the heating load & cooling load.
The data was split into a training set and a test set.
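A minimal sketch of how the data could be loaded and split (the file name and the X1-X8 / Y1-Y2 column names are assumptions about the Kaggle CSV, not the notebook's exact code):

```python
import pandas as pd
from sklearn.model_selection import train_test_split

df = pd.read_csv("energy_efficiency.csv")        # hypothetical file name
X = df[[f"X{i}" for i in range(1, 9)]].values    # 8 building traits
y = df[["Y1", "Y2"]].values                      # heating load, cooling load

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42         # split ratio is an assumption
)
```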
Gaussian Processes (GPs) are probabilistic models that define a distribution over functions, which can be used to make predictions about new data given some observed samples.
GP regression estimates the posterior distribution over functions by conditioning a prior (specified by a mean function and a covariance kernel) on the observed data, with the kernel's hyperparameters typically tuned by maximizing the marginal likelihood.
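Concretely, in standard textbook notation (not specific to this project), assuming a zero-mean GP prior with kernel $k$ and observation noise $\sigma_n^2$, the posterior at a test input $x_*$ has mean and variance

$$
\begin{aligned}
\mu(x_*) &= k_*^\top \left(K + \sigma_n^2 I\right)^{-1} \mathbf{y},\\
\sigma^2(x_*) &= k(x_*, x_*) - k_*^\top \left(K + \sigma_n^2 I\right)^{-1} k_*,
\end{aligned}
$$

where $K_{ij} = k(x_i, x_j)$ over the training inputs and $k_* = [k(x_*, x_1), \dots, k(x_*, x_n)]^\top$.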
My favorite source on GPs is this one from Stanford, which also leads to other great sources.
The three models experimented with are listed below (a minimal setup sketch follows the list):
- Gaussian Process (GP) regression implemented with scikit-learn, using the Radial Basis Function (RBF) kernel, with kernel hyperparameters optimized by the Limited-memory BFGS (L-BFGS-B) algorithm.
- A vanilla neural network implemented with PyTorch, trained with a Mean Squared Error (MSE) loss.
- A Support Vector Regressor (SVR) implemented with scikit-learn, also using the RBF kernel.
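Here is a minimal sketch of how the three models might be set up; the hyperparameters, layer sizes, and optimizer settings are illustrative assumptions, not the notebook's exact values:

```python
import torch
import torch.nn as nn
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF
from sklearn.multioutput import MultiOutputRegressor
from sklearn.svm import SVR

# 1) GP regression with an RBF kernel; scikit-learn tunes the kernel's
#    length scale with L-BFGS-B by maximizing the log marginal likelihood.
gp = GaussianProcessRegressor(kernel=RBF(length_scale=1.0), normalize_y=True)

# 2) A small fully connected network trained with an MSE loss.
net = nn.Sequential(
    nn.Linear(8, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 2),               # two outputs: heating load, cooling load
)
loss_fn = nn.MSELoss()
optimizer = torch.optim.Adam(net.parameters(), lr=1e-3)

# 3) SVR with an RBF kernel; SVR predicts a single target, so wrap it
#    to handle both outputs.
svr = MultiOutputRegressor(SVR(kernel="rbf", C=10.0))
```

Each model is then fit on the training set and scored on the held-out test set.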
The accuracy results:

| Model | Train score | Test score |
| --- | --- | --- |
| GP Regression | 0.9966 | 0.9893 |
| Neural network | 0.9943 | 0.9912 |
| Support Vector Regression | 0.9923 | 0.9844 |
More information is in the notebook 😄
Future goal: implement the entire Bayesian Regression algorithm from scratch, using no machine learning libraries.
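As a starting point for that goal, here is a minimal NumPy-only sketch of exact GP regression with a fixed RBF length scale and noise level (no hyperparameter optimization); it is illustrative, not the planned implementation:

```python
import numpy as np

def rbf_kernel(A, B, length_scale=1.0):
    """Squared-exponential (RBF) kernel between the rows of A and B."""
    sq_dists = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * sq_dists / length_scale**2)

def gp_predict(X_train, y_train, X_test, length_scale=1.0, noise=1e-6):
    """Posterior mean and variance of a zero-mean GP at the test inputs."""
    K = rbf_kernel(X_train, X_train, length_scale) + noise * np.eye(len(X_train))
    K_s = rbf_kernel(X_train, X_test, length_scale)
    K_ss = rbf_kernel(X_test, X_test, length_scale)
    alpha = np.linalg.solve(K, y_train)      # (K + noise*I)^{-1} y
    mean = K_s.T @ alpha                     # posterior mean at test points
    v = np.linalg.solve(K, K_s)
    var = np.diag(K_ss - K_s.T @ v)          # posterior variance at test points
    return mean, var
```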
Overall, this project helped me get into statistics & optimization, which are both very cool topics that have myriad innovative applications!