This repository has been archived by the owner on Mar 22, 2020. It is now read-only.

The Taarifa Water Pumps Problem - Classification & Machine Learning Challenge

Using data from Taarifa and the Tanzanian Ministry of Water, can you predict which pumps are functional, which need some repairs, and which don't work at all? This is an intermediate-level practice competition. Predict one of these three classes based on a number of variables about what kind of pump is operating, when it was installed, and how it is managed. A smart understanding of which waterpoints will fail can improve maintenance operations and ensure that clean, potable water is available to communities across Tanzania.


Project Structure

This project contains 3 directories:

  • data: datasets;
  • scripts: algorithms, library imports, comments;
  • reports: all project plots (boxplots, histograms, etc.) and documentation.

Utilised Packages

  • caret: training/test set partitioning;
  • naiveBayes (provided by the e1071 package): probabilistic classification;
  • glm2: Generalized Linear Models;
  • randomForest: Ensemble Learning.
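
As a rough illustration of how two of these packages can fit together, the sketch below splits a data set with caret and fits a three-class random forest. It is not taken from the project's scripts: the data are synthetic, and the column names (`pump_age`, `pump_type`, `status`) and class labels are assumptions made for the example.

```r
# Minimal sketch on synthetic data; column names and labels are illustrative assumptions.
library(caret)         # createDataPartition() for a stratified split
library(randomForest)  # randomForest() ensemble classifier

set.seed(42)
n <- 300
pumps <- data.frame(
  pump_age  = runif(n, 0, 40),  # hypothetical numeric feature
  pump_type = factor(sample(c("handpump", "gravity", "submersible"), n, replace = TRUE)),
  status    = factor(sample(c("functional", "needs_repair", "non_functional"), n, replace = TRUE))
)

# Stratified 80/20 split on the target class
train_idx <- createDataPartition(pumps$status, p = 0.8, list = FALSE)
train_set <- pumps[train_idx, ]
test_set  <- pumps[-train_idx, ]

# Fit a random forest on the training rows only
rf_model <- randomForest(status ~ ., data = train_set, ntree = 100)

# Predict one of the three classes for each held-out row
predictions <- predict(rf_model, newdata = test_set)

# Confusion table of predicted versus actual classes
print(table(predicted = predictions, actual = test_set$status))
```

Because the labels here are random, the confusion table will show near-chance accuracy; the point is only the shape of the workflow (split, fit, predict, compare).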

Reference Links

About Taarifa:

Challenge's Description:

Dataset's Description: