Skip to content

Latest commit

 

History

History
41 lines (30 loc) · 1.18 KB

README.md

File metadata and controls

41 lines (30 loc) · 1.18 KB

Data analysis for stroke liklihood

This project will focus on an analysis of a data-set showing patient health data, and if the patient has suffered a stroke. The analysis will perform initial data cleanup and validation, linear approaches, KNN approaches, and finally a random forest analysis and SVN.

Installation

To install:

  • Download and Install Anaconda:
  • Go to Anaconda’s official website.
  • Download the appropriate version for your operating system.
  • Run the installer and follow the on-screen instructions.
  • Verify Installation:
  • Open your terminal or command prompt.
  • Run the command:
conda --version

If the version number is displayed, Anaconda is installed correctly.

Launching Jupyter Notebook

  • Activate the Anaconda Environment (Optional):
conda activate <environment_name>
  • Start Jupyter Notebook:
  • Open your terminal or Anaconda Prompt.
  • Run:
jupyter notebook

This will open Jupyter Notebook in your default web browser.

Data

This repo features an adjusted data set of patient health data, including dummy data to help skew results for testing validation.

The data file is located at ./data/healthcare-dataset-stroke-data.csv