Skip to content

Exploratory data analysis in the stroke prediction dataset

Notifications You must be signed in to change notification settings

vitornegromonte/EDA_stroke

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Stroke Prediction Dataset - Exploratory Data Analysis

👋| About

This repository contains a project on Exploratory Data Analysis (EDA) applied to the Stroke Prediction Dataset. The goal is to investigate and understand the patterns and characteristics of the data.

🔬 | Project description

The project utilizes R and Python for conducting the analysis of the dataset. R is employed for the unidimensional analysis of qualitative variables, making use of its extensive range of packages specialized in data visualization and analysis. The main packages utilized in this phase are as follows:

  • tidyverse
  • esquisse
  • ggthemes
  • data.table
  • outliers
  • BHH2
  • latex2exp
  • moments
  • modeest

For the unidimensional analysis of quantitative variables and the bidimensional analysis, Python is preferred due to its versatility in handling data analysis tasks. Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn are utilized extensively. These libraries provide robust data manipulation capabilities, statistical functions, and advanced visualization capabilities. The bidimensional analysis is conducted using Python, leveraging its extensive data manipulation capabilities and the rich set of statistical and visualization libraries.

Programming Languages Diagram

Note: The project requires R, Python and Jupyer to be installed on your system along with the necessary packages mentioned above. To install them you can use:

R Python Jupyter Notebook

🌱 | Getting Started

  1. Clone the repository:

    git clone https://github.com/vitornegromonte/EDA_stroke.git
  2. Navigate to the project directory:

    cd EDA_stroke
  3. For run the R scripts execute the file R/packages.R to install all the required packages.

Releases

No releases published

Packages

No packages published