This Exploratory Data Analysis (EDA) project was conducted for ICCS361 at Mahidol University International College. It analyzes gene expression data to understand the differences between breast tumors and normal tissue.
- Data:
  - The dataset is available in two versions:
    - Non-preprocessed Version: Contains FPKM values for each gene in each sample.
    - Preprocessed Version: Adds metastasis information and restructures the data so that each gene in each sample occupies a single row.
- Code: Scripts for data analysis and visualization.
- Presentation Slides: A presentation that introduces the dataset and answers five key questions with visualizations.
- Dataset: The gene expression dataset used in this project can be accessed on NCBI GEO.
- Visualization Tutorial: For detailed instructions on generating visualizations, refer to this YouTube tutorial.
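
The restructuring described for the preprocessed version (one row per gene per sample) can be illustrated with a minimal sketch. It is not the project's actual preprocessing script: the gene names, sample names, FPKM values, the `to_long` helper, and the assumption that metastasis status is recorded once per sample are all hypothetical and chosen only for illustration. The sketch uses plain Python. In pandas, `DataFrame.melt` performs the same wide-to-long reshape.

```python
# Hypothetical wide table: one row per gene, one FPKM column per sample.
wide_rows = [
    {"gene": "GENE_A", "sample_1": 12.5, "sample_2": 0.0},
    {"gene": "GENE_B", "sample_1": 3.2, "sample_2": 7.8},
]

# Hypothetical per-sample annotation standing in for the metastasis information.
# Assumption: metastasis status is known per sample, not per gene.
metastasis = {"sample_1": "yes", "sample_2": "no"}


def to_long(rows, annotations, id_key="gene"):
    """Reshape wide rows into one record per (gene, sample) pair."""
    long_rows = []
    for row in rows:
        for column, value in row.items():
            if column == id_key:
                continue  # the gene identifier is not a sample column
            long_rows.append({
                "gene": row[id_key],
                "sample": column,
                "fpkm": value,
                # None if a sample has no annotation
                "metastasis": annotations.get(column),
            })
    return long_rows


long_rows = to_long(wide_rows, metastasis)
for record in long_rows:
    print(record)
```

In this long layout, every record carries its own sample label and annotation, which makes it straightforward to group or filter by sample, gene, or metastasis status when plotting.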