This project focuses on analyzing sequencing data to uncover molecular mechanisms in neurological diseases and evaluate immunotherapy potential in breast cancer. It includes Python and R scripts for data processing, analysis, and visualization.
- Objective: Analyze RNA-seq data to identify molecular deficiencies in a mouse knockout model.
- Key Steps:
- Preprocessing RNA-seq data using Python and
kallisto
. - Importing and analyzing transcript-level data in R using
tximport
. - Conducting differential gene expression analysis with
DESeq2
.
- Preprocessing RNA-seq data using Python and
- Objective: Predict the effectiveness of immunotherapy for a breast cancer patient.
- Key Steps:
- Preprocessing single-cell RNA-seq data with
Seurat
. - Clustering cells and identifying cell types based on marker genes.
- Visualizing spatial distributions and answering clinical questions.
- Preprocessing single-cell RNA-seq data with
- Languages: Python, R
- Tools: kallisto, tximport, DESeq2, Seurat
- Visualization: UMAP, PCA, ggplot2
- Platforms: GitHub for version control
.
├── finel_projet_part1.ipynb # Python script for preprocessing RNA-seq data
├── genomics_project_Q1.R # R script for Part 1 - analysis of bulk sequencing
├── genomics_project_Q2.R # R script for Part 2 - analysis of single cell sequencing
├── Neuro Genomics Project.pdf # Detailed project report
├── neuro-genomics project instructions.pdf # Project instructions
├── README.md # Project documentation
- Python 3.x
- R with the following packages installed: kallisto, tximport, DESeq2, Seurat
- Clone the repository:
git clone https://github.com/12danielLL/Neurogenomics_Project.git cd neuro-genomics-project
- Install dependencies:
- Python:
pip install -r requirements.txt
- R: Install the required packages manually or use the script
install_packages.R
.
- Python:
- Run the Python preprocessing script:
python scripts/preprocess.py
- Execute R scripts for analysis:
- Bulk analysis:
Rscript scripts/bulk_analysis.R
- Single-cell analysis:
Rscript scripts/single_cell_analysis.R
- Bulk analysis:
- Key pathways identified: lipid metabolism, neurological function, and cell structure.
- Potential treatment strategies proposed: supporting myelin production.
- Immune cells comprise over 50% of the biopsy.
- Immune cells are spatially mixed with tumor cells, increasing the likelihood of immunotherapy success.
This project is licensed under the MIT License - see the LICENSE file for details.
- Daniel Broker
- Or Shachar