Tayab Soomro1, Sophie Watts1, Zoë Migicovsky1, Sean Myles*1,2
1Department of Plant, Food, and Environmental Sciences, Faculty of Agriculture, Dalhousie University
2Agriculture & Agri-Food Canada, Kentville Research and Development Centre, Kentville, Nova Scotia
- About
- Abstract
- Figures
3a. Figure 1
3b. Figure 2 - Analysis
4a. File Architecture
4b. Getting the code
4c. Dependencies
4d. Reproducing the results
This repository contains data and scripts used to repoduce analyses in the manuscript "Quantifying the differences between cider and dessert apples".
├── analyses
├── data
│ ├── processed
│ └── raw
├── data_curation
├── figures
│ ├── density
│ ├── distance
│ ├── final_figures
│ ├── missing_data
│ └── pca
└── themes
- The
data
directory contains all the data that was used for this analysis. - The
data_curation
directory contains the scripts generated to generate the final curated dataset. - The
analyses
directory contains the scripts generated for performing various analyses of this project - The
figures
directory contains the intermediary and the final figures generated for this project.
You can download a copy of all the files in this repository by cloning the git repository:
$ git clone https://github.com/MylesLab/cider-vs-dessert.git
All the code is written purely in R and the library dependencies for running the code are enumerated below:
ggpubr
ggthemes
readxl
reshape2
tidyverse
usedist
viridis
xlsx
Given that you have above dependencies installed in your working machine, you should be able to run any of the code in this repository. However, in order to reproduce the analyses and obtain the same results, here is the list and order of the scripts you should run:
1. Data Curation
You first need to generate the dataset which is going to be used for running the analyses.
1a. data_curation/generate-final-dessert-apples-list.R
- Running this file will generate data/processed/final_dessert_apple_phenotype_data.tsv
.
1b. data_curation/generate-final-cider-apples-list.R
- Running this file will generate data/processed/final_cider_apple_phenotype_data.tsv
.
1c. data_curation/generate-final-phenotype-table.R
- Running this file will generate data/processed/final_phenotype_table.tsv
which is the main file used for all the analyses.
2. Analyses
Once the data/processed/final_phenotype_table.tsv
file is generated, we can start to run the anlyses.
2a. analyses/pca-density-analysis.R
- Running this file will generate Figure-2_density plot, as well as some of the intermediary data files which are required for generating final Figure 1 plot.
2b. analyses/distance-analysis.R
- Running this file will generate the final Figure 1 plot