Skip to content

This repository contains data and scripts used to repoduce analyses in the manuscript "Quantifying the differences between cider and dessert apples".

Notifications You must be signed in to change notification settings

MylesLab/cider-vs-dessert

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

69 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Quantifying the differences between cider and dessert apples


Tayab Soomro1, Sophie Watts1, ‪Zoë Migicovsky1, Sean Myles*1,2
Affiliations
1Department of Plant, Food, and Environmental Sciences, Faculty of Agriculture, Dalhousie University
2Agriculture & Agri-Food Canada, Kentville Research and Development Centre, Kentville, Nova Scotia

Table of contents

  1. About
  2. Abstract
  3. Figures
    3a. Figure 1
    3b. Figure 2
  4. Analysis
    4a. File Architecture
    4b. Getting the code
    4c. Dependencies
    4d. Reproducing the results

1. About

This repository contains data and scripts used to repoduce analyses in the manuscript "Quantifying the differences between cider and dessert apples".

2. Abstract

3. Figures

3a. Figure 1

Principal Component Analysis and distance analysis of various phenotypes in commonly grown English and French cider varieties, and dessert varieties.
Figure1
Principal Component Analysis and distance analysis of various phenotypes in commonly grown English and French cider varieties, and dessert varieties. A) Biplot of PC1 vs PC2. B,C) The difference between Dessert and Cider (English & French) varieties along the PC1 and PC2 is shown in violin and boxplots. The p-values on top are the output of Wilcoxon Ranked Sum Test. D) Density plot for pairwise Euclidean distances for English cider and French cider vs. dessert apples, and within dessert apples. E) Scatter plot shows the average Euclidean distance of each of the dessert apples from English cider (x-axis) and French cider (y-axis) apple varieties.

3b. Figure 2

Density plots for various phenotypes across English cider apples, French cider apples, and dessert apples. The x-axis shows the phenotype associated with each of the apple types.
Figure2
Density plots for various phenotypes across English cider apples, French cider apples, and dessert apples. The x-axis shows the phenotype associated with each of the apple types.

4. Analysis

4a. File architecture

├── analyses
├── data
│   ├── processed
│   └── raw
├── data_curation
├── figures
│   ├── density
│   ├── distance
│   ├── final_figures
│   ├── missing_data
│   └── pca
└── themes
  • The data directory contains all the data that was used for this analysis.
  • The data_curation directory contains the scripts generated to generate the final curated dataset.
  • The analyses directory contains the scripts generated for performing various analyses of this project
  • The figures directory contains the intermediary and the final figures generated for this project.

4b. Getting the code

You can download a copy of all the files in this repository by cloning the git repository:

$ git clone https://github.com/MylesLab/cider-vs-dessert.git

4c. Dependencies

All the code is written purely in R and the library dependencies for running the code are enumerated below:

ggpubr
ggthemes
readxl
reshape2
tidyverse
usedist
viridis
xlsx

4d. Reproducing the results

Given that you have above dependencies installed in your working machine, you should be able to run any of the code in this repository. However, in order to reproduce the analyses and obtain the same results, here is the list and order of the scripts you should run:

1. Data Curation
You first need to generate the dataset which is going to be used for running the analyses.
1a. data_curation/generate-final-dessert-apples-list.R - Running this file will generate data/processed/final_dessert_apple_phenotype_data.tsv.
1b. data_curation/generate-final-cider-apples-list.R - Running this file will generate data/processed/final_cider_apple_phenotype_data.tsv.
1c. data_curation/generate-final-phenotype-table.R - Running this file will generate data/processed/final_phenotype_table.tsv which is the main file used for all the analyses.

2. Analyses
Once the data/processed/final_phenotype_table.tsv file is generated, we can start to run the anlyses.
2a. analyses/pca-density-analysis.R - Running this file will generate Figure-2_density plot, as well as some of the intermediary data files which are required for generating final Figure 1 plot.
2b. analyses/distance-analysis.R - Running this file will generate the final Figure 1 plot

About

This repository contains data and scripts used to repoduce analyses in the manuscript "Quantifying the differences between cider and dessert apples".

Topics

Resources

Stars

Watchers

Forks

Languages