This repository contains R scripts used for the statistical analysis of seed germination experiments conducted in two experimental phases. The analyses focus on germination dynamics under different moisture treatments and provenances, using classical survival analysis and accelerated failure time (AFT) models.
The workflow integrates descriptive summaries, exploratory visualization, correlation analysis, Kaplan–Meier estimators, and parametric survival models.
- Experiment phases
  - Phase 1: Germination monitored for up to 71 days
  - Phase 2: Germination monitored for up to 103 days (with a short 71-day subset for comparison)
- Species
  - Abies alba
  - Abies nordmanniana
  - Fagus sylvatica
  - Fagus orientalis
- Treatments
  - Moist vs. dry moisture treatments
  - Multiple provenances per species
- Experimental unit
  - Individual seeds (100 seeds per tray)
  - Germination time recorded as time-to-event data
  - Non-germinated seeds treated as right-censored observations
The analysis relies on the following R packages:
library(survival)
library(ggplot2)
library(flexsurv)
library(dplyr)
library(tibble)
library(survminer)
library(PerformanceAnalytics)

All packages are available from CRAN.
- Phase 1 and Phase 2 datasets are imported separately.
- Tray identifiers are made unique across phases.
- Metadata and cumulative germination counts are separated from individual seed observations.
- Final germination counts and percentages are extracted per provenance.
- Moist and dry treatments are combined where appropriate.
- Seed dry weight and post-stratification moisture content are added to the summary tables.
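
The preparation steps above can be sketched in base R. This is only an illustration under stated assumptions: the data frames, column names (`tray`, `provenance`, `treatment`, `germinated`), and all values are hypothetical and not taken from the repository's data files.

```r
# Hypothetical tray-level tables; tray numbers repeat across phases.
phase1 <- data.frame(
  tray       = 1:4,
  provenance = c("A", "A", "B", "B"),
  treatment  = c("moist", "dry", "moist", "dry"),
  germinated = c(62, 40, 15, 9)
)
phase2 <- data.frame(
  tray       = 1:4,
  provenance = c("A", "A", "B", "B"),
  treatment  = c("moist", "dry", "moist", "dry"),
  germinated = c(70, 35, 20, 12)
)

# Tag each table with its phase, then build a tray ID that is unique across phases.
phase1$phase <- 1L
phase2$phase <- 2L
trays <- rbind(phase1, phase2)
trays$tray_id <- paste0("P", trays$phase, "_T", trays$tray)

# Final germination percentage per tray (100 seeds per tray, as stated above).
seeds_per_tray <- 100
trays$germ_pct <- 100 * trays$germinated / seeds_per_tray

# Combine moist and dry trays: final counts per phase and provenance.
final_by_prov <- aggregate(germinated ~ phase + provenance, data = trays, FUN = sum)
# Two trays (moist + dry) per provenance in this toy example.
final_by_prov$germ_pct <- 100 * final_by_prov$germinated / (2 * seeds_per_tray)
print(final_by_prov)
```

Seed dry weight and moisture content could then be joined to `final_by_prov` by provenance, for example with `merge()`.
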
- Time-series plots of cumulative germination counts by species, provenance, and treatment.
- Visual inspection of germination dynamics across experimental phases.
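
As a rough illustration of how a cumulative germination curve can be derived and plotted, the sketch below counts germinated seeds per day for one hypothetical tray. The vector `germ_day` and its values are made up; the plotting step assumes `ggplot2` is installed and is skipped otherwise.

```r
# Hypothetical germination days for ten seeds; NA marks seeds that never germinated.
germ_day <- c(5, 5, 8, 12, 12, 12, 20, NA, NA, NA)

# Cumulative number of germinated seeds on each day of a 71-day observation window.
days <- 0:71
cum_count <- sapply(days, function(d) sum(germ_day <= d, na.rm = TRUE))
curve <- data.frame(day = days, cum_count = cum_count)

# Step plot of the cumulative count (only if ggplot2 is available).
if (requireNamespace("ggplot2", quietly = TRUE)) {
  p <- ggplot2::ggplot(curve, ggplot2::aes(x = day, y = cum_count)) +
    ggplot2::geom_step() +
    ggplot2::labs(x = "Day", y = "Cumulative germinated seeds")
}
```

With real data, the same curve would be computed per tray and the plot split by species, provenance, and treatment, for instance via `facet_wrap()` and a colour aesthetic.
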
- Pearson correlation analysis between:
  - Final germination counts
  - Seed dry weight
  - Moisture content
- Results indicate no strong linear association between final germination success and seed weight or moisture content.
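
A minimal sketch of such a correlation analysis with base R is shown below. The table `summary_tbl`, its column names, and all values are hypothetical placeholders, so the resulting coefficients say nothing about the actual experiment.

```r
# Hypothetical per-provenance summary table (illustrative values only).
summary_tbl <- data.frame(
  final_germ   = c(62, 40, 15, 70, 35, 20),
  dry_weight   = c(48.1, 52.3, 45.0, 50.2, 47.7, 49.9),
  moisture_pct = c(31.0, 29.5, 33.2, 30.1, 32.4, 28.8)
)

# Pairwise Pearson correlation matrix.
cor_mat <- cor(summary_tbl, method = "pearson")
print(round(cor_mat, 2))

# Test of a single pair, with confidence interval and p-value.
ct <- cor.test(summary_tbl$final_germ, summary_tbl$dry_weight, method = "pearson")
print(ct)
```

The `PerformanceAnalytics` package listed above provides `chart.Correlation()`, which draws a comparable matrix graphically; whether the scripts use it for this step is an assumption.
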
- Individual seed-level datasets are constructed.
- Germination time is treated as the event time.
- Non-germinated seeds are right-censored at:
  - Day 72 (Phase 1 / short Phase 2)
  - Day 104 (long Phase 2)
- Survival curves (probability of not germinating) are estimated:
  - Separately for Abies and Fagus
  - Stratified by provenance
- Results are visualized using Kaplan–Meier plots.
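
The censoring scheme and the Kaplan–Meier step can be sketched as follows, using the `survival` package. The data frame `seeds` and its columns are hypothetical; only the censoring day (72) comes from the description above.

```r
library(survival)

# Hypothetical seed-level records: germ_day is NA for seeds that never germinated.
seeds <- data.frame(
  provenance = rep(c("A", "B"), each = 5),
  germ_day   = c(12, 20, 35, NA, NA, 18, 40, NA, NA, NA)
)

# Non-germinated seeds are right-censored at day 72 (Phase 1 / short Phase 2).
censor_day  <- 72
seeds$event <- as.integer(!is.na(seeds$germ_day))          # 1 = germinated, 0 = censored
seeds$time  <- ifelse(is.na(seeds$germ_day), censor_day, seeds$germ_day)

# Kaplan–Meier estimate of the probability of not yet having germinated,
# stratified by provenance.
km <- survfit(Surv(time, event) ~ provenance, data = seeds)

# Estimated fraction of seeds still ungerminated at day 71, per provenance.
print(summary(km, times = 71))
```

The fitted object can be drawn with `plot(km)` or, since `survminer` is in the package list, with `ggsurvplot(km, data = seeds)`.
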
- Parametric AFT models are fitted using the following distributions:
  - Exponential
  - Weibull
  - Log-normal
  - Log-logistic
  - Gaussian
- Model selection is based on AIC.
- Log-normal models provide the best fit for both Abies and Fagus.
- Final models include provenance and moisture treatment as covariates.
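
The model comparison can be illustrated with `survreg()` from the `survival` package, which supports all five distributions listed above. This is a sketch on simulated data, not the repository's code: the scripts may use `flexsurv` instead, and the sample size, effect sizes, and variable names below are assumptions.

```r
library(survival)

# Simulated seed-level data (hypothetical): log-normal germination times that
# depend on provenance and treatment, observed for 71 days.
set.seed(1)
n <- 200
d <- data.frame(
  provenance = factor(rep(c("A", "B"), each = n / 2)),
  treatment  = factor(rep(c("moist", "dry"), times = n / 2))
)
latent <- rlnorm(
  n,
  meanlog = 3.5 + 0.4 * (d$provenance == "B") + 0.3 * (d$treatment == "dry"),
  sdlog   = 0.6
)
d$event <- as.integer(latent <= 71)                 # germinated within the window
d$time  <- ifelse(d$event == 1, ceiling(latent), 72) # censored at day 72 otherwise

# Fit one AFT model per candidate distribution.
dists <- c("exponential", "weibull", "lognormal", "loglogistic", "gaussian")
fits <- lapply(dists, function(dn) {
  survreg(Surv(time, event) ~ provenance + treatment, data = d, dist = dn)
})
names(fits) <- dists

# Compare by AIC; the smallest value indicates the preferred model.
aic  <- sapply(fits, AIC)
best <- names(which.min(aic))
print(sort(aic))

# For log-scale models, exp(coefficient) is a time ratio: values above 1 mean
# slower germination relative to the reference level (alphabetically first).
time_ratio <- exp(coef(fits[["lognormal"]])[-1])
print(time_ratio)
```
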
- Short (71-day) and long (103-day) Phase 2 observations are compared.
- AFT models are fitted separately to assess the impact of observation window length on parameter estimates.
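
One way to picture the effect of the observation window is to re-censor a long (103-day) dataset at the short cutoff and refit the same model. The sketch below does this on simulated data with an intercept-only log-normal model. All values are hypothetical, and the actual scripts may construct the short subset differently.

```r
library(survival)

# Simulated long-window data (hypothetical): censored at day 104.
set.seed(2)
latent <- rlnorm(300, meanlog = 4, sdlog = 0.7)
long <- data.frame(
  time  = ifelse(latent <= 103, ceiling(latent), 104),
  event = as.integer(latent <= 103)
)

# Short window: seeds germinating after day 71 become censored at day 72.
still_event <- long$event == 1 & long$time <= 71
short <- data.frame(
  time  = ifelse(still_event, long$time, 72),
  event = as.integer(still_event)
)

# Same log-normal AFT model fitted to both observation windows.
fit_long  <- survreg(Surv(time, event) ~ 1, data = long,  dist = "lognormal")
fit_short <- survreg(Surv(time, event) ~ 1, data = short, dist = "lognormal")

# Side-by-side parameter estimates.
comparison <- rbind(
  long  = c(coef(fit_long),  scale = fit_long$scale),
  short = c(coef(fit_short), scale = fit_short$scale)
)
print(comparison)
```

Differences between the two rows indicate how sensitive the estimates are to stopping observation early.
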
- Cleaned and combined data tables for downstream analysis
- Publication-ready figures for germination dynamics and survival curves
- Parametric survival model summaries with effect estimates and confidence intervals
- Provenances with no or extremely low germination are excluded from parametric modeling to ensure model stability.
- The scripts are written as a complete, linear analysis pipeline and are intended to be run top-to-bottom.
Mert Çelik

PhD-level analysis script prepared for academic research in plant ecology and seed biology.