To the extent possible under law,
Titus Brown and
Shannon Joslin
have waived all copyright and related or neighboring rights to
GGG 298, Winter 2021 at UC Davis.
This work is published from:
United States.
C. Titus Brown (IOR) (ctbrown@ucdavis.edu), Shannon Joslin (sejoslin@ucdavis.edu).
This course will provide a practical introduction to common tools used in data-intensive research, including the UNIX shell, version control with git, RMarkdown, JupyterLab, and workflows with snakemake. The associated discussion section will connect the lab practicals to foundational concepts in data science, including repeatability/reproducibility, statistics, and publication ethics.
This course is open to all graduate students. No prior computational experience is required or assumed. There will be some minimal overlap with GGG 201(b) topics. All materials will be open to the community and freely available online.
Week 1: Introduction to the course, and a basic RNAseq pipeline -- lab notes
Week 2: UNIX shell for file manipulation -- lab notes
Week 3: Conda for software installation -- lab notes
Week 4: Snakemake for workflows -- lab notes
Week 5: cancelled for Internet outage reasons.
Week 6: Integrating it all, round 1 - a sourmash project!
Week 7: Git and GitHub for file tracking and sharing -- lab notes
Week 8: Project organization and more UNIX shell -- lab notes
Week 9: Slurm and the Farm cluster for doing analysis -- lab notes
Week 10: Integrating it all, round 2 - a sourmash project!
Week 11: R/Rmarkdown for Reports, Documentation and beyond -- lab notes