Skip to content

Latest commit

 

History

History
38 lines (27 loc) · 5.41 KB

README.md

File metadata and controls

38 lines (27 loc) · 5.41 KB

Influenza virus deep mutational scanning data sets visualized by dms-view

Data sets

Hemagglutinin (HA)

  • Mutational tolerance of the H1 HA from A/WSN/1933 as measured by Doud and Bloom (2016). See here for an interactive dms-view visualization that shows the mutational tolerance focusing on sites within 5 angstroms of the receptor. The raw data are here.
  • Mutational Antigenic Profiling of A/WSN/1933 H1 HA from monoclonal antibodies measured by Doud, Lee, and Bloom, 2018. See here for an interactive dms-view visualization focusing on the antibody S139. The raw data are here.
  • Mutational Antigenic Profiling of Perth09 H3 HA from human and ferret sera measured by Lee et al., 2019. See here for an interactive dms-view visualization focusing on sites strongly selected by different human sera. The raw data are here.
  • Mutational tolerance of the H3 HA from A/Perth/2009 as measured by Lee et al., (2018). See here for an interactive dms-view visualization that shows the mutational tolerance focusing on sites within 5 angstroms of the receptor.

Adding data sets

  1. Make a new branch or a fork of this repository.

  2. Navigate to the ./data/ directory and create a subdirectory for the appropriate protein and study. Data are organized first by protein and then by study, with studies named by the first author and year, such as ./data/HA/Doud2016/.

  3. Within that subdirectory, add the three input files that are needed to visualize the data. As described in the dms-view docs, these are a CSV file of the data, a protein structure PDB file, and a Markdown metadata file. In general, you should name these with the CSV data file named according to the study name (e.g., ./data/HA/Doud2016/Doud2016.csv), the PDB file simply being the PDB (e.g., ./data/HA/Doud2016/1rvx_1trimer.pdb), and the Markdown metadata file also named according to the study name (e.g. ./data/HA/Doud2016/Doud2016.md). In addition, you should add a README within the subdirectory explaining the origin of the files (e.g., ./data/HA/Doud2016/README.md). In some cases (such as if the same data is mapped to multiple protein structures) you may need to extend or modify this naming scheme. If you need to process other rawer forms of the data (e.g., from paper supplements) into the final data file for dms-view, then include those rawer data and the processing scripts if possible, explaining in the subdirectory repo.

  4. Make a pull request for your branch or fork to add the data.

  5. In this top-level README file add a short description of the study and a link to an appropriate dms-view view of the dataset.

  6. Make a pull request for your branch or fork to add the link.