Commit
Built site for gh-pages
Quarto GHA Workflow Runner committed Oct 1, 2024
1 parent 125e095 commit acdd940
Showing 4 changed files with 88 additions and 86 deletions.
2 changes: 1 addition & 1 deletion .nojekyll
@@ -1 +1 @@
a5fbdaff
3e7206df
103 changes: 47 additions & 56 deletions mod_data-disc.html
@@ -367,20 +367,11 @@ <h2 id="toc-title">On this page</h2>
<ul>
<li><a href="#overview" id="toc-overview" class="nav-link active" data-scroll-target="#overview">Overview</a></li>
<li><a href="#learning-objectives" id="toc-learning-objectives" class="nav-link" data-scroll-target="#learning-objectives">Learning Objectives</a></li>
<li><a href="#panel-discussion" id="toc-panel-discussion" class="nav-link" data-scroll-target="#panel-discussion">Panel Discussion</a>
<ul class="collapse">
<li><a href="#pre-prepared-questions" id="toc-pre-prepared-questions" class="nav-link" data-scroll-target="#pre-prepared-questions">Pre-Prepared Questions</a></li>
</ul></li>
<li><a href="#panel-discussion" id="toc-panel-discussion" class="nav-link" data-scroll-target="#panel-discussion">Panel Discussion</a></li>
<li><a href="#data-repositories" id="toc-data-repositories" class="nav-link" data-scroll-target="#data-repositories">Data Repositories</a></li>
<li><a href="#general-data-searches" id="toc-general-data-searches" class="nav-link" data-scroll-target="#general-data-searches">General Data Searches</a>
<ul class="collapse">
<li><a href="#search-operators" id="toc-search-operators" class="nav-link" data-scroll-target="#search-operators">Search Operators</a></li>
<li><a href="#data-inventory-value" id="toc-data-inventory-value" class="nav-link" data-scroll-target="#data-inventory-value">Data Inventory Value</a></li>
</ul></li>
<li><a href="#downloading-data" id="toc-downloading-data" class="nav-link" data-scroll-target="#downloading-data">Downloading Data</a>
<ul class="collapse">
<li><a href="#general-data-searches" id="toc-general-data-searches" class="nav-link" data-scroll-target="#general-data-searches">General Data Searches</a></li>
<li><a href="#downloading-data" id="toc-downloading-data" class="nav-link" data-scroll-target="#downloading-data">Downloading Data</a></li>
<li><a href="#data-format-and-structure" id="toc-data-format-and-structure" class="nav-link" data-scroll-target="#data-format-and-structure">Data format and structure</a></li>
</ul></li>
<li><a href="#additional-resources" id="toc-additional-resources" class="nav-link" data-scroll-target="#additional-resources">Additional Resources</a>
<ul class="collapse">
<li><a href="#papers-documents" id="toc-papers-documents" class="nav-link" data-scroll-target="#papers-documents">Papers &amp; Documents</a></li>
@@ -446,8 +437,8 @@ <h2 class="anchored" data-anchor-id="panel-discussion">Panel Discussion</h2>
</div>
</div>
</div>
<section id="pre-prepared-questions" class="level3">
<h3 class="anchored" data-anchor-id="pre-prepared-questions">Pre-Prepared Questions</h3>
<section id="pre-prepared-questions" class="level4">
<h4 class="anchored" data-anchor-id="pre-prepared-questions">Pre-Prepared Questions</h4>
<ul>
<li>What policies are in place to ensure responsible use of your data?</li>
<li>What challenges (technical and scientific) do you see in integrating data across platforms and organizations?</li>
@@ -540,8 +531,8 @@ <h2 class="anchored" data-anchor-id="data-repositories">Data Repositories</h2>
<section id="general-data-searches" class="level2">
<h2 class="anchored" data-anchor-id="general-data-searches">General Data Searches</h2>
<p>If you don’t find what you’re looking for in a particular data repository (or want to look for data not included in one of those platforms), you might want to consider a broader search. For instance, <a href="https://www.google.com">Google</a> is a surprisingly good resource for finding data and, just as Google Scholar is a variant of Google specific to peer-reviewed literature, there is a dataset-specific variant called <a href="https://datasetsearch.research.google.com/">Google Dataset Search</a>.</p>
<section id="search-operators" class="level3">
<h3 class="anchored" data-anchor-id="search-operators">Search Operators</h3>
<section id="search-operators" class="level4">
<h4 class="anchored" data-anchor-id="search-operators">Search Operators</h4>
<p>Virtually all search engines support “operators” to create more effective queries (i.e., search parameters). If you don’t use operators, most systems will just return results that contain any of the words in your search, which is not ideal, especially when you’re looking for very specific criteria in candidate datasets.</p>
<p>See the tabs below for some useful operators that might help narrow your dataset search even when using more general platforms.</p>
<div class="tabset-margin-container"></div><div class="panel-tabset">
@@ -591,10 +582,19 @@ <h3 class="anchored" data-anchor-id="search-operators">Search Operators</h3>
<i class="callout-icon no-icon"></i>
</div>
<div class="callout-title-container flex-fill">
Activity: Data Inventory
Data Inventory Value
</div>
</div>
<div class="callout-body-container callout-body">
<p>Documenting potential datasets (and their metadata) thoroughly in a data inventory provides numerous benefits! These include:</p>
<ul>
<li>Well-documented datasets make it easier for researchers to find and access specific data for reproducible research</li>
<li>Documentation will help researchers to quickly understand the context, scope, and limitations of the data, reducing the time spent on preliminary data assessment</li>
<li>Detailed documentation will speed up the data publication process (e.g., by recording data provenance and differences among methods)</li>
<li>When you need to generate metadata for your own synthesis data product you’ll already have much of the information you need</li>
</ul>
<section id="activity-data-inventory" class="level4">
<h4 class="anchored" data-anchor-id="activity-data-inventory">Activity: Data Inventory</h4>
<p><strong>Part 1</strong> (~25 min)</p>
<p>In your project groups:</p>
<ul>
@@ -604,9 +604,9 @@ <h3 class="anchored" data-anchor-id="search-operators">Search Operators</h3>
<li><em>Later, each person will download their assigned dataset</em></li>
</ul></li>
<li>Discuss what key information is needed to determine if each dataset is useful for your project.</li>
<li>Once you’ve identified the necessary information, start completing the second sheet of your data inventory Google Sheet.
<li>Once you’ve identified the necessary information, start completing the detailed data inventory Google Sheet (one tab per project group).
<ul>
<li><em>This sheet is more detailed and will be shared with another group later</em></li>
<li><em>This sheet will be shared with another group later</em></li>
</ul></li>
</ul>
<p><strong>Part 2</strong> (~10 min)</p>
@@ -620,6 +620,7 @@ <h3 class="anchored" data-anchor-id="search-operators">Search Operators</h3>
<li>Do you agree with the information entered in the data inventory?</li>
<li>Is there any information you think should be in the data inventory that wasn’t?</li>
</ul>
</section>
</div>
</div>
<div class="callout callout-style-default callout-warning no-icon callout-titled">
@@ -641,45 +642,11 @@ <h3 class="anchored" data-anchor-id="search-operators">Search Operators</h3>
</div>
</div>
</section>
<section id="data-inventory-value" class="level3">
<h3 class="anchored" data-anchor-id="data-inventory-value">Data Inventory Value</h3>
<p>Documenting potential datasets (and their metadata) thoroughly in a data inventory provides numerous benefits! These include:</p>
<ul>
<li>Well-documented datasets make it easier for researchers to find and access specific data for reproducible research</li>
<li>Documentation will help researchers to quickly understand the context, scope, and limitations of the data, reducing the time spent on preliminary data assessment</li>
<li>Detailed documentation will speed up the data publication process (e.g., by recording data provenance and differences among methods)</li>
<li>When you need to generate metadata for your own synthesis data product you’ll already have much of the information you need</li>
</ul>
</section>
</section>
<section id="downloading-data" class="level2">
<h2 class="anchored" data-anchor-id="downloading-data">Downloading Data</h2>
<p>Once you’ve found data, filled out your data inventory, and decided which datasets you actually want, it’s time to download some of them! There are several methods you can use, and no single method will work in every case, so it’s important to be at least somewhat familiar with several of these tools.</p>
<p>Most of these methods will work regardless of the format of the data (i.e., its file extension) but the format of the data will be important when you want to ‘read in’ the data and begin to work with it.</p>
<div class="callout callout-style-default callout-note no-icon callout-titled">
<div class="callout-header d-flex align-content-center">
<div class="callout-icon-container">
<i class="callout-icon no-icon"></i>
</div>
<div class="callout-title-container flex-fill">
Activity: Data Download
</div>
</div>
<div class="callout-body-container callout-body">
<ul>
<li>Each member works on the dataset they have been assigned.</li>
<li>Discuss with your group how to collaborate on coding without creating merge conflicts
<ul>
<li><em>Many right answers here so discuss the pros/cons of each and pick one that feels best for your group!</em></li>
</ul></li>
<li>Write a script <strong>for your group</strong> to download data using your chosen method</li>
<li>Zoom rooms for each download method will be available. You are encouraged to join the room that corresponds to your chosen method to discuss with others working on the same approach.
<ul>
<li>If no datasets in your group’s inventory need the download method you chose, try to run the example code included below</li>
</ul></li>
</ul>
</div>
</div>
<p>Below are some example code chunks for five methods of downloading data in a scripted way. There will be contexts where only a <u>G</u>raphical <u>U</u>ser <u>I</u>nterface (“GUI”; [GOO-ee]) is available but the details of that method of downloading are usually specific to the portal you’re accessing so we won’t include an artificial general case.</p>
<div class="tabset-margin-container"></div><div class="panel-tabset">
<ul class="nav nav-tabs" role="tablist"><li class="nav-item" role="presentation"><a class="nav-link active" id="tabset-3-1-tab" data-bs-toggle="tab" data-bs-target="#tabset-3-1" role="tab" aria-controls="tabset-3-1" aria-selected="true">Data Entity URL</a></li><li class="nav-item" role="presentation"><a class="nav-link" id="tabset-3-2-tab" data-bs-toggle="tab" data-bs-target="#tabset-3-2" role="tab" aria-controls="tabset-3-2" aria-selected="false">R Package</a></li><li class="nav-item" role="presentation"><a class="nav-link" id="tabset-3-3-tab" data-bs-toggle="tab" data-bs-target="#tabset-3-3" role="tab" aria-controls="tabset-3-3" aria-selected="false">Batch Download</a></li><li class="nav-item" role="presentation"><a class="nav-link" id="tabset-3-4-tab" data-bs-toggle="tab" data-bs-target="#tabset-3-4" role="tab" aria-controls="tabset-3-4" aria-selected="false">API Call</a></li><li class="nav-item" role="presentation"><a class="nav-link" id="tabset-3-5-tab" data-bs-toggle="tab" data-bs-target="#tabset-3-5" role="tab" aria-controls="tabset-3-5" aria-selected="false">Command Line</a></li></ul>
@@ -867,8 +834,33 @@ <h2 class="anchored" data-anchor-id="downloading-data">Downloading Data</h2>
</div>
</div>
</div>
<section id="data-format-and-structure" class="level3">
<h3 class="anchored" data-anchor-id="data-format-and-structure">Data format and structure</h3>
<div class="callout callout-style-default callout-note no-icon callout-titled">
<div class="callout-header d-flex align-content-center">
<div class="callout-icon-container">
<i class="callout-icon no-icon"></i>
</div>
<div class="callout-title-container flex-fill">
Activity: Data Download
</div>
</div>
<div class="callout-body-container callout-body">
<ul>
<li>Each member works on the dataset they have been assigned.</li>
<li>Discuss with your group how to collaborate on coding without creating merge conflicts
<ul>
<li><em>Many right answers here so discuss the pros/cons of each and pick one that feels best for your group!</em></li>
</ul></li>
<li>Write a script <strong>for your group</strong> to download data using your chosen method</li>
<li>Zoom rooms for each download method will be available. You are encouraged to join the room that corresponds to your chosen method to discuss with others working on the same approach.
<ul>
<li>If no datasets in your group’s inventory need the download method you chose, try to run the example code included below</li>
</ul></li>
</ul>
</div>
</div>
</section>
<section id="data-format-and-structure" class="level2">
<h2 class="anchored" data-anchor-id="data-format-and-structure">Data format and structure</h2>
<p>CSV and TXT are common formats for data storage. In addition, formats like NetCDF, HDF5, MATLAB, and RData/RDS are frequently used in research, along with spatial datasets such as GeoTIFF, shapefiles, and raster files (refer to the spatial module for more details).</p>
<p>In the R environment, data structures are typically checked using the following functions.</p>
<div class="cell">
@@ -892,7 +884,6 @@ <h3 class="anchored" data-anchor-id="data-format-and-structure">Data format and
<span id="cb4-18"><a href="#cb4-18" aria-hidden="true" tabindex="-1"></a><span class="fu">anyNA</span>(lobster_df)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
</section>
</section>
<section id="additional-resources" class="level2">
<h2 class="anchored" data-anchor-id="additional-resources">Additional Resources</h2>
<section id="papers-documents" class="level3">