Skip to content

clizarraga-UAD7/geo-datascience2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

geo-datascience

Data Science Tools and Methods in Earth Sciences

Description

In Earth Science, the scientific community often needs to analyze a large volume of numerical scientific data using tools that facilitates open and reproducible science.

Python is a widely used, open-source programming language. In Earth science, scientific programming languages like Python, help you speed up and automate lengthy tasks like selecting and downloading large datasets or performing repetitive calculations that you might otherwise have to do manually.

Learning goals:

  • Describe the main open reproducible science tools (bash, Jupyter Notebooks, Github, …)
  • List the main file formats for Earth Data Science.
  • Identify the main Python libraries used in geospatial Data Science.
  • Use the Xarray and Zarr libraries to work with multidimensional data structures.
  • Use of the Geopandas library for working with vector data.
  • Use of the rasterio library to work with raster images.
  • Discuss current technologies in Cloud Optimized Data Structures (GeoJSON, Cloud Optimized GeoTIFF (COG), Cloud Optimized Point Clouds (COPC)) and SpatioTemporal Asset Catalog (STAC) catalogues.

Topics

Jupyter Notebook examples

Presentations

More resources


Updated: 02/08/2023

Carlos Lizárraga, Data Lab, Data Science Institute, University of Arizona.

CC BY-NC-SA 4.0