In Earth Science, the scientific community often needs to analyze a large volume of numerical scientific data using tools that facilitates open and reproducible science.
Python is a widely used, open-source programming language. In Earth science, scientific programming languages like Python, help you speed up and automate lengthy tasks like selecting and downloading large datasets or performing repetitive calculations that you might otherwise have to do manually.
- Describe the main open reproducible science tools (bash, Jupyter Notebooks, Github, …)
- List the main file formats for Earth Data Science.
- Identify the main Python libraries used in geospatial Data Science.
- Use the Xarray and Zarr libraries to work with multidimensional data structures.
- Use of the Geopandas library for working with vector data.
- Use of the rasterio library to work with raster images.
- Discuss current technologies in Cloud Optimized Data Structures (GeoJSON, Cloud Optimized GeoTIFF (COG), Cloud Optimized Point Clouds (COPC)) and SpatioTemporal Asset Catalog (STAC) catalogues.
- Introduction to Geospatial Data Science
- List of General Geo Data Science Applications
- Introduction to Xarray
- Cloud Native Data in Geosciences
- Using Cloud Optimized GeoTIFF Files (COG) Examples
- Xarray & Zarr Example
- Introduction to COPC and LIDAR Point Cloud (DRAFT)
- UA Data Science Workshops
- UA Data Science Institute Events Calendar
- Digital Learning Resources Library. Data Science Institute. The University of Arizona.
Updated: 02/08/2023
Carlos Lizárraga, Data Lab, Data Science Institute, University of Arizona.