Skip to content

on exploring spatio-temporal encoding strategies for county-level yield prediction.

Notifications You must be signed in to change notification settings

ellaampy/SpatioTemporalYield

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

On exploring spatio-temporal encoding strategies for county-level yield prediction

Crop yield information plays a pivotal role in ensuring food security. Advances in Earth Observation technology and the availability of historical yield records have promoted the use of machine learning for yield prediction. Significant research efforts have been made in this direction, encompassing varying choices of yield determinants and particularly how spatial and temporal information are encoded. However, these efforts are often conducted under diverse experimental setups, complicating their inter-comparisons. In this paper, we present our findings on multiple strategies for encoding spatial-spectral information at the county level—specifically through average pixel values, pixel sampling, and image histograms alongside approaches for temporal information, including recurrent neural networks, temporal convolutions, and attention mechanisms.

Structure of the repository

  • conf - configuration management files for the experiments
  • data_preparation - dataset classes for transforming data (time x channel x pixels) into different data structures
  • models - model classes for classical and deep learning architectures
  • train_XXX.py - scripts to execute training of deep learning models according to input data structure

Study area

The United States of America (USA) is the world’s largest producer of corn, accounting for approximately one-third of global production. We conduct a case study focusing on the USA’s top five corn-producing states: Iowa, Illinois, Indiana, Nebraska, and Minnesota. Altogether, they accounted for over one-half of the USA’s corn(grain) production in 2021.

Example Image

Fig.1 : Map of the study area showing the difference in corn yield for 2012(drought) and 2011 (predrought)

Data

The effectiveness of machine learning for crop yield prediction also depends on selecting the appropriate sets of features. We review selected studies that apply machine learning (ML) to remote sensing data for predicting crop yields to determine our predictors. The figure below shows that the selected features adequately captures variations in yield.

  • MODIS surface reflectance (MODIS
  • Meterological factors (temperature and precipitation)
  • Spectral indices (NDVI and NDWI)

Example Image

Fig.2: Temporal variability in spectral reflectance, weather variables, and spectral indices for a pre-drought year (2011 -blue) and drought year (2012 - red) for a selected county in Nebraska.

Data download

The data used for the study is available on huggingface.

Experiment setup

The workflow follows the compression or transformation of spatio-temporal information into other data structures. Depending on the type of structure, a befitting machine learning model is applied.

Example Image

Fig.3: Demonstrating the data preparation workflow and experiment setup

Results

The figure presents the percentage difference between observed and predicted crop yield for the year 2021. Interested readers are encouraged to consult the main paper for additional evaluation metrics for the extensive list of models compared.

Example Image

Fig.4: Demonstrating the data preparation workflow and experiment setup

Citation

Please cite our work as:

TBD

Acknowledgement

Model implementations from this work are sourced from:

About

on exploring spatio-temporal encoding strategies for county-level yield prediction.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published