In this project I leverage the XGBoost machine learning library and publicly available weather data to forecast the date of the last spring freeze (LDSF).
In Figure 1, I show an example of minimum temperature and temperature fluctuation forecasts after Jan. 31, 2010. Generally, the average and variation of the true and predicted values are similar. We can also see from the purple vertical lines that the predicted and true last day of spring freeze (LDSF) are only 3 days apart. Farmers and gardeners often use the Farmers' Almanac, but its average-based predictions do not capture year-to-year variation. For example, using the average, the absolute difference from the true LDSF in 2010 is 7 days, more than twice as large as my prediction's error.
Figure 1. The true minimum temperature and temperature fluctuation distributions are shown in blue. The XGBoost model was trained on data prior to "today"; everything after "today" is forecast (except the true distribution, obviously). The data are noisy, so it is not surprising that the predictions are not perfect, but since we have prior information about the seasonality (a sine function), we can get pretty close. The machine learning model can then focus on the temperature fluctuations rather than the total temperature. `T_flucs(t+1)` is the predicted temperature fluctuation one day into the future; `T_flucs(t+7)` is the prediction seven days into the future. LDSF marks the last day of spring freeze for both the prediction and the truth.
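As a rough illustration of the decomposition described in the caption, the sketch below fits a sine curve to the daily minimum temperature and trains XGBoost only on the residual fluctuations. The file name, column names, lag features, and hyperparameters are assumptions for illustration, not the exact pipeline used here.

```python
import numpy as np
import pandas as pd
from scipy.optimize import curve_fit
from xgboost import XGBRegressor

# Assumed input: one row per day with columns
#   "doy"  - day of year (1-366)
#   "tmin" - daily minimum temperature
df = pd.read_csv("daily_weather.csv")  # hypothetical file name

# 1. Fit the seasonal component with a sine function of the day of year.
def seasonal(doy, amp, phase, offset):
    return amp * np.sin(2 * np.pi * doy / 365.25 + phase) + offset

params, _ = curve_fit(seasonal, df["doy"], df["tmin"], p0=[10.0, 0.0, 5.0])
df["t_seasonal"] = seasonal(df["doy"], *params)

# 2. The model only has to learn the fluctuations around the seasonal curve.
df["t_flucs"] = df["tmin"] - df["t_seasonal"]

# 3. Simple lag features: the previous seven days' fluctuations.
for lag in range(1, 8):
    df[f"t_flucs_lag{lag}"] = df["t_flucs"].shift(lag)
df = df.dropna()

features = [f"t_flucs_lag{lag}" for lag in range(1, 8)]

# 4. One regressor per horizon; here the 1-day-ahead case, i.e. predicting
#    today's fluctuation from the previous days. The t+7 model would shift
#    the target by seven days instead.
X = df[features]
y = df["t_flucs"]
model = XGBRegressor(n_estimators=200, max_depth=4, learning_rate=0.05)
model.fit(X, y)

# Forecast: seasonal curve + predicted fluctuation gives the minimum temperature.
df["tmin_pred"] = df["t_seasonal"] + model.predict(X)
```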
Here I show the mean absolute error (MAE) between the predicted and true LDSFs.
MAE(avg. LDSF over previous years, truth) | MAE(sine model, truth) | MAE(XGBoost 1-day forecast, truth)
---|---|---
10.2 days | 8.9 days | 7.6 days
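For reference, these MAE values are simply the average absolute difference, in days, between each year's predicted and true LDSF. A minimal sketch with made-up dates:

```python
import pandas as pd

# Hypothetical predicted vs. true last day of spring freeze (LDSF) per year.
pred_ldsf = pd.Series(pd.to_datetime(["2010-04-18", "2011-04-25", "2012-04-10"]))
true_ldsf = pd.Series(pd.to_datetime(["2010-04-15", "2011-05-02", "2012-04-20"]))

# MAE is the mean absolute difference in days between prediction and truth.
mae_days = (pred_ldsf - true_ldsf).dt.days.abs().mean()
print(f"MAE: {mae_days:.1f} days")  # -> 6.7 days for these made-up dates
```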
Using the XGBoost model, the MAE decreases by roughly 25% relative to the average-over-previous-years baseline ((10.2 - 7.6) / 10.2 ≈ 25%).