Statistical learning approach for modelling the effects of climate change on oilseed rape yield

Behzad Sharif, Jørgen Eivind Olesen, Kirsten Schelde

Abstract


Statistical learning is a fairly new term referring to a set of supervised and unsupervised modelling and prediction techniques. It is based on traditional statistics but has been highly enhanced inspired by developments in machine learning and data mining. The main focus of statistical learning is to estimate the functions that quantify relations between several parameters and observed responses. These functions are further used for prediction, inference or a combination of both. For a particular case of quantitative responses, regularization techniques in regression are developed to overcome the weaknesses of ordinary least square (OLS) regression in prediction. These new shrinkage methods outperform OLS for prediction, especially in large datasets.

In this study, a large dataset of field experiments on winter oilseed rape in Denmark for 22 years (1992 to 2013) was collected. Biweekly climatic data along with sowing date, harvest date, soil type and previous crop are considered as the explanatory variables. Yield of winter oilseed rape is considered as response variable.

LASSO and Elastic Nets are the regularization techniques used to estimate the functions. Hold-one-out cross validation method for testing the prediction power reveals that these techniques are much useful in both prediction and inference. Since these techniques are included in recent versions of some software packages (e.g. R), they can be easily implemented by users at any level.

The estimated function (model) is further used to predict the oilseed rape yield responses to climate change for several scenarios using representative weather data produced by a weather generator.


Full Text:

Presentation (PDF)



Previous issues and volumes can be found in the 'Archives' section.

You can refer to a paper published in this series in the following format Author (2013) Title. FACCE MACSUR Reports 2: D-C1.3, where "D-C1.3" is the article ID en lieu of page range.