Article CC BY 4.0
refereed
published

Developing automated machine learning approach for fast and robust crop yield prediction using a fusion of remote sensing, soil, and weather dataset

GND
1303872234
ORCID
0000-0001-9569-5420
Affiliation
Julius Kühn Institute (JKI), Institute for Strategies and Technology Assessment, Germany
Kheir, Ahmed M. S.;
ORCID
0000-0002-0656-0004
Affiliation
International Center for Agricultural Research in the Dry Areas(ICARDA), Egypt
Govind, Ajit;
ORCID
0000-0001-5148-8614
Affiliation
International Center for Agricultural Research in the Dry Areas(ICARDA), Morocco
Nangia, Vinay;
ORCID
0000-0002-2348-4816
Affiliation
International Center for Agricultural Research in the Dry Areas(ICARDA), Morocco
Devkota, Mina;
Affiliation
University of Kassel, Section of Soil Science, Faculty of Organic Agricultural Sciences, Germany
Elnashar, Abdelrazek;
ORCID
0000-0003-0525-5398
Affiliation
International Center for Agricultural Research in the Dry Areas(ICARDA), Egypt
Omar, Mohie El Din;
GND
143656902
ORCID
0000-0002-1978-9473
Affiliation
Julius Kühn Institute (JKI), Institute for Strategies and Technology Assessment, Germany
Feike, Til

Estimating smallholder crop yields robustly and timely is crucial for improving agronomic practices, determining yield gaps, guiding investment, and policymaking to ensure food security. However, there is poor estimation of yield for most smallholders due to lack of technology, and field scale data, particularly in Egypt. Automated machine learning (AutoML) can be used to automate the machine learning workflow, including automatic training and optimization of multiple models within a user-specified time frame, but it has less attention so far. Here, we combined extensive field survey yield across wheat cultivated area in Egypt with diverse dataset of remote sensing, soil, and weather to predict field-level wheat yield using 22 Ml models in AutoML. The models showed robust accuracies for yield predictions, recording Willmott degree of agreement, (d > 0.80) with higher accuracy when super learner (stacked ensemble) was used (R2 = 0.51, d = 0.82). The trained AutoML was deployed to predict yield using remote sensing (RS) vegetative indices (VIs), demonstrating a good correlation with actual yield (R2 = 0.7). This is very important since it is considered a low-cost tool and could be used to explore early yield predictions. Since climate change has negative impacts on agricultural production and food security with some uncertainties, AutoML was deployed to predict wheat yield under recent climate scenarios from the Coupled Model Intercomparison Project Phase 6 (CMIP6). These scenarios included single downscaled General Circulation Model (GCM) as CanESM5 and two shared socioeconomic pathways (SSPs) as SSP2-4.5and SSP5-8.5during the mid-term period (2050). The stacked ensemble model displayed declines in yield of 21% and 5% under SSP5-8.5 and SSP2-4.5 respectively during mid-century, with higher uncertainty under the highest emission scenario (SSP5-8.5). The developed approach could be used as a rapid, accurate and low-cost method to predict yield for stakeholder farms all over the world where ground data is scarce.

Preview

Cite

Citation style:
Could not load citation form.

Access Statistic

Total:
Downloads:
Abtractviews:
Last 12 Month:
Downloads:
Abtractviews:

Rights

License Holder: 2024 The Author(s).

Use and reproduction: