Comparison of phenological weather indices based statistical, machine learning and hybrid models for soybean yield forecasting in Uttarakhand

Authors

  • YUNISH KHAN Department of Mathematics, Statistics and Computer science, G.B. Pant University of Agriculture and Technology, Pantnagar 263145, Uttarakhand, India
  • VINOD KUMAR Department of Mathematics, Statistics and Computer science, G.B. Pant University of Agriculture and Technology, Pantnagar 263145, Uttarakhand, India
  • PARUL SETIYA Department of Agrometeorology, College of Agriculture, G.B. Pant University of Agriculture and Technology, Pantnagar 263145, Uttarakhand, India
  • ANURAG SATPATHI Department of Agrometeorology, College of Agriculture, G.B. Pant University of Agriculture and Technology, Pantnagar 263145, Uttarakhand, India https://orcid.org/0000-0002-5884-4113

DOI:

https://doi.org/10.54386/jam.v25i3.2232

Keywords:

Crop yield prediction, Stepwise Multiple Linear Regression (SMLR), Principal component analysis (PCA), Artificial Neural Network (ANN)

Abstract

Early information exchange regarding predicted crop production could play a role in lowering the danger of food insecurity. In this study total six multivariate models were developed using past time series yield data and weather indices viz. SMLR, PCA-SMLR, ANN, PCA-ANN, SMLR-ANN and PCA-SMLR-ANN for three major soybean producing districts of Uttarakhand viz. Almora, Udham Singh Nagar and Uttarkashi. Further analysis was done by fixing 80% of the data for calibration and the remaining dataset for validation to predict soybean yield. Phenology wise average values were computed using the daily weather data. These average values are subsequently employed in the computation of both weighted and unweighted weather indices. The PCA-SMLR-ANN, SMLR-ANN and PCA-ANN models were found to be the best soybean yield predictor model for Almora, Udham Singh Nagar and Uttarkashi districts, respectively. The overall ranking based on the performances of the models for all locations can be given as: SMLR-ANN > PCA-ANN > PCA-SMLR-ANN ≈ ANN > PCA-SMLR > SMLR. The study results indicated that hybrid models outperformed the individual models well for all the study regions.

References

Ahmad, T., Sahoo, P. M., Singh, M. and Biswas, A. (2021). Crop Cutting Experiment techniques for determination of yield rates of field crops. DOI:10.13140/RG.2.2.16327.62883.

Aravind, K., Vashisth, A., Krishanan, P. and Das, B. (2022). Wheat yield prediction based on weather parameters using multiple linear, neural network and penalised regression models. J. Agrometeorol., 24(1): 18-25. DOI:https://doi.org/10.54386/jam.v24i1.1002

Banerjee, P., Mukherjee, B., Venugopalan, V. K., Nath, R., Sarath Chandran, M. A., Dessoky, E. S., Ismail, I. A., El-Hallous, E. I. and Hossain, A. (2021). Thermal Response of Spring–Summer-Grown Black Gram (Vigna mungo L. Hepper) in Indian Subtropics. MDPI. DOI:https://doi.org/10.3390/atmos12111489.

Basso, B., Cammarano, D. and Carfagna, E. (2013). Review of crop yield forecasting methods and early warning systems. In Proceedings of the first meeting of the scientific advisory committee of the global strategy to improve agricultural and rural statistics, FAO Headquarters, Rome, Italy (Vol. 18, p. 19).

Bhartiya, A., Chandra, N., Pal, R. S., Aditya, J. P. and Bajeli, J. (2017). Comparative yield gaps, economic impact and constraint analysis in front line demonstrations of soybean and black soybean in Uttarakhand hills. Indian J. Agric. Res., 51(5): 483-487.

Brejda, J. J., Moorman, T. B., Karlen, D. L. and Dao, T. H. (2000). Identification of Regional Soil Quality Factors and Indicators, I. Central and Southern High Plain. Soil Sci. Soc. Am. J., 64:2115–2124.

Cao, J., Zhang, Z., Luo, Y., Zhang, L., Zhang, J., Li, Z. and Tao, F. (2021). Wheat yield predictions at a county and field scale with deep learning, machine learning, and google earth engine. Eur. J. Agron., 123: 126204.

Dahikar, S.S. and Rode, S. V. (2014). Agricultural crop yield prediction using artificial neural network approach. Int. J. Inov. Res. Electr. Electron. Instrum. Control Eng., 2: 683–686.

Das, B., Nair, B., Reddy, V. K. and Venkatesh, P. (2018). Evaluation of multiple linear, neural network and penalised regression models for prediction of rice yield based on weather parameters for west coast of India. Int. J. Biometeorol., 62(10): 1809-1822.

Das, B., Nair, B., Arunachalam, V., Reddy, K. V., Venkatesh, P., Chakraborty, D. and Desai, S. (2020). Comparative evaluation of linear and nonlinear weather-based models for coconut yield prediction in the west coast of India. Int. J. Biometeorol., 64: 1111-1123.

Das, B., Murgaonkar, D., Navyashree, S. and Kumar, P. (2022). Novel combination artificial neural network models could not outperform individual models for weather-based cashew yield prediction. Int. J. Biometeorol., 66(8): 1627-1638.

FAO (2017). The future of food and agriculture – Trends and challenges. Rome. Available online: https://www.fao.org/3/i6583e/i6583e.pdf (Accessed on 20 April 2023).

Ji, Z., Yaozhong, P., Xiufang, Z., Jinyun, W. and Qiannan L. (2021). Prediction of Crop Yield Using Phenological Information Extracted from Remote Sensing Vegetation Index. Sensors., 21(4): 1406. DOI:https://doi.org/10.3390/s21041406.

Kuhn, M. (2008). Building Predictive Models in R Using the caret Package. J. Stat. Softw., 28: 1–26.

Kumar, S. (2019): Comparison of linear regression and artificial neural network technique for prediction of a soybean biodiesel yield. Energy Sources, Part A: Recovery, Utilization, and Environmental Effects., DOI:https://doi.org/10.1080/15567036.2019.1604858

Li, X., Sha, J. and Wang, Z. L. (2017). Chlorophyll-A Prediction of lakes with different water quality patterns in China based on hybrid neural networks. Water, 9: 524.

Mishra, M., Bhavyashree, S. and Sharma, H. (2017). Comparative study of the performance of artificial neural network and multiple linear regression techniques for predicting the soybean yield using its attributing characters. Int. J. Life Sci., 12(1): 177-181.

Rajaee, T. and Jafari, H. (2018). Utilization of WGEP and WDT models by wavelet denoising to predict water quality parameters in rivers. J. Hydrol. Eng., 23.

Satpathi, A., Setiya, P., Das, B., Nain, A. S., Jha, P. K., Singh, S. and Singh, S. (2023). Comparative Analysis of Statistical and Machine Learning Techniques for Rice Yield Forecasting for Chhattisgarh, India. Sustainability, 15(3): 2786.

Setiya, P., Satpathi, A., Nain, A. S. and Das, B. (2022). Comparison of weather-based wheat yield forecasting models for different districts of Uttarakhand using statistical and machine learning Techniques. J. Agrometeorol., 24(3): 255–261. DOI:https://doi.org/10.54386/jam.v24i3.1571

Soystats (2022). A Reference Guide to Important Soybean Facts & Figures, http://soystats.com/wp-content/uploads/Soy-Stats-2022-Web.pdf (Accessed on 20 April 2023).

Sridhara, S., Ramesh, N., Gopakkali, P., Das, B., Venkatappa, S. D., Sanjivaiah, S. H. and Elansary, H. O. (2020). Weather-based neural network, stepwise linear and sparse regression approach for rabi sorghum yield forecasting of Karnataka, India. Agron., 10(11): 1645.

Sridhara, S., Manoj, K. N., Gopakkali, P., Kashyap, G. R., Das, B., Singh, K. K. and Srivastava, A. K. (2023). Evaluation of machine learning approaches for prediction of pigeon pea yield based on weather parameters in India. Int. J. Biometeorol, 67(1): 165-180.

Verma, U., Piepho, H. P., Goyal, A., Ogutu, J. O. and Kalubarme, M. H. (2016). Role of climatic variables and crop condition term for mustard yield prediction in Haryana. Int. J. Agric. Stat. Sci., 12: 45–51.

Downloads

Published

31-08-2023

How to Cite

KHAN, Y., KUMAR, V., SETIYA, P., & SATPATHI, A. (2023). Comparison of phenological weather indices based statistical, machine learning and hybrid models for soybean yield forecasting in Uttarakhand. Journal of Agrometeorology, 25(3), 425–431. https://doi.org/10.54386/jam.v25i3.2232