Predicting Regional Spatial Distribution of Soil Texture Fractions in Sistan Flood Plain using Random Forest Method
Subject Areas : New topics in soil physicsMohammad Shahriari 1 , Masumeh Delbari 2 , Peyman Afrasiab 3 , Mohammadreza Pahlavan-Rad 4
1 - Researcher, Agricultural Engineering Research Department, Agricultural and Natural Resources Research and Education Center, AREEO, Zabol, Iran.
2 - Associate Professor of Irrigation and Drainage Engineering Department of Water Engineering Faculty of Water and Soil University of Zabol Bonjar Ave., Zabol, Iran P.O. Box: 98615-538,
3 - Associate Professor of Irrigation and Drainage Engineering Department of Water Engineering Faculty of Water and Soil University of Zabol, Iran.
4 - Soil and Water Research Department, Golestan Agricultural and Natural Resources Research and Education Center, AREEO, Gorgan, Iran.
Keywords: Environmental variable, Random forest, Remote sensing, Soil texture, Spatial variation,
Abstract :
Introduction: Soil properties are highly spatially variable in flood plains. Soli texture is an important physical soil properties that have affect many agricultural and environmental activities, so it has strongly influenced water retention curve, fertility, drainage and porosity. So, knowledge on its spatial distribution is essential especially in alluvial plains and large scale. Field studies of Spatial Distribution of soil properties, especially on a large scale and in floodplains, a large number of soil samples may need to be collected, which is cost- and time-consuming. Digital soil mapping (DSM) method using remote sensing data are coupled as significant impact on predicting spatial distribution soil properties.
Methods: The aim of this study was to predict the spatial distribution of soil texture fractions in Sistan flood plain at a regional scale (area 1300 km2). In this study, 160 soil samples collected under different of various soil series of the surface layer (0–30 cm) in the agriculture land of Sistan plain and soil texture fractions including percentage of sand, silt and clay content were measured. So, remotely sensed data including Landsat 8’s Band 1, Band 2, Band 3, Band 4, Band 5, Band 6, Band 7, Band 8 and Band 4/ Band 8, Band 4/ Band 3, NDVI index, brightness index, clay index, grain size index were used as auxiliary variables for interpolation of soil texture fractions. Random forest technique was used to examine the relation between auxiliary variables and the soil texture components. Random forest is a developed model of classification and regression tree (CART). In the RF model, hundreds or thousands of classification trees are produced. 80 % of data was used for prediction and 20 % of data was used for validation, and RMSE, nRMSE, Willmott index (dr), Effectiveness index (EF), MBE and MAE were used for evaluation.
Results: Pearson's correlation analysis showed that among soil texture fractions, sand content has the highest significant relationship with the most environmental variables. Band 8 had the highest correlation with sand, silt and clay content. The findings of the research show that the use of remote sensing data has increased the accuracy of predictions. The results show that the values of RMSE and MAE are lower for prediction set than validation set whereas the values of ME are similar for both sets. The values of RMSE of estimating percentage of sand, silt, clay at validation sites using RF method were 15.42, 12.56 and 8.97 %, respectively. Also, the values of RMSE of estimation by ordinary kriging were 18.2, 9.53 and 15.1% for sand, silt and clay, respectively that were 18, 5.9 and 11.2 % higher than those obtained by RF model. Also, the values of nRMSE were 0.19, 0.13 and 0.2 for prediction dataset and it was 0.39, 0.21 and 0.34 for validation dataset for sand, silt and clay fractions, respectively. The results of dr coefficient value shows that the modeling has been done with acceptable accuracy. Also value of EF shows that spatial maps of soil texture fraction produced by using RF model has good accurate.
So, RF method when combined by remotely sensed data is a suitable method for mapping soil texture fractions in a regional scale. Also, between auxiliary variables, results showed that the clay index and grain size index were the most important environmental variables for predicting soil texture by the random forest method in the study area. The results of Wilmot's coefficient of agreement (dr) show that the modeling has been carried out with acceptable accuracy. Also, the evaluation of the efficiency factor (EF) values of the model shows that the random forest method has correctly produced the maps of soil texture components in the studied area. Other environmental variables such as Band 4 - Band 8 ratio, Band 1, Band 8 and Band 7 also influenced soil texture fractions prediction.
Conclusion: Remote sensing data combined with the random forest model can be applied for an appropriate prediction of spatial distribution pattern of soil texture fractions in large scale floodplains with a hot and dry climate condition. Highly of RMSE value for sand and silt than clay, which could be due to the wider range of silt and sand over the study region. Another reason for this could be related to the number of samples used. Therefore, it is recommended that for better accuracy in soil property maps, especially physical properties, the number of soil sampling points be increased, and optimal sampling points in these areas be determined. For future works, the use of other co-variables such as land use map, distance from the river, soil series, and salinity map or remote sensing data of smaller resolution, as well as hyperspectral visible and near-infrared reflectance spectroscopy should be evaluated for a regional spatial prediction of soil fractions in floodplains.
Akpa, S.I.C., Odeh, I.O.A., Bishop, T.F.A., Hartemink, A.E., 2014. Digital mapping of soil particle size fractions for Nigeria. Soil Sci. Soc. Am. J. 78, 1953–1966.
Boettinger, J.L., Ramsey, R.D., Bodily, J.M., Cole, N.J., Kienast_Brown, S., Nield, S.J., Saundes,A.M., Stum, A.K., 2008. Landsat spectral data for digital soil mapping. In: Hartemink, A.E., McBratney, A.B., Mendonca Santos, M.L. (Eds.) , Digital Soil MappingWith Limited Data. Springer science, Australia, pp. 193–203
Bui, E. N., Ballabio, C., & Panagos, P., 2020. Soil texture mapping over Europe: Machine learning and its potential for sustainable soil management. European Journal of Soil Science, 71(1), 13-28.
Carvalho Junior, W., Chagas, C. D. S., Pereira, G. T. 2020. Regional-scale digital soil mapping in tropical areas using machine learning: A case study in Brazil. Geoderma, 363, 114151.
Carvalho Junior, W., Lagacherie, P., Chagas, C.S., Calderano Filho, B., Bhering, S.B., 2014. A regional-scale assessment of digital mapping of soil attributes in a tropical hillslope environment. Geoderma 232, 479–486.
Chagas, C.S., Junior, W.C., Bhering, S.B and Filho, B.C. 2016. Spatial prediction of soil surface texture in a semiarid region using random forest and multiple linear regressions. Catena, 139: 232–240
Chen, J.C., Chang, N.B., and Shieh, W.K. (2003). Assessing wastewater reclamation potential by neural network model. Journal of Engineering Application of Artificial Intelligence, 16:149-157.
Demattê, J.A.M., Fiorio, P.R., Ben-Dor, E., 2009. Estimation of soil properties by orbital and laboratory reflectance means and its relation with soil classification. Open Remote Sens. J. 2, 12–23.
Gambill, D.R., Wall, W.A., Fulton, A.J. and Howard, H.R. 2016. Predicting USCS soil classification from soil property variables using Random Forest. Journal of Terramechanics, 65:85–92
Heung, B., Bulmer, C. E., Schmidt, M. G., 2016. Predictive soil parent material mapping at a regional-scale: A random forest approach. Geoderma, 279, 68-80.
Jafari A., Finke P.A., de Wauw J.V., Ayoubi S., and Khademi H. 2012. Spatial prediction of USDA- great soil groups in the arid Zarand region, Iran: comparing logistic regression approaches to predict diagnostic horizons and soil types. E. J. Soil Sci, 63: 284–298.
Islam, K., Singh, B., McBratney, A., 2003. Simultaneous estimation of several soil properties by ultra-violet, visible, and near infrared reflectance spectroscopy. Aust. J. Soil Res. 41, 1101–1114.
Liao, K., Xu, S.,Wu, J., Zhu, Q., 2013. Spatial estimation of surface soil texture using remote sensing data. Soil Sci. Plant Nutr. 59, 488–500.
Liaw, A. and Wiener, M. 2002. Classification and regression by random Forest. R news The newsletter of the R project, 2(3): 18-22.
Ließ, M., Glaser, B., and Huwe, B. 2012. Uncertainty in the spatial prediction of soil texture comparison of regression tree and random forest models. Groderma, 170: 70-79.
Mirzaee, S.; Ghorbani-Dashtaki, S.; Mohammadi, J.; Asadi, H.; Asadzadeh, F. Spatial variability of soil organic matter using remote sensing data. Catena 2016, 145, 118–127.
Pahlavan-Rad, M.R., K.H. Dahmardeh, M. Hadizadeh, G. Keykha, N. Mohammadnia, M. Gangali, M. Keikha, N. Davatgar, C. Brungard. 2020. Prediction of soil water infitration using multiple linear regression and random forest in a dry flod plain, eastern Iran. Catena, 194(2020) 104715.
Poggio, L., de Sousa, L. M., Kempen, B., Heuvelink, G. B. M., Brus, D. J. 2021. Soil property mapping using machine learning models: A review and challenges. Earth-Science Reviews, 223, 103786.
Pringle, M.J., Romano, N., Minasny, B., Chirico, G.B., and Lark, R.M. 2007. Spatial evaluation of pedotransfer functions using wavelet analysis. Hydrology, 333: 182-198.
R Development Core Team, 2007. R: A Language and Environment for Statistical Computing.
Zhao, W., Zhang, G. L., Zhao, Y. G., & Pan, W., 2018. Predicting soil properties using remote sensing data and machine learning models. Geoderma, 322, 1-10.
Shahriari, M., Delbari, M., Afrasiab, P., & Pahlavan-Rad, M. R. 2019. Predicting regional spatial distribution of soil texture in floodplains using remote sensing data: A case of southeastern Iran. Catena, 182, 104149.
Taghizadeh-Mehrjardi, R., Schmidt, K., Mahmoodi, S., Omid, M. 2020. Spatial prediction of soil properties using machine learning techniques: A case study from Iran. Soil Systems, 4(4), 63.
Vaysse, K., Lagacherie, P., 2015. Evaluating Digital Soil Mapping approaches for mapping Global Soil Map soil properties from legacy data in Languedoc-Roussillon (France).Geoderma Reg. 4, 20–30.
Willmott, C.J., Robeson, S.M., Matsuura, K. (2011). A refined index of model performance. International Journal of Climatology, 32(13): 2088-2094.
Xiao, J., Shen, Y., Tateishi, R., Bayaer, W., 2006. Development of topsoil grain size index for monitoring desertification in arid land using remote sensing. Int. J. Remote Sens. 12, 2411–2422.