Preparation of flood_prone areas prediction map using random forest machine learning in the Siahkhor Watershed of Kermanshah Province
Subject Areas : Sustainable Development
Ali Kiani
1
,
Baharak Motamedvaziri
2
,
Mohammad Reza Khaleghi
3
,
Hassan Ahmadi
4
1 - PhD Candidate, Department of Nature Engineering, Faculty of Natural Resources and Environment, Science and Research Branch, Islamic Azad University, Tehran, Iran.
2 - Associate Professor, Department of Nature Engineering, Faculty of Natural Resources and Environment, Science and Research Branch, Islamic Azad University, Tehran, Iran.*(Corresponding Author)
3 - Associate Professor, Department of Agriculture and Natural Resources, Torbate- Jam Branch, Islamic Azad University, Torbate- Jam, Iran.
4 - گروه سیاست دانشگاه تهران
Keywords: Flood, Siahkhor Watershed, Site Selection, Machine Learning Algorithm.,
Abstract :
Background and Objective: Flash floods are considered among the most dynamic natural disasters, necessitating measures to minimize economic damages, adverse effects, and their consequences by enhancing flood sensitivity. Therefore, mapping the sensitivity of areas prone to flooding is a critical step towards flood management. Due to the scarcity of information in most of the country's watershed areas, many researchers rely on spatial analyses for hydrological studies and flood mapping. Consequently, identifying the most significant factors influencing the creation and intensification of floods, as well as mapping their sensitivity, can be one of the most important strategies for reducing flood risk. In the current study, the Siah Khor watershed in the northeast of Islam Abad-e Gharb (Kermanshah province) was selected for predicting flood-prone areas and identifying key factors affecting their occurrence.
Material and Methodology: To identify flood-prone areas in this study region, the Random Forest machine learning (ML) algorithm was employed within a Python environment. A map depicting the distribution of past flood events was created to predict future flooding.
Findings: Fifty-three flood events were recorded in the area, and forty-nine regions were identified as non-flood zones; 70% of the data was used for modeling and 30% for validation purposes. After reviewing previous studies and surveying the study area, eight influential factors (slope, drainage density, land use, distance from the river, geological characteristics, rainfall, slope direction, and elevation) were selected for flood zoning. The Relative Operational Characteristic (ROC) curve was utilized for model validation and efficiency evaluation.
Predicting Flood-Prone Areas Using Machine Learning Methods in the Siahkhor Watershed of Kermanshah Province
Discussion and Conclusion: The results indicated that among the eight factors affecting flood zones, rainfall (0.35), distance from the river (0.27), and elevation (0.21) have the most significant impact on flooding in the study area, respectively. Furthermore, the evaluation of model outputs revealed that the Area Under the Curve (AUC) value for the Random Forest (RF) model was 0.98, demonstrating the high efficiency and accuracy of the RF model in mapping flood sensitivity in the study area. The largest area of flood sensitivity in the RF model corresponds to the very low category. The findings suggest that employing the Random Forest machine learning (ML) algorithm can be effectively used in analyzing flood risk.
1. Sattarzadeh I, Amirpoua S, Haji Kandi H, Sadeghian MS. Spatial prediction of flood-prone areas in the Karkheh watershed of Lorestan province using the combined random forest model - weight of evidence. Watershed Research. 2023;36(2):87-103.
2. Fang J, Zhang C, Fang J, Liu M, Luan Y. Increasing exposure to floods in China revealed by nighttime light data and flood susceptibility mapping. Environmental Research Letters. 2021;16(10):104044.
3. Costache R, Arabameri A, Elkhrachy I, Ghorbanzadeh O, Pham QB. Detection of areas prone to flood risk using state-of-the-art machine learning models. Geomatics, Natural Hazards and Risk. 2021;12(1):1488-507.
4. Yousefi, H., Yonesi, H. A., Davoudimoghadam, D., Arshia, A., & Shamsi, Z.Determination of Flood potential Using CART, GLM and GAM Machine learning Models. Irrigation and Water Engineering, 2022; 12(4), 84-105.
5. Chezgi J, Poyan S. Determining Flood-Prone Areas Using Machine Learning Models in the Shahrestank Watershed Area of Khosef City. jwmseir 2024; 17 (63) : 4, 38-50.
6. Rezaei F, Pourqasmi H, Shamsi F, Khosravi S, Rasoul R. Spatial modeling and preparation of flood potential map using machine learning algorithms (case study: Bushehr province),. Comprehensive Watershed Management. 2024.
Qanawati E, Safari A, Beheshti JE, Mansoorian I. Flood potential zoning using CN and AHP hydrological model integration in GIS environment, case study: Balkhlo river basin. Zagros Landscape Geographical Quarterly. 2013;7(25):67-80.
7. Chen W., Li Y., Xue W., Shahabi H., Li Sh., Hong H., Wang X., Bian H., Zhang Sh, Pradhan B., Bin Ahmad B. Modeling flood susceptibility using data-driven approaches of naïve Bayes tree, alternating decision tree, and random forest methods, Science of The Total Environment,2020; 701,134979,ISSN 0048-9697.
8. Band, S.S., Janizadeh, S., Chandra Pal, S., Saha, A., Chakrabortty, R.; Melesse, A.M.; Mosavi, A. Flash Flood Susceptibility Modeling Using New Approaches of Hybrid and Ensemble Tree-Based Machine Learning Algorithms. Remote Sens. 2020; 12, 3568.
9. Isazade, V., Qasimi, A. B., Al Kafy, A., Dong, P., & Mohammadi, M. Simulation of flood-prone areas using machine learning and GIS techniques in Samangan Province, Afghanistan. Geodesy and Cartography, 2024; 50(1), 20–29.
10. Ghanavati E. A., Safari A., Beheshti Javid E.,Mansourian E.(2014). Flood Risk Zonation Using Compilation CN Model and AHP Via GIS (Case Study: River Basin Balekhlo), Physical Geography Quarterly, 25, 67-80.
11. Cheraghi Ghale-Seri A, Habibnejad R, Mahmoud SH. Preparation of flood sensitivity map using support vector machine (SVM) model and geographic information system. Natural Environment Hazards. 2019; 9(25):61-80.
12. Malekian A, Khozani O, Ashornjad G. Flood risk zoning of Akhtarabad watershed using fuzzy hierarchical analysis method. Natural Geography Research. 2013; 4(4):131-52.
13. Yamani M, Enayati M. The relationship between the geomorphological characteristics of basins and the flood potential. Geographical researches. 2015; 37(54):47-57.
14. Garde R. River morphology; 2006.
15. Mousavi SM, Saeed N, Rakhshani MH, Hosseinzadeh SM. Evaluation and zoning of flood risk using TOPSIS fuzzy logic in GIS environment, case study: Baghmolek watershed. Natural Environment Hazards. 2005;5(10):79-97.
16. Nasrin Nejad NE, Rangzen K, Nasraleh K, Saberi A. Zoning of flood potential of Baghan watershed using Fuzzy Hierarchy Analysis (FAHP), remote sensing and geographic information system in natural resources (application). GIS in Natural Resources Sciences. 2013; 5(4):15-34.
17. Parastar S. Zonation of flood hazard in the watershed Balkhly chay using by ANP model, M.Sc in Physical Geography (Hydro Geomorphology), University of Mohaghegh Ardabili, 2014, 617 p.
18. Talebi A, Gudarzi S, Pourqasmi HR. Investigating the possibility of preparing a landslide risk map using random forest algorithm (study area: Sardarabad watershed, Lorestan province). Natural Environment Hazards. 2017;7(16):46-54.
19. Breiman L. Random forests. Machine learning. 2001;45:5-32.
20. Hastie T, Tibshirani R, Friedman JH, Friedman JH. The elements of statistical learning: data mining, inference, and prediction: Springer; 2009.
21. Lee S, Kim J-C, Jung H-S, Lee MJ, Lee S. Spatial prediction of flood susceptibility using random-forest and boosted-tree models in Seoul metropolitan city, Korea. Geomatics, Natural Hazards and Risk. 2017;8(2):1185-203.
22. Avand MT, Moradi HR, Ramzanzadeh M. Preparation of flood sensitivity map using two random forest machine learning models and Bayesian generalized linear model. Environment and Water Engineering. 2019;6(1):83-95.
23. Timuri M, Vakili Tejareh F, Mazin M, Ramezani M. Comparison of machine learning models in flood sensitivity zoning of Karaj Dam watershed. Iranian Journal of Watershed Science and Engineering. 2023;17(61):30-40.
24. Razavi Terme SV, Malek MR. Preparation of flood sensitivity map using the combination of intuitive belief model (EBF) and analysis hierarchy process (AHP) case study: Jahorm city. Journal of Mapping and Geospatial Information Engineering. 2016;8(3):1-15.
25. Rostami F, Tavakoli M, Rostami N, Ebrahimi H. Assessing the flood potential of watersheds using hierarchical analysis process (case study: Ilam city watershed). Comprehensive watershed management. 2021;1(1):1-16.
26. Kayani Asl MA, Motshefa B, Roshan SH. Evaluation of machine learning algorithms (RF and SVM) in producing flood sensitivity map of Maron watershed. Iranian Journal of Watershed Science and Engineering. 2023;17(61):41-51.
27. Islam ARMT, Talukdar S, Mahato S, Kundu S, Eibek KU, Pham QB, et al. Flood susceptibility modelling using advanced ensemble machine learning models. Geoscience Frontiers. 2021;12(3):101075.
28. Arab N, Mahini S, A M, Tabrizi A, Witte T. Flood risk analysis using random forest machine learning method (case study: Mashhad city). Ecohydrology. 2023;10(1):1-15.