Influence of random forest hyperparameterization on short-term runoff forecasting in an Andean mountain catchment

dc.contributor.author: Contreras Andrade, Pablo Andrés
dc.contributor.author: Orellana Alvear, Johanna Marlene
dc.contributor.author: Muñoz, Paul
dc.contributor.author: Bendix, Jorg
dc.contributor.author: Célleri Alvear, Rolando Enrique
dc.date.accessioned: 2022-01-26T16:15:58Z
dc.date.available: 2022-01-26T16:15:58Z
dc.date.issued: 2021
dc.description.abstract: The Random Forest (RF) algorithm, a decision-tree-based technique, has become a promising approach for applications addressing runoff forecasting in remote areas. This machine learning approach can overcome the limitations of scarce spatio-temporal data and physical parameters needed for process-based hydrological models. However, the influence of RF hyperparameters is still uncertain and needs to be explored. Therefore, the aim of this study is to analyze the sensitivity of RF runoff forecasting models of varying lead time to the hyperparameters of the algorithm. For this, models were trained using (a) default and (b) extensive hyperparameter combinations through a grid-search approach that allows the optimal set to be reached. Model performances were assessed based on the R2, %Bias, and RMSE metrics. We found that: (i) The most influential hyperparameter is the number of trees in the forest; however, the combination of the tree depth and number-of-features hyperparameters produced the highest variability (instability) in the models. (ii) Hyperparameter optimization significantly improved model performance for higher lead times (12 and 24 h). For instance, the performance of the 12-h forecasting model under default RF hyperparameters improved to R2 = 0.41 after optimization (gain of 0.17). However, for short lead times (4 h) there was no significant model improvement (0.69 < R2 < 0.70). (iii) There is a range of values for each hyperparameter in which the performance of the model is not significantly affected but remains close to the optimal. Thus, a compromise between hyperparameter interactions (i.e., their values) can produce similarly high model performances. Model improvements after optimization can be explained from a hydrological point of view: for lead times larger than the concentration time of the catchment, the generalization ability of the models tends to rely more on hyperparameterization than on what they can learn from the input data. This insight can help in the development of operational early warning systems.
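As an illustration of the three evaluation metrics named in the abstract, the following is a minimal sketch in plain Python. It is not the authors' code. It assumes that R2 denotes the coefficient of determination and that %Bias is the summed error relative to the summed observations (sign conventions for percent bias vary between studies); the function names and the sample runoff values are hypothetical.

```python
import math


def r2(obs, sim):
    """Coefficient of determination (assumed meaning of R2 here)."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - s) ** 2 for o, s in zip(obs, sim))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot


def percent_bias(obs, sim):
    """Percent bias; positive means overestimation under this convention."""
    return 100.0 * sum(s - o for o, s in zip(obs, sim)) / sum(obs)


def rmse(obs, sim):
    """Root mean square error, in the units of the observations."""
    return math.sqrt(sum((o - s) ** 2 for o, s in zip(obs, sim)) / len(obs))


# Hypothetical observed and forecast runoff values, for illustration only.
observed = [1.0, 2.0, 3.0, 4.0]
forecast = [2.0, 3.0, 4.0, 5.0]

print(r2(observed, forecast))
print(percent_bias(observed, forecast))
print(rmse(observed, forecast))
```

In a grid search such as the one described, metrics like these would be computed for every hyperparameter combination and lead time, and the combination with the best score retained.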
dc.identifier.doi: 10.3390/atmos12020238
dc.identifier.issn: 2073-4433
dc.identifier.uri: https://www.scopus.com/record/display.uri?eid=2-s2.0-85101248839&origin=resultslist&sort=plf-f&src=s&st1=Influence+of+random+forest+hyperparameterization+on+short-term+runoff+forecasting+in+an+andean+mountain+catchment&sid=37d80a7ea6b5002218992762007f2f6b&sot=b&sdt=b&sl=128&s=TITLE-ABS-KEY%28Influence+of+random+forest+hyperparameterization+on+short-term+runoff+forecasting+in+an+andean+mountain+catchment%29&relpos=0&citeCnt=4&searchTerm=
dc.language.iso: es_ES
dc.source: Atmosphere
dc.subject: Machine learning
dc.subject: Optimal hyperparameters
dc.subject: Random forest
dc.subject: Runoff forecasting
dc.subject: Tropical Andes
dc.title: Influence of random forest hyperparameterization on short-term runoff forecasting in an Andean mountain catchment
dc.type: Article
dc.ucuenca.afiliacion: Contreras, P., Universidad de Cuenca, Departamento de Recursos Hídricos y Ciencias Ambientales, Cuenca, Ecuador
dc.ucuenca.afiliacion: Orellana, J., Universidad de Cuenca, Departamento de Recursos Hídricos y Ciencias Ambientales, Cuenca, Ecuador
dc.ucuenca.afiliacion: Muñoz, P., Universidad de Cuenca, Departamento de Recursos Hídricos y Ciencias Ambientales, Cuenca, Ecuador
dc.ucuenca.afiliacion: Bendix, J., University of Marburg, Marburg, Germany
dc.ucuenca.afiliacion: Célleri, R., Universidad de Cuenca, Departamento de Recursos Hídricos y Ciencias Ambientales, Cuenca, Ecuador
dc.ucuenca.areaconocimientofrascatiamplio: 1. Natural and Exact Sciences
dc.ucuenca.areaconocimientofrascatidetallado: 1.5.8 Environmental Sciences
dc.ucuenca.areaconocimientofrascatiespecifico: 1.5 Earth and Environmental Sciences
dc.ucuenca.areaconocimientounescoamplio: 05 - Physical Sciences, Natural Sciences, Mathematics and Statistics
dc.ucuenca.areaconocimientounescodetallado: 0521 - Environmental Sciences
dc.ucuenca.areaconocimientounescoespecifico: 052 - Environment
dc.ucuenca.correspondencia: Orellana Alvear, Johanna Marlene, johanna.orellana@ucuenca.edu.ec
dc.ucuenca.cuartil: Q2
dc.ucuenca.factorimpacto: 0.699
dc.ucuenca.idautor: 0104826086
dc.ucuenca.idautor: 0104162268
dc.ucuenca.idautor: 0000-0002-8000-8840
dc.ucuenca.idautor: Sgrp-4890-004
dc.ucuenca.idautor: 0602794406
dc.ucuenca.indicebibliografico: SCOPUS
dc.ucuenca.numerocitaciones: 0
dc.ucuenca.urifuente: https://www.mdpi.com/journal/atmosphere
dc.ucuenca.version: Published version
dc.ucuenca.volumen: Volume 12, issue 2

Files

Original bundle

Name: documento.pdf
Size: 12.45 MB
Format: Adobe Portable Document Format
Description: document

Collections