Page 1 of 15
Transactions on Machine Learning and Artificial Intelligence - Vol. 10, No. 5
Publication Date: October, 25, 2022
DOI:10.14738/tmlai.105.13049. Karimanzira, D., Ritzau, L., & Emde, K. (2022) Catchment Area Multi-Streamflow Multiple Hours Ahead Forecast Based on Deep
Learning. Transactions on Machine Learning and Artificial Intelligence, 10(5). 15-29.
Services for Science and Education – United Kingdom
Catchment Area Multi-Streamflow Multiple Hours Ahead Forecast
Based on Deep Learning
Divas Karimanzira
Fraunhofer Institute of Optronics
System Technologies and Image Exploitation (IOSB)
Am Vogelherd 90, 98693 Ilmenau, Germany
Linda Ritzau
Fraunhofer Institute of Optronics
System Technologies and Image Exploitation (IOSB)
Am Vogelherd 90, 98693 Ilmenau, Germany
Katharina Emde
School 1Fraunhofer Institute of Optronic
System Technologies and Image Exploitation (IOSB)
Am Vogelherd 90, 98693 Karlsruhe, Germany
ABSTRACT
Modeling of rainfall-runoff is very critical for flood prediction studies in decision
making for disaster management. Deep learning methods have proven to be very
useful in hydrological prediction. To increase their acceptance in the hydrological
community, they must be physic-informed and show some interpretability. They
are several ways this can be achieved e.g. by learning from a fully-trained
hydrological model which assumes the availability of the hydrological model or to
use physic-informed data. In this work we developed a Graph Attention Network
(GAT) with learnable Adjacency Matrix coupled with a Bi-directional Gated
Temporal Convolutional Neural Network (2DGAT-BiLSTM). Physic-informed data
with spatial information from Digital Elevation Model and geographical data is used
to train it. Besides, precipitation, evapotranspiration and discharge, the model
utilizes the catchment area characteristic information, such as instantaneous slope,
soil type, drainage area etc. The method is compared to two different current
developments in deep learning structures for streamflow prediction, which also
utilize all the spatial and temporal information in an integrated way. One, namely
Graph Neural Rainfall-Runoff Models (GNRRM) uses timeseries prediction on each
node and a Graph Neural Network (GNN) to route the information to the target node
and another one called STA-LSTM is based on Spatial and temporal Attention
Mechanism and Long Short Term Memory (LSTM) for prediction. The different
methods were compared in their performance in predicting the flow at several
points of a pilot catchment area. With an average prediction NSE and KGE of 0.995
and 0.981, respectively for 2DGAT-BiLSTM, it could be shown that graph attention
mechanism and learning the adjacency matrix for spatial information can boost the
model performance and robustness, and bring interpretability and with the
inclusion of domain knowledge the acceptance of the models.
Page 2 of 15
16
Transactions on Machine Learning and Artificial Intelligence (TMLAI) Vol 10, Issue 5, October - 2022
Services for Science and Education – United Kingdom
Keywords: Deep learning; Streamflow prediction; Graph Attention Neural Networks,
Sequential model. Rainfall-Runoff Modelling.
INTRODUCTION
Among all extreme events such as droughts, fires, etc., flood is the most common and cause a
lot of deaths according to the UN [1,2]. Therefore, rainfall-runoff studies are very important in
several respects. Thy can be used to simulate flow peaking, changes in the water quantity and
quality and helps in decision on water resources management. Due to the nature of hydrological
processes, interconnectivity between several influencing variables, nonlinearity, etc, rainfall- runoff modelling has many challenges. Traditionally, physically-based or numerical
hydrological models are used for flood prediction, e.g. in [3], fully dynamic, diffusive, and
kinematic waves models are used to describe the surface water flow and produces very good
results in flow and water level prediction. Although the performance of the physically-based
models in terms of prediction are excellent, these models require various types of hydrological
and geomorphological observations. Their setup and operation are very time-expensive, which
makes short-term prediction required, not feasible [4]. Furthermore, developing a physically- based model requires in-depth domain knowledge and expertise on hydrological parameters
and model region, which is very challenging. Physically-based models postulates inserting
maltitudes of different types of hydrological data for learning, but in fact, it is known that there
are too many influencing factors of floods which are difficult to fully simulate them. Therefore,
Cea et al,[5] and others, suggest to use as less variables as possible to increase the
transferability of the models. For example Cea et al, [5], pointed out the correlation between
rainfall and runoff.
In recent years, data-driven and deep learning models are becoming the point of interest in
hydrological studies [6]. Mostly, LSTM (Long Short-Term Memory), TCN (Temporal
Convolutional Networks) are used as rainfall-runoff modelling is taken as a timeseries problem
[7,8,9,10].
If geospatial information such as topography and other watershed characteristics considering
the upstream-downstream relationships are integrated into deep learning, a combination of
Graph Neural Network (GNN) or Convolutional Neural Networks and LSTMs are used as in
[11],[12]. In this case, CNNs are used to extract and encode spatial information at each time
step which is then fed into an LSTM module for the time-series modeling and prediction. Due
to the nature of hydrological data that it is non-euclidean, using CNNs might not be physically
correct. The CNNs have convolution kernels with fixed size filter which assumes Euclidean data
and takes all the local neighbors of each node into consideration. In hydrology the fact that
streams or hillslopes are regionally close to one another does not mean that they are
hydrologically related. A more physically appealing solution is to use GNNs which capture the
upstream-downstream dependency graphically [13].
There are two different current developments in deep learning structures for streamflow
prediction. One from the authors [14] Graph Neural Rainfall-Runoff Models (GNRRM) assumes
raster information and uses timeseries prediction models on each grid cell and Graph Neural
Networks to aggregate grid cells with similar distance from the target node to compute its
streamflow. Through the GNN it is non-euclidean as it utilizes spatial information. The author
showed that for the sequence models in the nodes to model temporal behavior, BiGTCN