A. Scozzari et al. (eds.), ICT for Smart Water Systems: Measurements and Data Science, The Handbook of Environmental Chemistry 102, https://doi.org/10.1007/698_2019_403

Exploring Assimilation of Crowdsourcing Observations into Flood Models

M. Mazzoleni^{1, 2}, Leonardo Alfonso³ and D. P. Solomatine^{3, 4}

(1) Department of Earth Sciences, Uppsala University, Uppsala, Sweden

(2) Centre of Natural Hazards and Disaster Science (CNDS), Uppsala, Sweden

(3) IHE Delft Institute for Water Education, Delft, The Netherlands

(4) Delft University of Technology, Delft, The Netherlands

M. Mazzoleni

Email: maurizio.mazzoleni@geo.uu.se

1 Introduction

2 Crowdsourced Observations

3 Case Studies and Water-Related Models

3.1 Brue Catchment (UK)

3.2 Bacchiglione Catchment (Italy)

4 Model Updating Techniques

4.1 Kalman Filter

4.2 Ensemble Kalman Filter

4.3 Synthetic Flow Observations

4.4 Estimation of the Observational Error

5 Assimilation of Flow Observations from Static Heterogeneous Sensors

5.1 Assimilation of Synchronous Observations

5.2 Assimilation of Asynchronous Observations

6 Conclusions

References

Abstract

This chapter describes recent approaches for integrating heterogeneous observations from static social sensors within hydrological and hydrodynamic modelling to improve flood prediction. The distinctive characteristic of such sensors, with respect to traditional ones, is their varying lifespan and space-time coverage as well as their spatial distribution. The main part of the chapter is dedicated to the optimal assimilation of heterogeneous intermittent data within hydrological and hydraulic models. These approaches are designed to account for the intrinsic uncertainty in hydrological observations and in model structure, states and parameters. Two case studies, the Brue and Bacchiglione catchments, are considered. Finally, an evaluation of the developed methods is provided. This study demonstrates that networks of low-cost static and dynamic social sensors can complement traditional networks of static physical sensors for the purpose of improving flood forecasting accuracy. This can be a potential application of recent efforts to build citizen observatories of water, in which citizens not only can play an active role in information capturing, evaluation and communication but also can help improve models and increase flood resilience.

Keywords

Crowdsourced observations, Data assimilation, Flood forecasting, Hydraulic modelling, Hydrological modelling

1 Introduction

The impact of natural hazards on societies and economies has increased drastically in recent years due to many natural and anthropogenic factors, including climate change [1, 2]. For this reason, the demand has grown significantly for non-structural measures that can forecast river water levels accurately and in real time, allowing decision-makers to take effective and timely decisions to reduce harm or loss [3–5]. Among the different types of water system models, hydrological and hydrodynamic models are the most widely used in flood early warning systems in river basins.

Unfortunately, deterministic predictions contain an intrinsic uncertainty due to many sources of error that propagate through the model and therefore affect its output [6]. This uncertainty can be due either to the inherent stochastic nature and variability of hydrological processes, i.e. aleatory uncertainty [7, 8], or to our imperfect knowledge of the hydrological system and our limited ability to model it, i.e. epistemic uncertainty [9–12]. Three main sources of uncertainty can be identified [13] in hydrological and hydrodynamic modelling: (a) observation uncertainty, which is the approximation in the observed hydrological variables used as input or calibration data (e.g. rainfall, temperature and river discharge); (b) parameter uncertainty, which is induced by imperfect model calibration; and (c) model structural uncertainty, which results from the inability of models to perfectly schematise the physical processes involved. Epistemic uncertainty can be associated with the latter two sources, since both stem from limited knowledge about the physical behaviour of the system.

A reliable characterisation and reduction of the uncertainties affecting hydrological and hydrodynamic processes is an important scientific and operational challenge [14–17]. Different approaches like the first-order reliability method [18], probabilistic Monte Carlo (MC) and fuzzy rule-based methods [19–21] can be used to assess model uncertainty.

Several research activities aimed at reducing the uncertainty in flood estimation, i.e. predictive uncertainty, have been carried out because of its importance to the decision to issue a flood warning [5, 22, 23]. Methods like the UNcertainty Estimation based on Local Errors and Clustering (UNEEC, [24–26]), Generalised Likelihood Uncertainty Estimation (GLUE, [27, 28]) and Machine Learning in parameter Uncertainty Estimation (MLUE, [29, 30]) can be employed to assess uncertainty in water system models and estimate predictive uncertainty (see, e.g. [31, 32]). However, such tools are often not used in operational forecasting by environmental agencies and river basin authorities, perhaps because of the belief that uncertainty analysis cannot be incorporated into the decision-making process or that it is too subjective, among other reasons [5, 11, 33].

In the last decades, model updating techniques for reducing predictive uncertainty have been increasingly studied and implemented in water-related applications. These approaches allow model input, states, parameters or output to be changed in response to new observations coming into the model, in order to improve prediction accuracy and quantify uncertainty [3, 14, 34]. In most cases, model updating occurs only in the form of data assimilation using streamflow, soil moisture and similar information coming from static physical stations. Model updating techniques are rarely implemented in operational forecasting because of the lack of approaches to quantify the uncertainty in real-time observations from multiple sources across a range of spatiotemporal scales, and of methods to integrate this new information in an appropriate and transparent way. In operational practice, it is therefore preferred to correct the model inputs (in most cases), states, initial conditions and parameters in an empirical and subjective way rather than to apply advanced (optimal) data assimilation techniques for improving hydrologic forecasts [35]. Welles et al. [36] and Liu et al. [34] pointed out the growing need to implement reliable data assimilation methods in operational forecasting in order to close this gap with the scientific world.

Traditionally, static physical sensors, such as pressure sensors, water level sensors and pluviometers, are used by water authorities to calibrate, validate and (in some cases) update physical models in real time. However, the main problems of physical sensors are their maintenance, which can be very expensive for a vast network, and the limited data that existing sparse monitoring networks can provide to this end.

Continued technological advances have stimulated the spread of low-cost sensors, which in turn has triggered crowdsourcing as a way to obtain observations of hydrological variables in a more distributed way than with classic static physical sensors [37]. The main advantage of these sensors is that they can be used not only by technicians, as is the case for traditional physical sensors, but also by regular citizens. Recently, citizen science activities have been widely promoted in order to allow citizens to participate in different aspects of environmental planning and management. One of the most common of these activities is involving citizens in data collection, or crowdsourcing (CS). In particular, observations of hydrological variables can generate additional knowledge about the water cycle, and this knowledge can be used in decision-making [38, 39]. However, because of their relatively limited reliability and an accuracy that varies randomly in time and space, crowdsourced observations have not been widely integrated into hydrological and/or hydraulic models for flood forecasting applications. Instead, they have generally been used to validate model results in post-event analyses. Different studies have addressed the assimilation of distributed observations in distributed and semi-distributed hydrological models (e.g. [40–43]). None of these studies considers the dynamic nature of data from heterogeneous sensors, which provide a signal that is intermittent in time and space. In fact, the information coming from a specific sensor might be sent just once, occasionally or at non-consecutive time steps, i.e. as intermittent observations with different lifespans.

A number of studies have developed methods for using crowdsourced, citizen-based observations in water-related models [44–56]. In particular, crowdsourced information has been used to directly create deterministic or probabilistic flood maps [48] and to derive stream discharges and flow velocity fields [57] and flood extent [52]. Alternatively, crowdsourced data have been used for validating flood models [44, 56]. A detailed review on the use of citizen observations for flood modelling applications is provided in Assumpção et al. [58]. However, none of these studies assessed the usefulness of citizen observations for improving flood predictions [39, 59]. The first attempts to study the effects of assimilating crowdsourced citizen observations in hydrological and hydraulic models for improving flood prediction in real-time applications are reported in Mazzoleni et al. [60–62] and Mazzoleni [63]. More recently, Mazzoleni et al. [64] proposed two innovative approaches to assimilate qualitative flow data within hydrologic routing models.

In this chapter, we describe the proposed innovative methods to assimilate heterogeneous intermittent observations, coming from social sensors, within hydrological and hydrodynamic modelling to improve flood prediction. This research was carried out under the framework of the European project WeSenseIt (https://www.wesenseit.com/) [65].

2 Crowdsourced Observations

In this chapter, we consider two different types of sensors to measure hydrological variables such as water level: static physical (StPh) and static social (StSc) sensors (see Fig. 1). Dynamic social (DySc) sensors may also be used but are not considered in this chapter. An example of a static social sensor is a staff gauge located at a strategic point of the river, which citizens use to estimate water depth and send CS observations through a mobile phone app, with a QR code serving as the geographical reference point. An example of a dynamic sensor is a mobile app that allows any citizen to report the distance between the water profile and the river bank at random locations along the river. It can in fact be difficult to estimate the water depth without any indication of the river depth. In this case, the CS observations have a higher degree of uncertainty because of the indirect method used to estimate the water depth.

Fig. 1. Proposed sensors classification with (a) static physical sensors (StPh), (b) static social sensors (StSc), and (c) dynamic social sensors (DySc)

According to the nature of the sensor, uncertainty can be defined either as a probability distribution (quantitative observation) or a fuzzy set (qualitative or semi-qualitative observations).

During the last decades, probability theory has been applied to represent epistemic or observational uncertainty in mathematical models. In particular, a quantitative observation of a physical variable can be expressed as a stochastic variable with a given probability distribution, which represents the likelihood that the variable takes on a given value. In most cases, stochastic variables are represented using a normal distribution with assigned mean and standard deviation. The higher the standard deviation, the higher the uncertainty of that variable.

Examples of qualitative information can be found in verbal or text messages coming from social networks (Twitter, Facebook, etc.). Fuzzy logic emerged as a more general form of logic that can handle the concept of possibilistic values or partial truth. This approach has been used recently [64] as a qualitative modelling methodology since it allows for an easier transition between humans and computers for decision-making (transition from fuzzy to numerical data), and it is able to handle imprecise and uncertain information [66]. From a statistical point of view, a physical variable can be associated with a deterministic value plus a given degree of uncertainty, expressed as a probability density function (pdf) or as the second- or third-order moment. In a fuzzy logic-based approach, a physical variable value (e.g. precipitation) would belong to a specific fuzzy set with given characteristics (e.g. low, medium or high precipitation).

3 Case Studies and Water-Related Models

Two different case studies having different hydrometeorological characteristics are analysed in this book chapter. The case studies are the Brue catchment (UK) and the Bacchiglione catchment (Italy). Different hydrological and hydraulic models are implemented within each case study. In particular, a semi-distributed version of a continuous Kalinin-Milyukov-Nash (KMN) cascade hydrological model is applied on the Brue catchment, while a semi-distributed hydrological and hydraulic model developed by the Alto Adriatico Water Authority is implemented in the Bacchiglione catchment. In this study, synthetic flow observations derived from observed and simulated quantitative streamflow are used. Synthetic data are used to evaluate the potential of the proposed approaches as real qualitative observations may be affected by different unpredictable errors.

3.1 Brue Catchment (UK)

The Brue catchment is located in Somerset, South West England, with a drainage area of about 135 km² and a time of concentration of 10 h at the catchment outlet, Lovington. Hourly precipitation data are supplied by the British Atmospheric Data Centre from the NERC Hydrological Radar Experiment Dataset (HYREX) project [67, 68] and available at 49 automatic rain stations; average annual rainfall of 867 mm is measured in the period between 1961 and 1990. Discharge is measured at the catchment outlet by one station at a 15 min time step resolution, having an average value of 1.92 m³/s. For both precipitation and discharge data, a 3-year complete data set, between 1994 and 1996, is available.

A semi-distributed hydrological model is used to assess the flood hydrograph at the outlet section of the Brue catchment and to represent the spatial variability of the CS flow observations. The Brue catchment is divided into 68 sub-catchments with a small drainage area (on average around 2 km²), so that an observation at a random location in a given sub-catchment provides the same information content as an observation at the outlet of the same sub-catchment [60]. For each sub-catchment, a conceptual lumped hydrological model, the continuous Kalinin-Milyukov-Nash (KMN) cascade, is implemented to estimate the outflow discharge [69]. The KMN model considers a cascade of storage elements (or reservoirs), assuming that the relation between stage, discharge and stored water volume is linear and that the water storage x_t is only a function of the outflow of the reach Q_t [60]. Subsequently, the KMN model is represented as a dynamic state-space system so that the data assimilation techniques described in Sect. 4 can be applied. For linear systems, the discrete state-space system can be represented as follows [69]:

${\mathbf{x}}_t=\boldsymbol{\Phi} {\mathbf{x}}_{t-1}+\boldsymbol{\Gamma} {I}_t+{w}_t$

(1)

${z}_t=\mathbf{H}{\mathbf{x}}_t+{v}_t$

(2)

where t is the time step, x is the vector of model states (stored water volume in m³), Φ is the state-transition matrix (a function of the model parameters n and k), Γ is the input-transition matrix, H is the output matrix, I and z are the input (forcing) and the model output, and w and v are the system and measurement errors.
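As an illustration of Eqs. (1) and (2), the following Python sketch steps a cascade of linear reservoirs in state-space form. The explicit-Euler discretisation used to build Φ, Γ and H, the function names and the parameter values are assumptions made for this example only; they are not the KMN formulation of [69], which is not reproduced here. The error terms w_t and v_t are omitted.

```python
import numpy as np

def cascade_matrices(n, k_res, dt):
    """Illustrative Phi, Gamma, H for n linear reservoirs (assumed Euler scheme).

    Each reservoir stores x_i (m3) and releases x_i / k_res (m3/s);
    k_res is the storage constant (s) and dt the time step (s).
    """
    A = -np.eye(n) / k_res + np.eye(n, k=-1) / k_res  # outflow of i feeds i+1
    Phi = np.eye(n) + dt * A                          # state-transition matrix
    Gamma = np.zeros(n)
    Gamma[0] = dt                                     # forcing enters reservoir 1
    H = np.zeros(n)
    H[-1] = 1.0 / k_res                               # output = outflow of last one
    return Phi, Gamma, H

def simulate(Phi, Gamma, H, inflow, x0):
    """Apply Eq. (1) and Eq. (2) without noise for each forcing value."""
    x = np.array(x0, dtype=float)
    outflow = []
    for I_t in inflow:
        x = Phi @ x + Gamma * I_t   # Eq. (1), w_t omitted
        outflow.append(H @ x)       # Eq. (2), v_t omitted
    return np.array(outflow), x

Phi, Gamma, H = cascade_matrices(n=3, k_res=2.0, dt=1.0)
z, x_final = simulate(Phi, Gamma, H, inflow=[5.0] * 200, x0=np.zeros(3))
```

Under a constant forcing, the simulated outflow of this toy cascade approaches the inflow and each storage approaches k_res times the inflow, which is a quick check that the matrices conserve mass.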

Muskingum channel routing method [70] is used for flow propagation between sub-catchments; for details see Mazzoleni et al. [60]. The semi-distributed model is structured in such a way that the sub-catchments are sequentially connected and the output of the upstream sub-catchments is used as input in the downstream ones (see Fig. 2). More details about the model calibration are reported in Mazzoleni et al. [60].

Fig. 2. Considered model structures (MS) for the semi-distributed hydrological model

3.2 Bacchiglione Catchment (Italy)

The Bacchiglione River catchment is located in the north-east of Italy. The river is a tributary of the River Brenta, which flows into the Adriatic Sea south of the Venetian Lagoon and north of the River Po delta. The considered area is the upstream part of the Bacchiglione River, which has an overall area of about 400 km², a river length of about 50 km, a river width of 40 m and a river slope of about 0.5% [71]. The main urban area is Vicenza, located in the downstream part of the study area, where recent floods were registered during the springs of 2010 and 2013. Within the activities of the WeSenseIt project [72], one StPh sensor and ten StSc sensors (staff gauges complemented by a QR code, as represented in Fig. 1) were installed in the Bacchiglione River to measure water level (see Fig. 3). Hourly information on rainfall, temperature, wind direction and intensity, humidity, snow, solar radiation and water level is available for the last 12 years.

Fig. 3. Structure of the semi-distributed model for the Bacchiglione catchment and location of the static physical (StPh) and social (StSc) sensors

In order to represent the distributed hydrological response of this catchment, a semi-distributed model, in which the output of the hydrological model is used as boundary conditions in the hydraulic model, has been implemented.

The hydrological response of the catchment is estimated using the hydrological model developed by the Alto Adriatico Water Authority (AAWA), which includes routines for runoff generation, with precipitation as model forcing, and a simple routing procedure. The processes related to runoff generation are modelled mathematically by applying the water balance to a control volume, of soil depth, representative of the active soil at the sub-catchment scale. The water content is estimated as a function of precipitation, evapotranspiration, surface runoff, sub-surface runoff and deep percolation. The propagation process in the river channel is represented using a distributed Muskingum-Cunge model discretised every 1,000 m. More details about these models can be found in Mazzoleni et al. [62].

The calibration of the hydrological model parameters was performed by AAWA using an adaptation of the “SCE-UA” algorithm [73], considering the time series of precipitation from 2000 to 2010, in order to minimise the root mean square error between observed and simulated values of water level at PA (Vicenza) gauged station. For the Muskingum-Cunge model, the only parameter that is calibrated in this chapter is the Manning coefficient n, used to estimate the water level along the river. The semi-distributed hydrological-hydraulic model in the Bacchiglione catchment is then validated considering the flood events that occurred in May 2013, November 2014 and February 2016.

In order to apply data assimilation, both the hydrological and Muskingum models are represented using the stochastic state-space form reported in the previous section. In particular, for the Muskingum model, the approach proposed by Georgakakos et al. [74] is used.

4 Model Updating Techniques

Operational forecasting can be seen as a combination of water models (e.g. hydrological and hydrodynamic) and an updating module. In fact, in the last decades model updating techniques have been intensively used within water system models [75, 76] in order to reduce predictive uncertainty. The hydrological and hydrodynamic models feed input variables, which are either measured or estimated (e.g. areal precipitation, air temperature, potential evapotranspiration), into a set of equations that contain state variables and parameters. Typically, the parameters remain constant while the state variables vary in time, although there are examples of parameter updating approaches such as Moradkhani et al. [77, 78], Salomon and Feyen [79] and Lü et al. [80]. The feedback process of assimilating newly available information into the forecasting procedure is referred to as updating [75] or data assimilation (DA) [76].

The assimilation methods can be divided according to the variables modified during the updating process. In the frequently cited WMO report [76], updating is understood in a wide sense, and input, parameters, states and output updating techniques are distinguished. Recently, Liu et al. [34] provided a detailed review of the status, progresses, challenges and opportunities in advancing DA in operational hydrological forecasting. There are many data assimilation techniques that can be used to integrate hydrological observations within water-related models. In this chapter we will focus mainly on Kalman filter and ensemble Kalman filter.

4.1 Kalman Filter

The Kalman filter (KF, [81]) is an approach that optimally estimates the state of an uncertain dynamic model in response to real-time (noisy) observations [3, 14, 77, 82–85]. The KF updates the model states considering only the last available observation, which allows faster computation. However, the KF is optimal only for linear dynamic systems. The KF procedure can be divided into two steps: the time update equations, namely the forecast (background) Eqs. (3) and (4),

${\mathbf{x}}_t^{-}=\boldsymbol{\Phi} {\mathbf{x}}_{t-1}^{+}+\boldsymbol{\Gamma} {I}_t+{w}_t$

(3)

${\mathbf{P}}_t^{-}=\boldsymbol{\Phi} {\mathbf{P}}_{t-1}^{+}{\boldsymbol{\Phi}}^T+{\mathbf{S}}_t$

(4)

and update (or analysis) Eqs. (5), (6) and (7):

${\mathbf{K}}_t=\frac{{\mathbf{P}}_t^{-}{\mathbf{H}}^{\mathrm{T}}}{{\mathbf{H}\mathbf{P}}_t^{-}{\mathbf{H}}^{\mathrm{T}}+{\mathbf{R}}_t}$

(5)

${\mathbf{x}}_t^{+}={\mathbf{x}}_t^{-}+{\mathbf{K}}_t\cdotp \left({z}_t^o-{\mathbf{Hx}}_t^{-}\right)$

(6)

${\mathbf{P}}_t^{+}=\left(\mathbf{I}-{\mathbf{K}}_{\mathrm{t}}\mathbf{H}\right){\mathbf{P}}_t^{-}$

(7)

where x is the n_states × 1 state vector at time t and t−1, K_t is the n_states × n_obs Kalman gain matrix, P is the n_states × n_states error covariance matrix and z^o is the new observation. The superscripts + and − indicate, respectively, the updated and background state values, and Φ and Γ represent the state-transition and input-transition matrices, which change according to the model type and structure. The system error w_t and the measurement error v_t are assumed to be normally distributed with zero mean and covariances S and R, respectively. In the application considered in this chapter, the matrix R is time dependent, as the measurement error is assumed to vary because of the varying behaviour in time and space of the crowdsourced observations.

A key issue in the implementation of the Kalman filter is the determination of model errors. In fact, an overestimation of model errors can reduce the confidence in the model, bringing the KF estimate closer to the observations, and vice versa [86]. In this study, the modified version of the KF proposed in Mazzoleni et al. [62], which accounts for the intermittency of crowdsourced observations arriving between two model time steps, is used.
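The forecast and analysis steps of Eqs. (3)–(7) can be sketched in a few lines of Python. This is a minimal illustration of the standard KF, not the modified filter of [62]; the function names are hypothetical, the forecast uses the zero-mean expectation of w_t (so no noise is sampled), and the gain of Eq. (5) is written with an explicit matrix inverse.

```python
import numpy as np

def kf_forecast(x_a, P_a, Phi, Gamma, I_t, S):
    """Forecast (background) step, Eqs. (3) and (4)."""
    x_f = Phi @ x_a + Gamma * I_t        # Eq. (3) with E[w_t] = 0
    P_f = Phi @ P_a @ Phi.T + S          # Eq. (4)
    return x_f, P_f

def kf_update(x_f, P_f, z_obs, H, R):
    """Analysis step, Eqs. (5), (6) and (7).

    H has shape (n_obs, n_states) and R has shape (n_obs, n_obs).
    """
    K = P_f @ H.T @ np.linalg.inv(H @ P_f @ H.T + R)   # Eq. (5)
    x_a = x_f + K @ (z_obs - H @ x_f)                  # Eq. (6)
    P_a = (np.eye(len(x_f)) - K @ H) @ P_f             # Eq. (7)
    return x_a, P_a, K

# One-state example: equal background and observation variances
x_a, P_a, K = kf_update(np.array([1.0]), np.array([[1.0]]),
                        np.array([3.0]), np.array([[1.0]]), np.array([[1.0]]))
```

In this one-state example the background and the observation are equally uncertain, so the gain is 0.5 and the analysis lies halfway between them. A larger R (a less accurate sensor) would pull the analysis back towards the background.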

4.2 Ensemble Kalman Filter

The ensemble Kalman filter (EnKF) [87–90] is a widely used data assimilation method for non-linear dynamic models. The main idea of the EnKF is to represent the forecasted probability density function (pdf) of the model states with a set of random samples and to estimate the updated pdf as a combination of the data likelihood and the forecasted pdf by means of a Bayesian update. In this way, the model error covariance matrix is evaluated as proposed by Evensen [87]:

${\mathbf{P}}_{\mathrm{t}}^{-}=\frac{1}{N_{\mathrm{ens}}-1}\mathbf{E}{\mathbf{E}}^{\mathrm{T}}$

(8)

where N_ens is the number of ensemble members and E is the ensemble anomaly [40] for each ensemble member:

${\mathbf{E}}_{\mathrm{t}}=\left({\mathbf{x}}_{t,1}^{-}-\overline{\mathbf{x}},{\mathbf{x}}_{t,2}^{-}-\overline{\mathbf{x}},\cdots, {\mathbf{x}}_{t,i}^{-}-\overline{\mathbf{x}},\cdots, {\mathbf{x}}_{t,{N}_{\mathrm{ens}}}^{-}-\overline{\mathbf{x}}\right)$

(9)

where $\overline{\mathbf{x}}$ is the ensemble mean. The updated states and the Kalman gain are calculated using Eqs. (5) and (6). Because the EnKF performance is influenced by the spread of the ensemble [91–93], it is important to perturb the system properly in order to obtain a reliable ensemble spread within a meaningful range [94]. For this reason, in this study we used the approach proposed by Anderson [91] to perturb the system and to evaluate the quality of the ensemble spread. More details are provided in Mazzoleni [63].

In order to implement EnKF, an ensemble of model realisations is generated perturbing the forcing data and the model parameters using a uniform distribution. The observation error is assessed using the approach described in the section below.
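The ensemble covariance of Eqs. (8) and (9) and the member-wise update can be sketched as follows. The function names and array layout (one ensemble member per column) are assumptions for this example, and the perturbed-observation form of the update, in which each member receives its own noisy copy of the observation, is a common EnKF variant chosen here for illustration rather than a detail stated in this chapter.

```python
import numpy as np

def ensemble_covariance(X_f):
    """Eqs. (8) and (9); X_f has shape (n_states, N_ens)."""
    n_ens = X_f.shape[1]
    E = X_f - X_f.mean(axis=1, keepdims=True)   # ensemble anomalies, Eq. (9)
    return E @ E.T / (n_ens - 1)                # Eq. (8)

def enkf_update(X_f, z_obs, H, R, rng):
    """Update every member with Eqs. (5) and (6) (perturbed observations assumed)."""
    n_ens = X_f.shape[1]
    P_f = ensemble_covariance(X_f)
    K = P_f @ H.T @ np.linalg.inv(H @ P_f @ H.T + R)            # Eq. (5)
    # Draw observation noise with covariance R for each member
    noise = np.linalg.cholesky(R) @ rng.standard_normal((len(z_obs), n_ens))
    Z = np.asarray(z_obs, dtype=float)[:, None] + noise
    return X_f + K @ (Z - H @ X_f)                              # Eq. (6)

rng = np.random.default_rng(0)
X_f = rng.normal(size=(2, 50))              # 2 states, 50 members
H = np.array([[1.0, 0.0]])                  # only the first state is observed
X_a = enkf_update(X_f, [3.0], H, np.array([[1e-8]]), rng)
```

With a nearly error-free observation, as in this example, the observed state of every member is drawn almost exactly onto the observed value, while the unobserved state is adjusted only through its sample covariance with the observed one.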

4.3 Synthetic Flow Observations

Synthetic flow observations are used because of the lack of distributed crowdsourced observations at the time of this study within the considered case study [62]. Such synthetic observations are generated by two different approaches for the two catchments. On the one hand, for the Brue catchment, the approach used to generate the synthetic values of river flow is very similar to the one used by Weerts and El Serafy [90], in which the model forcing is perturbed by means of a time series normally distributed with zero mean and given standard deviation.

On the other hand, for the Bacchiglione catchment, the observed time series of precipitation are used as input for the hydrological models of the sub-catchments and inter-catchments to generate synthetic discharges, which are then propagated with the hydraulic model down to the outlet of the catchment. In this way, the synthetic water level (WL) values at the outlet of the sub-catchments or inter-catchments and at each spatial discretisation point of the six reaches of the Bacchiglione River are estimated and treated as observed variables in the assimilation process.

4.4 Estimation of the Observational Error

The correct estimation of the model and observational error is crucial for implementing data assimilation methods. Few studies in the past have addressed this issue (e.g. [95]), but further research is needed. For this reason, we adopted a simplified approach to quantify observational errors. Here, the covariance matrix R is assessed using the approach described in Weerts and El Serafy [90], Rakovec et al. [43] and Mazzoleni [63]:

${\mathbf{R}}_t={\left({\alpha}_t\bullet {Q}_t^{\mathrm{synth}}\right)}^2$

(10)

where α is a variable related to the accuracy level (i.e. degree to which the measurement is correct overall) of the flow measurement and Q^synth is the synthetic flow observation. In the case of CS observations, accuracy levels vary temporally and spatially.

Table 1 summarises the distribution of the coefficient α of the observational error of Eq. (10). This distribution is not intended to be exhaustive in accounting for the different inaccuracies of observations coming from physical and social sensors and is a subject for further research.

Table 1

Assumed observational errors for the different types of sensors

Sensor type	Assumed accuracy level	Coefficient α	Temporal and spatial variability
Static physical (StPh)	High	α = 0.1	Fixed location; constant in time
Static social (StSc)	Medium	α = U(0.1, 0.3)	Fixed location; intermittent arrival

For static social sensors, α values are higher than for static physical sensors and are considered to be a random stochastic variable uniformly distributed in time and space. More details can be found in Mazzoleni et al. [61].
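The observational error of Eq. (10), with the α values of Table 1, can be computed as in the short Python sketch below. The function and constant names are hypothetical; only the α values and the form of Eq. (10) come from the text.

```python
import numpy as np

ALPHA_STPH = 0.1                 # static physical sensors (Table 1)
ALPHA_STSC_RANGE = (0.1, 0.3)    # static social sensors, alpha ~ U(0.1, 0.3)

def sample_alpha(sensor_type, rng):
    """Return the accuracy coefficient alpha for one observation."""
    if sensor_type == "StPh":
        return ALPHA_STPH
    if sensor_type == "StSc":
        return rng.uniform(*ALPHA_STSC_RANGE)   # redrawn for every observation
    raise ValueError(f"unknown sensor type: {sensor_type}")

def observation_error_variance(q_synth, alpha):
    """Eq. (10): R_t = (alpha_t * Q_synth_t)^2."""
    return (alpha * q_synth) ** 2

rng = np.random.default_rng(1)
r_stph = observation_error_variance(10.0, sample_alpha("StPh", rng))
alpha_stsc = sample_alpha("StSc", rng)
r_stsc = observation_error_variance(10.0, alpha_stsc)
```

For a synthetic flow of 10 m³/s, the physical sensor gives R = 1 (m³/s)², whereas the social sensor gives a value between 1 and 9 (m³/s)² depending on the sampled α, which is how the lower accuracy of crowdsourced observations enters the Kalman gain.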

5 Assimilation of Flow Observations from Static Heterogeneous Sensors

This section aims to explore the benefits of assimilating flow observations from a network of static heterogeneous sensors in the case of synchronous (Sect. 5.1) or asynchronous (Sect. 5.2) social observations, depending on the predictability of the arrival time of the observations. In particular, we assume that social sensors provide intermittent observations that can lie either in a specific model time step (synchronous) or in between two model time steps (asynchronous). In addition, social sensors may be distributed within the catchment or be located in a specific point.

5.1 Assimilation of Synchronous Observations

Here, we show the model performance after the assimilation of intermittent synchronous observations, i.e. observations whose arrival time matches the model time step, within the semi-distributed hydrological models of the Brue catchment. This section is divided into two parts: first, the flow observations are assimilated from different social sensors located within the catchment; second, social sensors are integrated with a network of physical sensors to evaluate the added value of crowdsourced sensors in the assimilation process. A straightforward and pragmatic method (based on EnKF) is used to assimilate the intermittent observations into the hydrological model: the model state matrix is updated only when observations are available, while when there are no observations, it is assumed that the state error covariance does not change at that time step [60, 96].
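The idea of updating only when an observation is available can be sketched as below. This is one possible reading of the pragmatic scheme, written for a single-state linear model with the scalar KF equations rather than the EnKF used in [60, 96]; the function name, the use of NaN to mark missing observations and the assumption that the state is observed directly (H = 1) are all choices made for this illustration.

```python
import numpy as np

def assimilate_intermittent(x0, P0, phi, gamma, inflow, obs, S, R):
    """Scalar filter that skips the analysis step when obs[t] is NaN.

    obs and R are sequences aligned with inflow; R[t] is the (time-varying)
    observation error variance at step t.
    """
    x, P = float(x0), float(P0)
    states = []
    for I_t, z_t, R_t in zip(inflow, obs, R):
        # Forecast step, Eqs. (3) and (4)
        x = phi * x + gamma * I_t
        P = phi * P * phi + S
        if not np.isnan(z_t):
            # Analysis step, Eqs. (5)-(7), only when an observation arrived
            K = P / (P + R_t)
            x = x + K * (z_t - x)
            P = (1.0 - K) * P
        # Otherwise the background state and covariance are carried forward
        states.append(x)
    return np.array(states)

inflow = [1.0] * 5
R = [0.04] * 5
open_loop = assimilate_intermittent(0.0, 1.0, 0.5, 1.0, inflow, [np.nan] * 5, 0.01, R)
updated = assimilate_intermittent(0.0, 1.0, 0.5, 1.0, inflow,
                                  [np.nan, np.nan, 3.0, np.nan, np.nan], 0.01, R)
```

In this toy run, the single observation at the third time step moves the state towards the observed value, and the effect then decays over the following steps as the model forcing takes over again.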

5.1.1 Assimilation of Flow Observations Only from Social Sensors

In the first part of this section, we consider MS1 and three different spatial configurations (SC) of static social sensors within the catchment (called scenarios SC1, SC2 and SC3). In particular, SC1 refers to social sensors located along the main river channel, SC2 to sensors located on the upstream part of the main river channel and SC3 to sensors located close to the catchment outlet. Model results obtained by assimilating flow observations from physical sensors are used as a benchmark against which to compare the assimilation performance obtained with social sensors. A main assumption of this study is that flow observations from social sensors are less accurate than those from physical sensors, with an observation error that is random in both time and space.

The difference between the outflow hydrograph estimated assimilating physical and social data (in the same location) is represented in Fig. 4. The different colours of the hydrographs represent the different intermittency configurations of the social sensors, i.e. the unpredictable arrival time of the social observation. The smaller the value of difference, the smaller the sensitivity of the model to assimilation of observations from social sensors. Two different flood events are analysed.

Fig. 4. Differences between assimilation of synthetic physical and social flow data in terms of outflow hydrographs under different intermittency configurations of the social sensors (different colour lines) for MS1 (source [60])

As expected, the assimilation performance changes with the location of the social sensors within the catchment. For flood event A, model outputs are affected by changing from physical to social flow data mainly for SC3. This can be due to the particular structure of the hydrological model. The discharge differences in flood event B are smaller than in flood event A because of the different performance of the model without assimilation. In fact, for flood event B, additional real-time observations of discharge only slightly improve the model results, since the model already tends to estimate the observed discharge better even without assimilation. It is worth noting that the results do not seem to be very sensitive to the intermittency scenarios (different colours in Fig. 4).

The results reported in Table 2 show a large difference in NSE between assimilation from physical and social sensors: for SC1, for example, NSE is 0.77 with physical sensors and 0.58 with social sensors, against 0.46 without assimilation. Table 2 also shows that the best model performance is obtained not when the assimilated flow data come from sensors located at the outlet section of the catchment (SC3) but when sensors are located along the main river channel, i.e. SC1 [60].

Table 2

NSE index values obtained assimilating streamflow observations from different spatial configurations (SC) of physical and social sensors for MS1 (NoDA: no data assimilation)

Sensor type	NoDA	SC1	SC2	SC3
Physical sensors	0.46	0.77	0.69	0.75
Social sensors	0.46	0.58	0.51	0.47
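For reference, the NSE index reported in Table 2 is the Nash-Sutcliffe efficiency, which compares the squared model errors with the variance of the observations. A minimal sketch of its standard definition is given below; the function name and the flow values are illustrative only.

```python
import numpy as np

def nse(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 for a perfect fit, 0 when the model
    is no better than the mean of the observations."""
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    residual = np.sum((observed - simulated) ** 2)
    variability = np.sum((observed - observed.mean()) ** 2)
    return 1.0 - residual / variability

observed = np.array([2.0, 4.0, 9.0, 6.0, 3.0])   # hypothetical flows
simulated = np.array([2.5, 3.5, 7.0, 6.5, 3.0])
score = nse(observed, simulated)
```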

5.1.2 Assimilation of Flow Observations from Both Physical and Social Sensors

The location of social sensors typically follows some rules and is subject to specific constraints. For example, the existence of multiple sensors in remote areas of the catchment is quite unlikely for economic and management reasons. For this reason, in the second part of this section, we assume a realistic configuration of social sensors closer to the main urbanised area within the catchment (see Fig. 5). The network of static social sensors is integrated with the optimal network of static physical sensors (α equal to 0.1) for MS1 and MS2, respectively.

Fig. 5
Representation of distribution of static physical and social sensors along the Brue Basin for MS1 and MS2, respectively [63]

Different scenarios are introduced based on assumptions about the intermittency and availability of CS data and on the possible integration of uncertain CS data with an optimal or nonoptimal network of static physical sensors (see Table 3).

Table 3

Description of the different settings

Setting	Social sensors			Physical sensors
Setting	Intermittent	Daily timing	Daily and peak timing	Optimal	Nonoptimal
1	–	X	–	–	–
2	X	X	–	–	–
3	–	–	X	–	–
4	X	–	X	–	–
5	–	–	–	X	–
6	–	X	–	X	–
7	X	X	–	X	–
8	–	–	X	X	–
9	X	–	X	X	–
10	–	–	–	–	X
11	–	X	–	–	X
12	X	X	–	–	X
13	–	–	X	–	X
14	X	–	X	–	X

Assimilation of uncertain discharge observations measured at seven staff gauges by social sensors can improve the model results, although the peak flow is still underestimated for scenarios 1 and 2 (see Fig. 6). Assimilation of observations coming from trained volunteers at the time of the peak flow (scenarios 3 and 4) shows a satisfactory improvement of the discharge hydrograph (higher for model structure 1 than for structure 2). Intermittent observations do not improve the model results to the same extent as social observations arriving continuously in time.

Fig. 6
Outflow hydrographs resulting from the assimilation of physical, social and intermittent observations in the case of realistic scenarios (from a to f) of spatial and temporal distribution of static sensors [63] for MS1 (first row) and MS2 (second row)

Figure 6b confirms that assimilating observations from the optimally located static sensors running continuously in time (scenario 5) achieves improvements similar to those of scenario 3. In addition, a combined assimilation of intermittent observations (during daylight time) and static observations from an optimal or nonoptimal network of static sensors tends to slightly improve the model output.

Figure 6 demonstrates that, for this type of hydrological model and this particular basin, model performance can be improved even when the static physical sensors are inappropriately distributed within the basin (scenario 10). However, the model has an evident limitation: it produces biased hydrographs (especially for MS2) that underestimate the observed one. Biased models can affect the DA results [34].

5.2 Assimilation of Asynchronous Observations

In the previous analysis, social data are provided at times that coincide with the model time step. However, CS observations might arrive at moments that do not match the model time step (asynchronous observations), as reported in Mazzoleni et al. [62]. Various experimental scenarios representing different configurations of arrival frequency, number and accuracy of the flow observations are reported in Fig. 7. To average out the random behaviour related to the irregular arrival frequency and observation accuracy, 100 model runs are carried out, each assuming different random values of arrival time and accuracy (coefficient α in Eq. 10), for a given number of observations and lead time. The NSE value is estimated for each model run, so μ(NSE) represents the mean of the NSE values over the runs.
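The repeated-run procedure can be sketched as a small Monte Carlo loop. The code below is only an illustration under stated assumptions: `run_model` is a hypothetical callable standing in for one assimilation run that receives the random arrival times and accuracy coefficients and returns the simulated flow series; arrival times are drawn uniformly within an observation window, and α is drawn uniformly between 0.1 and 0.3, the range quoted in this chapter for static social sensors.

```python
import numpy as np

def nse(observed, simulated):
    """Nash-Sutcliffe efficiency of a simulated series."""
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    return 1.0 - np.sum((observed - simulated) ** 2) / np.sum(
        (observed - observed.mean()) ** 2
    )

def monte_carlo_nse(run_model, observed, n_obs, window,
                    alpha_range=(0.1, 0.3), n_runs=100, seed=0):
    """Mean and standard deviation of NSE over repeated random runs.

    run_model(arrival, alpha) is a hypothetical stand-in for one
    assimilation run; it must return a series comparable to `observed`.
    """
    rng = np.random.default_rng(seed)
    scores = np.empty(n_runs)
    for i in range(n_runs):
        # Random arrival times inside the observation window
        arrival = np.sort(rng.uniform(0.0, window, size=n_obs))
        # Random accuracy coefficient for each observation
        alpha = rng.uniform(alpha_range[0], alpha_range[1], size=n_obs)
        scores[i] = nse(observed, run_model(arrival, alpha))
    return scores.mean(), scores.std()
```

The first returned value plays the role of μ(NSE); the second is the spread of NSE across the runs for the same number of observations and lead time.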

Fig. 7
The experimental scenarios representing different configurations of arrival frequency, number and accuracy of the streamflow observations [62]

5.2.1 Assimilation of Flow Observations Only from Social Sensors

A lumped hydrological model based on the KMN model is applied to the Brue catchment in order to assimilate synthetic asynchronous observations using the modified version of KF reported in Mazzoleni et al. [62]. Two flood events and experimental scenarios from 1 to 9 (see Fig. 7) are considered in this section.

As can be seen from Fig. 8, increasing the number of social observations within the observation window improves the NSE, but the improvement becomes negligible for more than ten observations. This means that the additional social observations do not add information useful for improving the model performance.
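One simplified way to see why the benefit of additional observations levels off is the scalar Kalman filter. The derivation below is an illustration only, under assumptions not taken from the chapter: a single state observed directly, no model propagation between the observations in the window, and n independent observations sharing the same error variance r. The symbols p_0, p_n and r are introduced here for this purpose.

```latex
% Prior state error variance p_0; each observation has error variance r.
% Each scalar Kalman update adds precision 1/r:
\frac{1}{p_n} = \frac{1}{p_0} + \frac{n}{r}
\quad\Longrightarrow\quad
p_n = \frac{p_0\, r}{r + n\, p_0}

% Variance reduction brought by the (n+1)-th observation:
p_n - p_{n+1} = \frac{p_0^{2}\, r}{(r + n\, p_0)\,\bigl(r + (n+1)\, p_0\bigr)}
```

Under these assumptions the reduction brought by each new observation shrinks roughly as 1/n², so most of the gain comes from the first few observations.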

Fig. 8
μ(NSE) values estimated for varying number of assimilated flow observations, for the intermittency scenarios for the different flood events [63]

From Fig. 8 it can be seen that, overall, assimilation of crowdsourced observations improves model performance in all the considered flood events. In scenarios 2 and 3 (represented using warm colours, red and orange, in Fig. 8, for a lead time of 24 h), i.e. random arrival frequency with fixed/controlled accuracy, the average values of NSE, μ(NSE), are smaller than but comparable with the ones obtained in scenario 1 for all the considered flood events. In particular, scenario 3 has a lower μ(NSE) than scenario 2. Both scenarios have random arrival frequency; however, in scenario 3 observations are not provided at the model time step, as opposed to scenario 2. In scenario 4, represented using a cold colour (blue), observations arrive at regular time steps but have random accuracy. Figure 8 shows that μ(NSE) values are lower in scenario 4 than in scenarios 2 and 3, which can be related to the higher influence of observation accuracy compared to arrival frequency. The combined effects of random arrival frequency and random observation accuracy are represented in scenario 5 using a magenta colour (i.e. the combination of warm and cold colours) in Fig. 8. As expected, this scenario has the lowest values of μ(NSE) among the ones considered so far. The remaining scenarios, from 6 to 9, are equivalent to the ones from 2 to 5, with the only difference that they are non-periodic in time. For this reason, in Fig. 8, scenarios 6 to 9 have the same colours as scenarios 2 to 5 but are drawn with dashed lines to underline their non-periodic behaviour. Overall, non-periodic scenarios have μ(NSE) values similar to those of their corresponding periodic scenarios. However, their smoother μ(NSE) trends are due to a lower variability of the NSE values, which means that model performance depends less on the non-periodic nature of the crowdsourced observations than on their periodic behaviour. Overall, σ(NSE), the standard deviation of NSE across the model runs, tends to decrease as the number of observations increases.

5.2.2 Assimilation of Flow Observations from Both Physical and Social Sensors

In the following, we analyse the contribution of assimilating synthetic flow data from a heterogeneous network of physical and social sensors into the semi-distributed model implemented in the Bacchiglione catchment. Streamflow observations from physical sensors are assumed to be synchronous with hourly frequency, while social observations are considered asynchronous with higher and irregular frequency. Five different experimental settings are introduced and represented in Fig. 9, corresponding to the different types of sensors used.

Fig. 9
Different experimental settings implemented within the Bacchiglione catchment (based on [62])

The physical and social observations are assimilated in order to improve the poor model prediction at the catchment outlet (city of Vicenza), which is affected by an underestimation in the 3-day rainfall forecast normally used as input in flood forecasting practice in this area. Scenarios 10 and 11, described in Fig. 7, are used in this experiment to represent the irregular and random behaviour of the social observations.

Figure 10 shows the results obtained for the experimental settings in the case of observations from distributed physical and social sensors. One of the main outcomes of these analyses is that replacing a physical sensor with a social sensor at only one location (setting B) does not improve the model performance in terms of NSE for different lead time values. Distributed social sensors (setting C) can provide a higher NSE than a single physical sensor, even for a low number of observations, for both regular and intermittent social observations. It is interesting to note that when physical and social sensors are integrated (setting D), the NSE is higher than in setting C for a low number of observations. However, with a higher number of observations, setting C outperforms setting D for low lead time values. Overall, the best model improvement is achieved with setting E. For intermittent observations (panels d, e and f), setting D provides a higher improvement than setting C. For a high lead time value (12 h), the results of setting C tend to be similar to the ones obtained with setting B. As for scenario 10, the best results for scenario 11 are achieved with setting E.

Fig. 10
Model performance, expressed as μ(NSE), when assimilating different numbers of crowdsourced observations, for the three lead time values, with the characteristics of scenario 10 (first row) and scenario 11 (second row) (based on [63])

6 Conclusions

This chapter describes the novel methods, mainly developed within the EU-FP7 WeSenseIt project, aimed at optimally assimilating heterogeneous intermittent observations coming from static social sensors to improve hydrological and hydrodynamic models for flood prediction. The proposed methods for assimilating crowdsourced observations are applied to the Brue and Bacchiglione catchments, in which different hydrological and hydraulic models are implemented. A Kalman filter and an ensemble Kalman filter are used to assimilate flow observations in linear and non-linear models, respectively. The observational error is assumed to be uniformly distributed, with multiplying factors of 0.1 and 0.3 as minimum and maximum values, respectively, for the static social sensors. It is worth noting that, because real crowdsourced observations from citizens were not available at the time of this study, realistic model-based synthetic flow observations are used instead.

This study demonstrated that crowdsourced citizen-based observations can significantly improve flood prediction if integrated into hydrological and hydraulic models. In addition, networks of low-cost static and dynamic social sensors can actually complement traditional networks of static physical sensors, for the purpose of improving flood forecasting accuracy. This can be one of the potential applications of increasing efforts to build citizen observatories of water. On the one hand, citizens can play an active role in information capturing, evaluation and communication, and on the other hand, they can also help in improving models and increasing flood resilience.

In particular, assimilation of streamflow observations from static social sensors provides improvements in model performance that depend on the location of such observations and the structure of the considered hydrological model. Flood forecasts are influenced by the total number of social sensors and their locations in the case of a semi-distributed model with sub-catchments connected in parallel, while results achieved with sub-catchments connected in series are more sensitive to the locations of the static physical sensors but not to their number.

This research showed that assimilation of asynchronous observations results in a significant improvement of NSE for different lead time values. Increasing the number of crowdsourced asynchronous observations assimilated between two consecutive model time steps improves the NSE. However, beyond a threshold number of crowdsourced observations, NSE asymptotically approaches a certain value, meaning that no further improvement is achieved with additional observations.

Besides these important results, this work still has certain limitations that should be mentioned. Additional analyses on different case studies and hydrological/hydraulic models have to be carried out to draw more general conclusions about the assimilation of crowdsourced observations and their added value in different types of catchments. In addition, the adopted simple hydrologic and flow propagation models neglect some of the physical processes in complex floodplains (e.g. lamination/reservoir effects). The internal states of the hydrologic model where crowdsourced observations are supposed to be observed should be calibrated, since unbiased models are necessary to optimise data assimilation frameworks [34]. Moreover, real-life crowdsourced observations provided by citizens using static and dynamic social sensors have to be used to further validate the results obtained in this research.

Overall, with this research we demonstrated that the choice of the proper mathematical model and updating technique to be used for flood forecasting may vary according to the data availability, location of the sensors, type of forecast, etc.

Acknowledgements

This research was funded in the framework of the European FP7 Project WeSenseIt: Citizen Observatory of Water, grant agreement No. 308429.

References

1.
Hinkel J, Lincke D, Vafeidis AT, Perrette M, Nicholls RJ, Tol RSJ, Marzeion B, Fettweis X, Ionescu C, Levermann A (2014) Coastal flood damage and adaptation costs under 21st century sea-level rise. Proc Natl Acad Sci 111(9):3292–3297. https://doi.org/10.1073/pnas.1222469111
2.
Jongman B, Hochrainer-Stigler S, Feyen L, Aerts JCJH, Mechler R, Botzen WJW, Bouwer LM, Pflug G, Rojas R, Ward PJ (2014) Increasing stress on disaster-risk finance due to large floods. Nat Clim Chang 4(4):264–268. https://doi.org/10.1038/nclimate2124
3.
McLaughlin D (2002) An integrated approach to hydrologic data assimilation: interpolation, smoothing, and filtering. Adv Water Resour 25(8–12):1275–1286. https://doi.org/10.1016/S0309-1708(02)00055-6
4.
Solomatine DP, Wagener T (2011) Hydrological modelling. In: Wilderer P (ed) Treatise on water science. Elsevier, Amsterdam, pp 435–457
5.
Todini E, Alberoni P, Butts M, Collier C, Khatibi R, Samuels P, Weerts A (2005) ACTIF best practice paper–understanding and reducing uncertainty in flood forecasting. In: Balabanis P, Lumbroso D, Samuels P (eds) International conference on innovation, advances and implementation of flood forecasting technology, Tromsø, Norway
6.
Pappenberger F, Matgen P, Beven KJ, Henry J-B, Pfister L, de Fraipont P (2006) Influence of uncertain boundary conditions and model structure on flood inundation predictions. Adv Water Resour 29(10):1430–1449. https://doi.org/10.1016/j.advwatres.2005.11.012
7.
Koutsoyiannis D (2010) HESS opinions “a random walk on water”. Hydrol Earth Syst Sci 14(3):585–601. https://doi.org/10.5194/hess-14-585-2010
8.
Montanari A, Koutsoyiannis D (2012) A blueprint for process-based modeling of uncertain hydrological systems. Water Resour Res 48(9):W09555. https://doi.org/10.1029/2011WR011412
9.
Alfonso L, Tefferi M (2015) Effects of uncertain control in transport of water in a river-wetland system of the Low Magdalena River, Colombia. Transport of water versus transport over water. Springer, Cham, pp 131–144
10.
Domeneghetti A, Vorogushyn S, Castellarin A, Merz B, Brath A (2013) Probabilistic flood hazard mapping: effects of uncertain boundary conditions. Hydrol Earth Syst Sci 17(8):3127–3140. https://doi.org/10.5194/hess-17-3127-2013
11.
Hall J, Solomatine D (2008) A framework for uncertainty analysis in flood risk management decisions. Int J River Basin Manag 6(2):85–98. https://doi.org/10.1080/15715124.2008.9635339
12.
Merz B, Thieken AH (2005) Separating natural and epistemic uncertainty in flood frequency analysis. J Hydrol 309(1–4):114–132. https://doi.org/10.1016/j.jhydrol.2004.11.015
13.
Goetzinger J, Bardossy A (2008) Generic error model for calibration and uncertainty estimation of hydrological models. Water Resour Res 44:W00B07. https://doi.org/10.1029/2007WR006691
14.
Liu Y, Gupta HV (2007) Uncertainty in hydrologic modeling: toward an integrated data assimilation framework. Water Resour Res 43(7):1–18. https://doi.org/10.1029/2006WR005756
15.
Quinonero-Candela J, Rasmussen CE, Sinz F, Bousquet O, Schölkopf B (2006) Evaluating predictive uncertainty challenge. Machine learning challenges. Evaluating predictive uncertainty, visual object classification, and recognising tectual entailment. Springer, New York. http://link.springer.com/10.1007%2F11736790_1. Accessed 2 Mar 2016, pp 1–27
16.
Renard B, Kavetski D, Kuczera G, Thyer M, Franks SW (2010) Understanding predictive uncertainty in hydrologic modeling: the challenge of identifying input and structural errors. Water Resour Res 46(5):W05521. https://doi.org/10.1029/2009WR008328
17.
Wagener T, Gupta HV (2005) Model identification for hydrological forecasting under uncertainty. Stoch Environ Res Risk Assess 19(6):378–387. https://doi.org/10.1007/s00477-005-0006-5
18.
Melchers RE (1999) Structural reliability analysis and prediction, 2nd edn. Wiley, New York
19.
Abebe AJ, Solomatine DP, Venneker RGW (2000) Application of adaptive fuzzy rule-based models for reconstruction of missing precipitation events. Hydrol Sci J 45(3):425–436
20.
Bárdossy A, Bronstert A, Merz B (1995) 1-, 2- and 3-dimensional modeling of water movement in the unsaturated soil matrix using a fuzzy approach. Adv Water Resour 18(4):237–251
21.
Hundecha Y, Bardossy A, Theisen HW (2001) Development of a fuzzy logic-based rainfall-runoff model. Hydrol Sci J 46(3):363–376
22.
Plate E, Shahzad K (2015) Uncertainty analysis of multi-model flood forecasts. Water 7(12):6788–6809. https://doi.org/10.3390/w7126654
23.
Xuan Y, Cluckie ID, Wang Y (2009) Uncertainty analysis of hydrological ensemble forecasts in a distributed model utilising short-range rainfall prediction. Hydrol Earth Syst Sci 13(3):293–303
24.
Dogulu N, López López P, Solomatine DP, Weerts AH, Shrestha DL (2015) Estimation of predictive hydrologic uncertainty using the quantile regression and UNEEC methods and their comparison on contrasting catchments. Hydrol Earth Syst Sci 19:3181–3201. https://doi.org/10.5194/hess-19-3181-2015
25.
Shrestha DL, Solomatine DP (2006) Machine learning approaches for estimation of prediction interval for the model output. Neural Netw 19:225–235. https://doi.org/10.1016/j.neunet.2006.01.012
26.
Shrestha DL, Rodriguez J, Price RK, Solomatine DP (2006) Assessing model prediction limits using fuzzy clustering and machine learning. Proceedings of the 7th international conference on hydroinformatics, 4–8 September, Nice, France
27.
Beven K, Binley A (2014) GLUE: 20 years on. Hydrol Process 28:5897–5918. https://doi.org/10.1002/hyp.10082
28.
Beven K, Binley A (1992) The future of distributed models: model calibration and uncertainty prediction. Hydrol Process 6:279–298. https://doi.org/10.1002/hyp.3360060305
29.
Shrestha DL, Kayastha N, Solomatine D (2009) A novel approach to parameter uncertainty analysis of hydrological models using neural networks. Hydrol Earth Syst Sci 13:1235–1248
30.
Solomatine D, Shrestha DL (2009) A novel method to estimate total model uncertainty using machine learning techniques. Water Resour Res 45:W00B11. https://doi.org/10.1029/2008WR006839
31.
Beven K, Freer J (2001) Equifinality, data assimilation, and uncertainty estimation in mechanistic modelling of complex environmental systems using the GLUE methodology. J Hydrol 249:11–29
32.
Mantovan P, Todini E (2006) Hydrological forecasting uncertainty assessment: incoherence of the GLUE methodology. J Hydrol 330(1–2):368–381. https://doi.org/10.1016/j.jhydrol.2006.04.046
33.
Pappenberger F, Beven KJ (2006) Ignorance is bliss: or seven reasons not to use uncertainty analysis. Water Resour Res 42(5):1–8. https://doi.org/10.1029/2005WR004820
34.
Liu Y, Weerts AH, Clark M, Hendricks Franssen HJ, Kumar S, Moradkhani H, Seo DJ, Schwanenberg D, Smith P, Van Dijk AIJM, Van Velzen N, He M, Lee H, Noh SJ, Rakovec O, Restrepo P (2012) Advancing data assimilation in operational hydrologic forecasting: progresses, challenges, and emerging opportunities. Hydrol Earth Syst Sci 16(10):3863–3887. https://doi.org/10.5194/hess-16-3863-2012
35.
Seo DJ, Cajina L, Corby R, Howieson T (2009) Automatic state updating for operational streamflow forecasting via variational data assimilation. J Hydrol 367(3–4):255–275. https://doi.org/10.1016/j.jhydrol.2009.01.019
36.
Welles E, Sorooshian S, Carter G, Olsen B (2007) Hydrologic verification: a call for action and collaboration. Bull Am Meteorol Soc 88:503–511
37.
Yarvis M, Kushalnagar N, Singh H, Rangarajan A, Liu Y, Singh S (2005) Exploiting heterogeneity in sensor networks. Proceedings IEEE INFOCOM 2005. 24th annual joint conference of the IEEE computer and communications societies, vol 2, pp 878–890
38.
Bonney R, Shirk JL, Phillips TB, Wiggins A, Ballard HL, Miller-Rushing AJ, Parrish JK (2014) Next steps for citizen science. Science 343(6178):1436–1437. https://doi.org/10.1126/science.1251554
39.
Buytaert W, Zulkafli Z, Grainger S, Acosta L, Alemie TC, Bastiaensen J, De Bièvre B, Bhusal J, Clark J, Dewulf A, Foggin M, Hannah DM, Hergarten C, Isaeva A, Karpouzoglou T, Pandeya B, Paudel D, Sharma K, Steenhuis T, Tilahun S, Van Hecken G, Zhumanova M (2014) Citizen science in hydrology and water resources: opportunities for knowledge generation, ecosystem service management, and sustainable development. Front Earth Sci 2:1–21. https://doi.org/10.3389/feart.2014.00026
40.
Clark MP, Rupp DE, Woods RA, Zheng X, Ibbitt RP, Slater AG, Schmidt J, Uddstrom MJ (2008) Hydrological data assimilation with the ensemble Kalman filter: use of streamflow observations to update states in a distributed hydrological model. Adv Water Resour 31(10):1309–1324. https://doi.org/10.1016/j.advwatres.2008.06.005
41.
Mazzoleni M, Alfonso L, Solomatine D (2016) Influence of spatial distribution of sensors and observation accuracy on the assimilation of distributed streamflow data in hydrological modelling. Hydrol Sci J 62(3):389–407. https://doi.org/10.1080/02626667.2016.1247211
42.
Mazzoleni M, Noh SJ, Lee H, Liu Y, Seo DJ, Amaranto A, Alfonso L, Solomatine DP (2018) Real-time assimilation of streamflow observations into a hydrological routing model: effects of model structures and updating methods. Hydrol Sci J 63:386–407
43.
Rakovec O, Weerts AH, Hazenberg P, Torfs PJJF, Uijlenhoet R (2012) State updating of a distributed hydrological model with ensemble Kalman filtering: effects of updating frequency and observation network density on forecast accuracy. Hydrol Earth Syst Sci 16(9):3435–3449. https://doi.org/10.5194/hess-16-3435-2012
44.
Alfonso L, Lobbrecht A, Price R (2010) Using mobile phones to validate models of extreme events. 9th international conference on hydroinformatics, Tianjin, China, pp 1447–1454
45.
de Vos L, Leijnse H, Overeem A, Uijlenhoet R (2017) The potential of urban rainfall monitoring with crowdsourced automatic weather stations in Amsterdam. Hydrol Earth Syst Sci 21:765–777. https://doi.org/10.5194/hess-21-765-2017
46.
Etter S, Strobl B, Seibert J, van Meerveld I (2018) Value of uncertain streamflow observations for hydrological modelling. Hydrol Earth Syst Sci Discuss 22(10):5243–5257. https://doi.org/10.5194/hess-2018-355
47.
Fava C, Santana G, Bressiani DA, Rosa A, Horita FEA, Souza VCB, Mendiondo EM (2014) Integration of information technology systems for flood forecasting with hybrid data sources. International conference of flood management, Sao Paulo, Brazil
48.
Fohringer J, Dransch D, Kreibich H, Schröter K (2015) Social media as an information source for rapid flood inundation mapping. Nat Hazards Earth Syst Sci 15:2725–2738. https://doi.org/10.5194/nhess-15-2725-2015
49.
Gaitan S, van de Giesen NC, ten Veldhuis JAE (2016) Can urban pluvial flooding be predicted by open spatial data and weather data? Environ Model Softw 85:156–171. https://doi.org/10.1016/j.envsoft.2016.08.007
50.
Giuliani M, Castelletti A, Fedorov R, Fraternali P (2016) Using crowdsourced web content for informing water systems operations in snow-dominated catchments. Hydrol Earth Syst Sci 20:5049–5062. https://doi.org/10.5194/hess-20-5049-2016
51.
Rollason E, Bracken LJ, Hardy RJ, Large ARG (2018) The importance of volunteered geographic information for the validation of flood inundation models. J Hydrol 562:267–280
52.
Rosser JF, Leibovici DG, Jackson MJ (2017) Rapid flood inundation mapping using social media, remote sensing and topographic data. Nat Hazards 87:103–120
53.
Schneider P, Castell N, Vogt M, Dauge FR, Lahoz W, Bartonova A (2017) Mapping urban air quality in near real-time using observations from low-cost sensors and model information. Environ Int 106:234–247
54.
Smith L, Liang Q, James P, Lin W (2015) Assessing the utility of social media as a data source for flood risk management using a real-time modelling framework. J Flood Risk Manag 10:370–380. https://doi.org/10.1111/jfr3.12154
55.
Starkey E, Parkin G, Birkinshaw S, Large A, Quinn P, Gibson C (2017) Demonstrating the value of community-based (“citizen science”) observations for catchment modelling and characterisation. J Hydrol 548:801–817. https://doi.org/10.1016/j.jhydrol.2017.03.019
56.
Yu D, Yin J, Liu M (2016) Validating city-scale surface water flood modelling using crowd-sourced data. Environ Res Lett 11:124011. https://doi.org/10.1088/1748-9326/11/12/124011
57.
Le Coz J, Patalano A, Collins D, Guillén NF, García CM, Smart GM, Bind J, Chiaverinica A, Le Boursicauda R, Dramaisa G, Braud I (2016) Crowdsourced data for flood hydrology: feedback from recent citizen science projects in Argentina, France and New Zealand. J Hydrol 541:766–777
58.
Assumpção TH, Popescu I, Jonoski A, Solomatine DP (2018) Citizen observations contributing to flood modelling: opportunities and challenges. Hydrol Earth Syst Sci 22:1473–1489. https://doi.org/10.5194/hess-22-1473-2018
59.
Shanley L, Burns R, Bastian Z, Robson E (2013) Tweeting up a storm: the promise and perils of crisis mapping, available SSRN 2464599. https://ssrn.com/abstract=2464599. Accessed 20 Mar 2016
60.
Mazzoleni M, Alfonso L, Chacon-Hurtado J, Solomatine D (2015) Assimilating uncertain, dynamic and intermittent streamflow observations in hydrological models. Adv Water Resour 83:323–339
61.
Mazzoleni M, Cortes Arevalo VJ, Wehn U, Alfonso L, Norbiato D, Monego M, Ferri M, Solomatine DP (2018) Exploring the influence of citizen involvement on the assimilation of crowdsourced observations: a modelling study based on the 2013 flood event in the Bacchiglione catchment (Italy). Hydrol Earth Syst Sci 22:391–416. https://doi.org/10.5194/hess-22-391-2018
62.
Mazzoleni M, Verlaan M, Alfonso L, Monego M, Norbiato D, Ferri M, Solomatine DP (2017) Can assimilation of crowdsourced data in hydrological modelling improve flood prediction? Hydrol Earth Syst Sci 21:839–861. https://doi.org/10.5194/hess-21-839-2017
63.
Mazzoleni M (2017) Improving flood prediction assimilating uncertain crowdsourced data into hydrologic and hydraulic models. UNESCO-IHE PhD thesis series, CRC Press/Balkema, Leiden
64.
Mazzoleni M, Amaranto A, Solomatine DP (2019) Integrating qualitative flow observations in a lumped hydrologic routing model. Water Resour Res 55. https://doi.org/10.1029/2018WR023768
65.
WeSenseIt (2016) WeSenseIt: citizen water observatories. http://wesenseit.eu/. Accessed 19 Feb 2016
66.
Sugeno M, Yasukawa T (1993) A fuzzy-logic-based approach to qualitative modelling. IEEE Trans Fuzzy Syst 1:7–31
67.
Moore RJ, Jones DA, Cox DR, Isham VS (2000) Design of the HYREX raingauge network. Hydrol Earth Syst Sci 4(4):521–530. https://doi.org/10.5194/hess-4-521-2000
68.
Wood SJ, Jones DA, Moore RJ (2000) Accuracy of rainfall measurement for scales of hydrological interest. Hydrol Earth Syst Sci Discuss 4(4):531–543
69.
Szilagyi J, Szollosi-Nagy A (2010) Recursive streamflow forecasting: a state space approach. CRC Press, Leiden
70.
Cunge JA (1969) On the subject of a flood propagation computation method (Muskingum method). J Hydraul Res 7(2):205–230
71.
Ferri M, Monego M, Norbiato D, Baruffi F, Toffolon C, Casarin R (2012) La piattaforma previsionale per i bacini idrografici del Nord Est Adriatico (I). Proceedings XXXIII conference of hydraulics and hydraulic engineering, Brescia, p 10
72.
Huwald H, Barrenetxea G, de Jong S, Ferri M, Carvalho R, Lanfranchi V, McCarthy S, Glorioso G, Prior S, Solà E, Gil-Roldàn E, Alfonso L, Wehn de Montalvo U, Onencan A, Solomatine D, Lobbrecht A (2013) D1.11 sensor technology requirement analysis. Confidential deliverable, the WeSenseIt project (FP7/2007-2013 grant agreement no 308429)
73.
Duan Q, Ajami NK, Gao X, Sorooshian S (2007) Multi-model ensemble hydrologic prediction using Bayesian model averaging. Adv Water Resour 30(5):1371–1386. https://doi.org/10.1016/j.advwatres.2006.11.014Crossref
74.
Georgakakos AP, Georgakakos KP, Baltas EA (1990) A state-space model for hydrologic river routing. Water Resour Res 26:827–838
75.
Refsgaard JC (1997) Validation and intercomparison of different updating procedures for real-time forecasting. Nord Hydrol 28(2):65–84. https://doi.org/10.2166/nh.1997.005
76.
WMO (1992) Simulated real-time intercomparison of hydrological models. World Meteorological Organization, Geneva
77.
Moradkhani H, Hsu KL, Gupta H, Sorooshian S (2005) Uncertainty assessment of hydrologic model states and parameters: sequential data assimilation using the particle filter. Water Resour Res 41(5):W05012. https://doi.org/10.1029/2004WR003604
78.
Moradkhani H, Sorooshian S, Gupta HV, Houser PR (2005) Dual state–parameter estimation of hydrological models using ensemble Kalman filter. Adv Water Resour 28(2):135–147
79.
Salamon P, Feyen L (2009) Assessing parameter, precipitation, and predictive uncertainty in a distributed hydrological model using sequential data assimilation with the particle filter. J Hydrol 376(3–4):428–442
80.
Lü H, Yu Z, Zhu Y, Drake S, Hao Z, Sudicky EA (2011) Dual state-parameter estimation of root zone soil moisture by optimal parameter estimation and extended Kalman filter data assimilation. Adv Water Resour 34(3):395–406
81.
Kalman RE (1960) A new approach to linear filtering and prediction problems. J Basic Eng 82(1):35–45. https://doi.org/10.1115/1.3662552
82.
Heemink AW, Segers AJ (2002) Modeling and prediction of environmental data in space and time using Kalman filtering. Stoch Environ Res Risk Assess 16(3):225–240. https://doi.org/10.1007/s00477-002-0097-1
83.
Reichle RH, Crow WT, Keppenne CL (2008) An adaptive ensemble Kalman filter for soil moisture data assimilation. Water Resour Res 44(3):W03423. https://doi.org/10.1029/2007WR006357
84.
Robinson AR, Lermusiaux PFJ, Sloan III NQ (1998) Data assimilation. Sea 10:541–594
85.
Walker JP, Houser PR (2005) Hydrologic data assimilation. Adv Water Sci Methodol 41:233. https://doi.org/10.5772/1112
86.
Sun L, Seidou O, Nistor I, Liu K (2015) Review of the Kalman type hydrological data assimilation. Hydrol Sci J 61(13):2348–2366. https://doi.org/10.1080/02626667.2015.1127376
87.
Evensen G (2003) The ensemble Kalman filter: theoretical formulation and practical implementation. Ocean Dyn 53:343–367
88.
Heemink AW, Verlaan M, Segers AJ (2001) Variance reduced ensemble Kalman filtering. Mon Weather Rev 129(7):1718–1728
89.
Reichle R, McLaughlin DB, Entekhabi D (2002) Hydrologic data assimilation with the ensemble Kalman filter. Mon Weather Rev 130(1):103–114. https://doi.org/10.1175/1520-0493(2002)130<0103:HDAWTE>2.0.CO;2
90.
Weerts AH, El Serafy GYH (2006) Particle filtering and ensemble Kalman filtering for state updating with hydrological conceptual rainfall-runoff models. Water Resour Res 42(9):1–17. https://doi.org/10.1029/2005WR004093
91.
Anderson JL (2001) An ensemble adjustment Kalman filter for data assimilation. Mon Weather Rev 129:2884–2903
92.
Murphy JM (1988) The impact of ensemble forecasts on predictability. Q J Roy Meteorol Soc 114(480):463–493. https://doi.org/10.1002/qj.49711448010
93.
Pauwels VRN, De Lannoy GJM (2009) Ensemble-based assimilation of discharge into rainfall-runoff models: a comparison of approaches to mapping observational information to state space. Water Resour Res 45(8):W08428. https://doi.org/10.1029/2008WR007590
94.
De Lannoy GJM, Reichle RH, Houser PR, Pauwels VRN, Verhoest NEC (2007) Correcting for forecast bias in soil moisture assimilation with the ensemble Kalman filter. Water Resour Res 43(9):W09410. https://doi.org/10.1029/2006WR005449
95.
Brouwer T, Eilander D, Van Loenen A, Booij MJ, Wijnberg KM, Verkade JS, Wagemaker J (2017) Probabilistic flood extent estimates from social media flood observations. Nat Hazards Earth Syst Sci 17(5):735
96.
Cipra T, Romera R (1997) Kalman filter with outliers and missing observations. TEST 6(2):379–395. https://doi.org/10.1007/BF02564705