Calibration Summary Statistics

HEC-HMS calculates and displays four summary statistics that quantify model performance relative to observations: Nash-Sutcliffe Efficiency (NSE), Ratio of the Root Mean Square Error to the Standard Deviation Ratio (RSR), and Percent Bias (PBIAS) (Moriasi et al., 2007), as well as the Coefficient of Determination (R²) (Legates and McCabe, 1999). These statistics are summarized in the table below, which also covers the Modified Kling-Gupta Efficiency (MKGE).

Table. Calibration summary statistics

Criterion

Equation

Notes

Nash-Sutcliffe Efficiency (NSE)

1)	$\begin{array}{l}\displaystyle NSE=1-\left[\frac{\sum_{i=1}^{n}(Y_i^{obs} - Y_i^{sim})^2} {\sum_{i=1}^{n}(Y_i^{obs} - \bar Y_{obs})^2} \right]\end{array}$

NSE evaluates the relative difference between the magnitude of the residual data variance ("noise") and the measured data variance ("information").
It is a measure of how well the plot of observed versus simulated data fits the 1:1 line.
Ranges between $\begin{array}{l}-\infty\end{array}$ and 1.
An NSE of 1 indicates a one-to-one match between the observed and simulated values.
A negative NSE indicates that the mean of the observed values provides a better estimate of the observed data than the model.
NSE is widely used in hydrology and is considered a good statistic to represent the overall shape of the hydrograph.

Ratio of the Root Mean Square Error to the Standard Deviation Ratio (RSR)

2)	$\begin{array}{l}\displaystyle RSR=\frac{RMSE}{\sigma_o}=\frac{\left[\sqrt{\sum_{i=1}^{n}(Y_i^{obs}-Y_i^{sim})^2}\right]}{\left[\sqrt{\sum_{i=1}^{n}(Y_i^{obs}-\bar Y_{obs})^2}\right]}\end{array}$

Standardizes the root mean square error (RMSE) using the standard deviation of the observations.
The optimal value of RSR is 0.0, which indicates zero RMSE and therefore a perfect model simulation.
Lower RSR values indicate better model performance.

Percent Bias (PBIAS)

3)	$\begin{array}{l}\displaystyle PBIAS=\left[\frac{\sum_{i=1}^{n}(Y_i^{sim}-Y_i^{obs})\times 100}{\sum_{i=1}^{n}Y_i^{obs}}\right]\end{array}$

Provides a measure of whether the simulated values are, on average, larger or smaller than the corresponding observed values.
The optimal value is 0%; the magnitude of PBIAS has no upper bound.
A negative PBIAS means the model under-estimates the observed data.
A positive PBIAS means the model over-estimates the observed data.

NOTE:

The PBIAS sign convention in HEC-HMS is the opposite of the sign convention in Moriasi et al. (2007).

Coefficient of Determination (R²)

4)	$\begin{array}{l}\displaystyle R^2=\left[\frac{\sum_{i=1}^{n}(Y_i^{obs}-\bar Y_{obs})(Y_i^{sim}-\bar Y_{sim})}{\sqrt{\sum_{i=1}^{n}(Y_i^{obs}-\bar Y_{obs})^2}\,\sqrt{\sum_{i=1}^{n}(Y_i^{sim}-\bar Y_{sim})^2}}\right]^2\end{array}$

R² describes the degree of collinearity between simulated and observed data.
Represents the proportion of the variance in measured data explained by the model.
Varies between 0 and 1, with 1 being the optimal value.
Is oversensitive to outliers and insensitive to additive and proportional differences between model predictions and measured data.

Modified Kling-Gupta Efficiency (MKGE)

5)	$\begin{array}{l}\displaystyle MKGE = 1 - \sqrt{(r - 1)^2 + (\beta - 1)^2 + (\gamma - 1)^2}\end{array}$

6)	$\begin{array}{l}\displaystyle \beta = \frac{\bar Y_{sim}}{\bar Y_{obs}}\end{array}$

7)	$\begin{array}{l}\displaystyle \gamma= \frac{CV_s}{CV_o} = \frac{\sigma_s/\bar Y_{sim}}{\sigma_o/\bar Y_{obs}}\end{array}$

Multi-objective alternative to mean squared error and Nash-Sutcliffe Efficiency (NSE).
Can be decomposed into three terms: (1) correlation $\begin{array}{l}r\end{array}$, (2) bias ratio $\begin{array}{l}\beta\end{array}$, and (3) variability ratio $\begin{array}{l}\gamma\end{array}$.
The value of MKGE gives the lower limit of the three components ($\begin{array}{l}r\end{array}$, $\begin{array}{l}\beta\end{array}$, $\begin{array}{l}\gamma\end{array}$).
The original version of the KGE statistic uses a variability ratio of $\begin{array}{l}\sigma_s/\sigma_o\end{array}$ instead of $\begin{array}{l}CV_s/CV_o\end{array}$. Using the ratio of the coefficients of variation, rather than the ratio of the standard deviations, ensures that the bias and variability ratios are not cross-correlated.

Variables:

$\begin{array}{l}Y_i^{obs}\end{array}$ = $\begin{array}{l}i^{th}\end{array}$ observed value
$\begin{array}{l}Y_i^{sim}\end{array}$ = $\begin{array}{l}i^{th}\end{array}$ simulated value
$\begin{array}{l}\bar Y_{obs}\end{array}$ = the mean of observed data
$\begin{array}{l}\bar Y_{sim}\end{array}$ = the mean of simulated data
n = total number of observations
$\begin{array}{l}r\end{array}$ = correlation coefficient between simulated and observed runoff (dimensionless)
$\begin{array}{l}\beta\end{array}$ = bias ratio (dimensionless)
$\begin{array}{l}\gamma\end{array}$ = variability ratio (dimensionless)
$\begin{array}{l}CV\end{array}$ = coefficient of variation (dimensionless)
$\begin{array}{l}\sigma\end{array}$ = standard deviation
The indices $\begin{array}{l}s\end{array}$ and $\begin{array}{l}o\end{array}$ represent simulated and observed runoff values, respectively.
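The equations in the table can be sketched in a few lines of NumPy. This is a minimal illustration and not the HEC-HMS implementation: the function names (`nse`, `rsr`, `pbias`, `r2`, `mkge`) and the sample arrays are hypothetical, and the inputs are assumed to be paired observed and simulated values with no missing data. PBIAS follows the HEC-HMS sign convention stated above (positive means over-estimation).

```python
import numpy as np

def _pair(obs, sim):
    # Convert both inputs to float arrays (assumes equal length, no gaps).
    return np.asarray(obs, dtype=float), np.asarray(sim, dtype=float)

def nse(obs, sim):
    # Equation 1: one minus residual sum of squares over observed sum of squares.
    obs, sim = _pair(obs, sim)
    return 1.0 - np.sum((obs - sim) ** 2) / np.sum((obs - obs.mean()) ** 2)

def rsr(obs, sim):
    # Equation 2: root of the residual sum over root of the observed sum.
    obs, sim = _pair(obs, sim)
    return np.sqrt(np.sum((obs - sim) ** 2)) / np.sqrt(np.sum((obs - obs.mean()) ** 2))

def pbias(obs, sim):
    # Equation 3, HEC-HMS convention: simulated minus observed, in percent.
    obs, sim = _pair(obs, sim)
    return 100.0 * np.sum(sim - obs) / np.sum(obs)

def r2(obs, sim):
    # Equation 4: squared correlation coefficient.
    obs, sim = _pair(obs, sim)
    num = np.sum((obs - obs.mean()) * (sim - sim.mean()))
    den = np.sqrt(np.sum((obs - obs.mean()) ** 2)) * np.sqrt(np.sum((sim - sim.mean()) ** 2))
    return (num / den) ** 2

def mkge(obs, sim):
    # Equations 5 to 7. The n or n-1 divisor in the standard deviation
    # cancels in gamma because both series have the same length.
    obs, sim = _pair(obs, sim)
    r = np.corrcoef(obs, sim)[0, 1]
    beta = sim.mean() / obs.mean()
    gamma = (sim.std() / sim.mean()) / (obs.std() / obs.mean())
    return 1.0 - np.sqrt((r - 1.0) ** 2 + (beta - 1.0) ** 2 + (gamma - 1.0) ** 2)

# Illustrative flows only, not real data.
obs = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
sim = np.array([1.1, 1.9, 3.2, 3.8, 5.3])
for name, fn in [("NSE", nse), ("RSR", rsr), ("PBIAS", pbias), ("R2", r2), ("MKGE", mkge)]:
    print(name, round(float(fn(obs, sim)), 3))
```

To compare against a value reported with the Moriasi et al. (2007) convention, the sign of `pbias` would be flipped.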

HEC-HMS also reports observed and computed maximum flow, time of peak and total volume. These measures are also useful in the calibration process.

As a reminder, the following basic statistical measures are useful for this discussion:

Residual variance = sum of squared differences between the observed and simulated values = $\begin{array}{l}\sum_{i=1}^{n}(Y_i^{obs} - Y_i^{sim})^2\end{array}$
Measured data variance = sum of squared differences between the individual observed values and the mean of the observed values = $\begin{array}{l}\sum_{i=1}^{n}(Y_i^{obs} - \bar Y_{obs})^2\end{array}$
Standard deviation ( $\begin{array}{l}\sigma\end{array}$ ) is the square root of the variance.
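These two sums also link NSE and RSR. As a short worked step, assuming the definitions in equations 1 and 2, the ratio of the residual sum to the measured-data sum appears in both statistics:

$\begin{array}{l}\displaystyle RSR^2=\frac{\sum_{i=1}^{n}(Y_i^{obs}-Y_i^{sim})^2}{\sum_{i=1}^{n}(Y_i^{obs}-\bar Y_{obs})^2}=1-NSE\end{array}$

For example, NSE = 0.75 corresponds to RSR = 0.50, and NSE = 0.50 corresponds to RSR ≈ 0.71. The two statistics therefore rank simulations identically and differ only in scale.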

Performance ranges

Suggested model performance ranges of the four summary statistics for evaluating streamflow, adapted from Moriasi et al. (2007, 2015), are summarized in the table below. Note that these ranges were derived for continuous flow data at daily and monthly time steps at the watershed scale. The acceptable values of the summary statistics will vary for your project, depending on the time step, the uncertainty in the observed data and boundary conditions, and the project scope.

Table. HEC-HMS Performance Ratings for Summary Statistics

Performance Rating	NSE	RSR	PBIAS (%)	R²
Very Good	0.75 < NSE ≤ 1.00	0.00 < RSR ≤ 0.50	|PBIAS| < 10	R² ≥ 0.85
Good	0.65 < NSE ≤ 0.75	0.50 < RSR ≤ 0.60	10 ≤ |PBIAS| < 15	0.70 ≤ R² < 0.85
Satisfactory	0.50 < NSE ≤ 0.65	0.60 < RSR ≤ 0.70	15 ≤ |PBIAS| < 25	0.50 ≤ R² < 0.70
Unsatisfactory	NSE ≤ 0.50	RSR > 0.70	|PBIAS| ≥ 25	R² < 0.50
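The rating table can be read as a set of threshold lookups. The sketch below is one possible encoding, not HEC-HMS code: the function names are hypothetical, an RSR of exactly 0 is treated as Very Good, and an R² of exactly 0.5 is assigned to Satisfactory. Both of those boundary choices are assumptions.

```python
def rate_nse(value):
    # Higher is better; each class is open at its lower bound.
    if value > 0.75:
        return "Very Good"
    if value > 0.65:
        return "Good"
    if value > 0.50:
        return "Satisfactory"
    return "Unsatisfactory"

def rate_rsr(value):
    # Lower is better; each class is closed at its upper bound.
    if value <= 0.50:
        return "Very Good"
    if value <= 0.60:
        return "Good"
    if value <= 0.70:
        return "Satisfactory"
    return "Unsatisfactory"

def rate_pbias(value):
    # Only the magnitude matters for the rating; the sign gives the direction.
    magnitude = abs(value)
    if magnitude < 10:
        return "Very Good"
    if magnitude < 15:
        return "Good"
    if magnitude < 25:
        return "Satisfactory"
    return "Unsatisfactory"

def rate_r2(value):
    # Higher is better; each class is closed at its lower bound.
    if value >= 0.85:
        return "Very Good"
    if value >= 0.70:
        return "Good"
    if value >= 0.50:
        return "Satisfactory"
    return "Unsatisfactory"

# Illustrative values only.
print(rate_nse(0.72), rate_rsr(0.53), rate_pbias(-12.0), rate_r2(0.88))
```

Because the four statistics are rated independently, a single simulation can fall in different classes for different statistics, which is a reason to report all of them rather than one.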

Performance ranges for MKGE are not provided in HEC-HMS. Using the mean flow as a predictor results in an NSE of 0 and an MKGE of 1 - $\begin{array}{l}\sqrt2\end{array}$ ≈ -0.41. MKGE values greater than -0.41 indicate that the model performs better than the mean flow. NSE and MKGE values cannot be directly compared because the relationship between them depends in part on the coefficient of variation of the observed time series (Knoben et al., 2019). Modelers should analyze the MKGE components (correlation coefficient, bias ratio, and variability ratio) to better understand the model error.
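The mean-flow benchmark can be checked by substituting a constant simulation, $\begin{array}{l}Y_i^{sim}=\bar Y_{obs}\end{array}$ for all i, into the MKGE components. This is a sketch under one assumption: the correlation of a constant series is formally undefined and is taken as 0 here.

$\begin{array}{l}\displaystyle \beta=\frac{\bar Y_{sim}}{\bar Y_{obs}}=1,\qquad \gamma=\frac{\sigma_s/\bar Y_{sim}}{\sigma_o/\bar Y_{obs}}=0 \ \ (\sigma_s=0),\qquad r=0\end{array}$

$\begin{array}{l}\displaystyle MKGE=1-\sqrt{(0-1)^2+(1-1)^2+(0-1)^2}=1-\sqrt2\approx-0.41\end{array}$

For the same constant simulation, the residual sum of squares equals the measured-data sum of squares, so NSE = 1 - 1 = 0.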