Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores

Abstract

Abstract. A traditional metric used in hydrology to summarize model performance is the Nash–Sutcliffe efficiency (NSE). Increasingly an alternative metric, the Kling–Gupta efficiency (KGE), is used instead. When NSE is used, NSE = 0 corresponds to using the mean flow as a benchmark predictor. The same reasoning is applied in various studies that use KGE as a metric: negative KGE values are viewed as bad model performance, and only positive values are seen as good model performance. Here we show that using the mean flow as a predictor does not result in KGE = 0, but instead KGE =1-√2≈-0.41. Thus, KGE values greater than −0.41 indicate that a model improves upon the mean flow benchmark – even if the model's KGE value is negative. NSE and KGE values cannot be directly compared, because their relationship is non-unique and depends in part on the coefficient of variation of the observed time series. Therefore, modellers who use the KGE metric should not let their understanding of NSE values guide them in interpreting KGE values and instead develop new understanding based on the constitutive parts of the KGE metric and the explicit use of benchmark values to compare KGE scores against. More generally, a strong case can be made for moving away from ad hoc use of aggregated efficiency metrics and towards a framework based on purpose-dependent evaluation metrics and benchmarks that allows for more robust model adequacy assessment.

Keywords

Benchmark (surveying)Metric (unit)MathematicsComputer scienceStatisticsEconometricsEngineeringOperations management

Affiliated Institutions

Related Publications

A distribution function approach to rainfall runoff modeling

R. J. Moore , Robin T. Clarke

This paper begins with a critique of existing rainfall runoff models and proceeds to a largely new formulation in which the single store (representing, for example, interception...

1981 Water Resources Research 225 citations

The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation

Davide Chicco , Matthijs J. Warrens , Giuseppe Jurman

Regression analysis makes up a large part of supervised machine learning, and consists of the prediction of a continuous independent target from a set of other predictor variabl...

2021 PeerJ Computer Science 3896 citations

Relational Knowledge Distillation

Wonpyo Park , Dongju Kim , Yan Lu +1 more

Knowledge distillation aims at transferring knowledge acquired in one model (a teacher) to another model (a student) that is typically smaller. Previous approaches can be expres...

2019 2019 IEEE/CVF Conference on Computer ... 1437 citations

A partial area model for storm flow synthesis

Edwin T. Engman , A. S. Rogowski

The storm Hydrograph model described is based on the partial contributing area concept. It utilizes a physically based infiltration capacity distribution for computation of rain...

1974 Water Resources Research 75 citations

Relating Riparian Vegetation to Present and Future Streamflows

Gregor T. Auble , Jonathan M. Friedman , Michael L. Scott

The intense demand for river water in arid regions is resulting in widespread changes in riparian vegetation. We present a direct gradient method to predict the vegetation chang...

1994 Ecological Applications 306 citations

Publication Info

Year: 2019
Type: article
Volume: 23
Issue: 10
Pages: 4323-4331
Citations: 1208
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1208

OpenAlex

Cite This

APA Style

                            
                                    Wouter Knoben, 
                                
                                    Jim Freer, 
                                
                                    Ross Woods
                                
                            (2019). 
                            Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores. 
                            Hydrology and earth system sciences
                            , 23
                            (10)
                            , 4323-4331.
                            https://doi.org/10.5194/hess-23-4323-2019

Identifiers

DOI: 10.5194/hess-23-4323-2019