Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Risks Due to Variability of K-Day Extreme Precipitation Totals and Other K-Day Extreme Events

Risks Due to Variability of K-Day Extreme Precipitation Totals and Other K-Day Extreme Events J. Hydrol. Hydromech., 57, 2009, 4, 250­263 DOI: 10.2478/v10098-009-0022-0 PAVOL BRUNOVSKÝ, MILAN LAPIN, IGOR MELICHERCÍK, JÁN SOMORCÍK, DANIEL SEVCOVIC Faculty of Mathematics, Physics and Informatics, Mlynská dolina, SK-842 48 Bratislava, Slovak Republic; Mailto: brunovsky@fmph.uniba.sk, milan.lapin@fmph.uniba.sk, igor.melichercik@fmph.uniba.sk; jan.somorcik@fmph.uniba.sk, igor.sevcovic@fmph.uniba.sk Several alternative definitions of extreme events are proposed. As the first step a statistical analysis of daily precipitation measurement time series from the Hurbanovo SHMI Observatory and elaboration of potentially dangerous precipitation events is carried out. Then, combined characteristics based on daily temperature, daily air humidity and daily precipitation totals are computed. The drought index based on normalized deviations from long-term averages is defined. Alternatively, to define extreme events "Data envelopment analysis" (DEA) is employed with K-day periods of values of temperature, humidity and precipitation corresponding to decision making units. In this paper we have used the period of K = 10 days for both methodologies for identification of extreme events. The results of all definitions of extreme events are compared. KEY WORDS: Extreme Events, Precipitation, Humidity, Temperature, Drought Index, DEA Analysis. Pavol Brunovský, Milan Lapin, Igor Melichercík, Ján Somorcík, Daniel Sevcovic: RIZIKÁ SPÔSOBENÉ VARIABILITOU K-DENNÝCH EXTRÉMNYCH ÚHRNOV ZRÁZOK A INÝMI EXTRÉMNYMI UDALOSAMI. J. Hydrol. Hydromech., 57, 2009, 4, 11 lit., 17. obr., 2 tab. V clánku navrhujeme niekoko definícií extrémnych udalostí. Ako prvý krok je vypracovaná statistická analýza denných úhrnov zrázok z observatória SHMÚ v Hurbanove, na základe ktorej oznacujeme extrémne udalosti. Následne pocítame kombinované charakteristiky období sucha zalozené na denných údajoch teploty, vlhkosti vzduchu a denných úhrnoch zrázok. Index sucha je zalozený na normalizovaných odchýlkach od dlhodobých priemerov. Alternatívne definujeme extrémne udalosti na základe DEA analýzy, kde K-denné periódy teploty, vlhkosti a zrázok slúzia ako rozhodovacie jednotky. V tomto clánku sme na identifikáciu extrémnych udalostí pre obe metodológie pouzili periódu K = 10 dní. Výsledky vsetkých prístupov nakoniec porovnávame. KÚCOVÉ SLOVÁ: extrémne udalosti, atmosférické zrázky, vlhkos, teplota, index sucha, DEA analýza. Introduction It is commonly accepted that one of the features of the climatic changes in the past is an increasing number of extreme weather events of various kind ­ droughts, floods, windstorms, etc. But what is an extreme event? For instance, drought cannot be fully characterized by a single quantity. Rather, it is a combination of high temperature, low humidity, low precipitation and possibly other quantities like duration. Extreme events are commonly being characterized by drought indices synthesizing several kinds of data (Klementová, Litschman, 2001). The disad250 vantage of this method is that it involves subjective components, e.g. the choice of weights of the data and the choice of the threshold. The main goal of this paper is to introduce a completely new methodology of the identification of extreme events, the Data Envelopment Analysis (DEA). We compare its results with the results of the drought index approach in the case of long term meteorological data observations obtained from the station in Hurbanovo, Slovakia. We recall that DEA has been widely used in the field of operations research as a tool for the evaluation of "decision making units" (see e.g. Cooper et al., 2000, Knox Lovell, Pastor, 1995, Sevcovic et al., 2001). To our knowledge this is the first application of DEA for the definition of extreme events. Extreme precipitation totals We studied the time series of daily precipitation totals at Hurbanovo from 1901 to 2006. Hurbanovo Observatory (115 m a.s.l.) is a representative meteorological station for the Danubian lowland in Slovakia. It is considered to be one of the best meteorological stations in Central Europe with sufficiently long and good-quality observations since 1871 (Melo, 2005; Lapin, Melo, 2004). Method of extreme one-day and 10-day precipitation totals analysis and its application in Slovakia was presented also in Kohnová et al. (2006), Hlavcová et al. (2008) and Szolgay et al. (2009). Complete data series including precipitation, humidity and temperature from 1951 to 2006 were used in our qualitative and quantitative analysis of the Drought index and DEA methodology for identification of extreme events. Initially we computed yearly averages of daily precipitation totals (Fig. 1). The dots stand for yearly averages of precipitation totals over rainy days (with precipitation total > 0.0 mm) whereas triangles represent yearly averages over all days of particular years. The dots lie visibly above the triangles. The U-shape of the solid line suggests that daily precipitation totals over rainy days during the beginning of the 20th century as well as during several last years were higher than in the middle part. The difference of the solid and dashed lines is depicted in Fig. 2. Here the U-shape is even more visible. Since the series of the yearly precipitation totals seems to be stationary (dashed line in Fig. 1) this means that the numbers of rainy days at the beginning of the past century and in the last years were lower than in between, whereas rainfalls were heavier. Note that the results might be affected by lower quality of observations at low precipitation totals (0.1 to 0.4 mm) before 1920. The above formulated conjecture about the trend of the numbers of rainy days based on Figs. 1 and 2 is in perfect agreement with Fig. 3 showing the number of rainy days. Further, we studied the occurrence of extreme rainfalls. For the one-day precipitation totals the limits were set to 40 mm, 50 mm, 60 mm and 70 mm. These limits were exceeded 38-, 10-, 3- and 3times respectively. The distribution of these extreme events between 1901 and 2006 can be seen in Fig. 4. We note that the highest one-day precipitation total was 88.8 mm on August 20, 1918. The study was extended to two- and three-day precipitations totals (see Fig. 5 and 6). The limits were set to 50 mm, 60 mm, 70 mm and 80 mm in case of two-day precipitation totals (exceeded 37-, 14-, 4- and 3-times respectively) and to 60 mm, 70 mm, 80 mm and 90 mm in case of three-day precipitation totals (exceeded 31-, 11-, 5- and 2-times respectively). Fig. 1. Average one-day precipitation totals during rainy days (dot marks) and during all days (triangle marks). Obr. 1. Priemerné jednodové úhrny zrázok pocas zrázkových (bodky) a vsetkých dní (trojuholníky). Fig. 2. Difference between average one-day precipitation totals during rainy days and during all days. Obr. 2. Rozdiel medzi priemernými jednodovými úhrnmi zrázok pocas zrázkových a vsetkých dní. Fig. 3. Number of rainy days. Obr. 3. Pocet zrázkových dní. Figs. 4, 5 and 6 suggest that the distribution of heavy rainfalls in the period from 1901 to 2006 is rather uniform. The effect of a heavy rainfall can become really serious when it continues to rain in the following days. This motivated us to look at the average 252 number of rainy days after one- two- or three-days with precipitation totals over 40 mm, 50 mm or 60 mm respectively (the average is taken through all extreme events in a particular year). The results are in Figs. 7, 8 and 9. No increasing or decreasing trend can be seen there. Fig. 4. Number of one-day precipitation totals exceeding a given threshold value. Obr. 4. Pocet jednodových úhrnov zrázok presahujúcich danú prahovú hodnotu. Fig. 5. Number of two-day precipitation totals exceeding a given threshold value. Obr. 5. Pocet dvojdových úhrnov zrázok presahujúcich danú prahovú hodnotu. Fig. 6. Number of three-day precipitation totals exceeding a given threshold value. Obr. 6. Pocet trojdových úhrnov zrázok presahujúcich danú prahovú hodnotu. Fig. 7. Average number of rainy days after one-day precipitation totals over 40 mm. Obr. 7. Priemerný pocet zrázkových dní po jednodových úhrnoch zrázok viac ako 40 mm. Fig. 8. Average number of rainy days after two-day precipitation totals over 50 mm. Obr. 8. Priemerný pocet zrázkových dní po dvojdových úhrnoch zrázok viac ako 50 mm. Fig. 9. Average number of rainy days after three-day precipitation totals over 60 mm. Obr. 9. Priemerný pocet zrázkových dní po trojdových úhrnoch zrázok viac ako 60 mm. Drought index To begin we investigated the occurrence of series of K days with very low precipitation totals. We searched for series of at least 5 no-rain days, series of at least 15 days with precipitation totals below 1 mm and series of at least 30 days with precipitation totals lower than 5 mm. Their occurrence is shown in Figs. 10, 11 and 12. Note the U-shape in Fig. 10 what is probably due to the lower number of rainy days at the beginning of the last century and in the last years (see Fig. 3). On the other hand, Figs. 11 and 12 do not show any deviation from the stationary trend: the occurrence of longer drought periods does not seem to have changed over the last century. As a second step we have calculated the combined characteristics based on daily temperature, daily air humidity and daily precipitation. The drought index Dij for the K-day period i (in our calculations K = 10 successive days) in the j-th year we defined by 255 Fig. 10. Number of series of at least 5 no-rain days. Obr. 10. Pocet postupností aspo 5 bezzrázkových dní. Fig. 11. Number of series of at least 15 days with precipitation totals under 1 mm. Obr. 11. Pocet postupností aspo 15 dní s úhrnom zrázok do 1 mm. Dij = wt tij - t - wp pij - p - wh hij - h (1) t , p , h ­ long term (period 1951­2005) averages of temperature, precipitation and humidity respectively, t , p , h ­ standard deviations (period 1951­ 2005) of the temperature, precipitation and humidity respectively, wt , wp , wh ­ the weights of the temperature, precipitation and humidity factors fulfilling where tij , pij , hij are the average values of temperature, precipitation and humidity respectively in the period i and year j, wt + wp + wh = 1 . Fig. 12. Number of series of at least 30 days with precipitation totals under 5 mm. Obr. 12. Pocet postupností aspo 30 dní s úhrnom zrázok do 5 mm. Let us note that the definition of the drought index is similar to Klementová, Litschman (2001) where only temperature and precipitation was taken into account. Furthermore, they used the entire growing season as a period. Instead of the original observation data tij , pij , hij , the drought index employs the normalized variables tij - t pij - p hij - h , , with zero mean and standard deviation equal to 1. This is why there was no reason to weight them by other than equal weights wt = w p = wh = 1/ 3 . Let us emphasize that these normalized variables need not be normally distributed. It turned out from our data that skewness (measuring asymmetry of the distribution) is negative for normalized temperature and humidity (­0.22 resp. ­0.23) and is positive for precipitation (1.87). Kurtosis (measuring fatness of distribution tails) is positive for precipitation (4.62) indicating leptokurtic character of precipitation data (fat tails). On the other hand kurtosis for temperature and humidity is negative (­1.01 resp. ­0.61) indicating platykurtic character of data (thin tails). Notice that different weights would not affect deviations of skewness and kurto- sis from the case of normal distribution. To overcome this difficulty one should incorporate higher statistical moments into the definition of our Drought index. Hereafter we identify the K = 10-day period with the end of the period. The day is supposed to be a drought, if the drought index of the corresponding period is higher than the 95% sample quantile of calculated indices (see Fig. 13). The number of drought days in particular years can be seen in Fig. 14 (left). One can observe that this number is slightly increasing. Monthly distributions of the drought days could be seen in Fig. 14 (right). The month with the highest number of drought days is August. Since for agriculture the growing season is more important than the rest of the year we carried out the same study as above but over precipitation, temperature and humidity data from April to the end of September only. One can observe from Fig. 15 that the results are very similar to the results with complete year benchmark. Recall the standardization (1). Another approach to the standardization of the data from the i-th period in the j-th year is to use long-term averages and standard deviation estimates appertaining to the i-th period. The drought index is then defined as Dij = wt tij - ti ti - wp pij - pi pi - wh hij - hi hi (2) Fig. 13. Histogram of drought indices and the sample 95% quantile. Obr. 13. Histogram indexov sucha a výberový 95% kvantil. Fig. 14. Number of drought days in corresponding years (left) and monthly distribution (right). Obr. 14. Pocty suchých dní v príslusných rokoch (vavo) a mesacné rozlozenie (vpravo). Fig. 15. Growing season (April­September): number of drought days in corresponding years (left) and monthly distribution (right). Obr. 15. Vegetacné obdobie (apríl­september): pocty suchých dní v príslusných rokoch (vavo) a mesacné rozlozenie (vpravo). Fig. 16. Alternative method of indication of drought days (growing season April­September): number of drought days in corresponding years (left) and monthly distribution (right). Obr. 16. Alternatívna metóda urcenia suchých dní (vegetacné obdobie apríl­september): pocty suchých dní v príslusných rokoch (vavo) a mesacné rozlozenie (vpravo). We consider this approach as more natural because it compares weather parameters from a particular period with averages for the same period in different years rather than long-term averages of weather parameters of all periods. We tested our approach on the growing season data. The results of year comparisons are depicted in Fig. 16 (left) and are quite similar to those in Fig. 14 (left) and 15 (left). However, the monthly distribution has changed: as one can see in Fig. 16 (right) also in May and June drought periods occur quite frequently (cf. Fig. 14 (right) and 15 (right)). Identification of extreme events by means of the DEA methodology Data Envelopment Analysis (DEA) has its origin in Operations Research. It is a tool for the evaluation of "decision making units" (DMUs) the efficiency of which is measured by several parameters rather than a single quantity. Its basic aim is to single out efficient DMUs. Those are DMUs that lie on the so-called efficient frontier. This is, by definition, a part of the boundary of the (smallest convex) region filled by the parameter vectors of all the DMUs (see e.g. Cooper et al., 2000). Equivalently, a DMU is efficient in case one can find weights under which it maximizes the weighted sum of parameters. This equivalence is a basic result of the mathematical theory of convex sets. To draw the analogy to extremes of meteorological events just replace DMUs by events and efficient DMUs by extreme events. That is, a K-day period is extreme if and only if there are weights such that it maximizes the weighted sum of the string of data (precipitation, temperature, humidity) over all periods it is compared to. The DEA identification method can be best explained by comparing it to the index method of the previous section. Imagine that for an index with fixed weights one would declare as extreme the event with maximal index. Of course, unless a very unlikely coincidence of the values of the indices would take place, such an event would be unique. Therefore, one has to declare as extreme those events the indices of which exceed a certain subjectively chosen threshold. DEA, on the other hand, does not hold the weights fixed for all events. Rather, the weights are chosen individually for each event. An event o is declared extreme if weights can be found such that the value of the index of o is maximal. Of course, in such a case the number of extreme events will become higher than one. Equivalently, one can phrase this in terms of "combined" events. By a combined event we will understand a fictive event the parameter set of which is a convex combination (the coefficients of which may be interpreted as probabilities) of two or more events. An event o is extreme if there is no combined event dominating o, that is, all parameters of which are more extreme (in our example, temperature is higher, precipitation and humidity are lower). Similarly as in the case of an index with chosen weights, an event is trivially extreme if it dominates all other events. However, an event can be declared extreme also if this is not the case (say, there was no day with extreme temperature but the tempera259 ture was high for an extended period and there was little precipitation). So, what is then the difference to the index method? Unlike the index method, DEA is completely free of subjective components. Because weights are automatically computed for every event individually, there is no need of their advance subjective choice. Further, the selection of extreme events is qualitative, not requiring a choice of a quantitative threshold. In our application we chose the period of K = 10 days. Consequently, an event is characterized by 3K(= 30 in our case) parameters. There are several computation methods for DEA, all of them based on linear programming algorithms. We have employed the weighted "additive model" method used in Sevcovic et al. (2001). It was first described by Knox Lovell and Pastor (1995). In what follows, we briefly describe this model for our particular case. As above, we use the notation t, p, h for temperature, precipitation and humidity, respectively and keep the index j to label all the r periods of the pool, the value j = o being used for the period tested for extreme. In addition, we introduce the upper index i = 1,..., K(= 10 in our case) for the particular day of the period. That is, tji will stand for the temperature during the ith day of the period labeled by j etc. In order to evaluate efficiency of the period labeled by o one solves the normalized weighted additive DEA model, which is described by the following linear program: Find multiplicators j and "slacks" sj ti, sj pi, sj hi so as to minimize K K K - vt sti - v p s pi - v h s hi i =1 i =1 i =1 vt = (1/ t ), v p = (1/ p ), v h = (1/ h ) , t , p , h being the sample standard deviations (period 1951­ ­2005) of temperature, precipitation and humidity respectively. The period labeled o is rated as extreme if the optimal value to (3) is zero, or equivalently, all the slacks vanish. It is not immediately obvious that the events with zero slacks are precisely the extreme events introduced above. Intuitively, slacks measure the extent to which an event is dominated by combined events. Zero slacks mean that the event is not dominated at all. A precise formulation requires knowledge of mathematical duality theory. Further note that this computation method does not yield the choice of weights for the tested period o. Their existence is mathematically proven and can be computed. We skip their computation because it is not important for our purpose. Dry and hot extreme periods In accord with the previous section we define the extreme period as a period with high temperature, low precipitation and low humidity. The number of drought days in corresponding years can be seen in Fig. 17 (left). One can observe that this number is slightly increasing. The highest number of drought days (97) was observed in the year 2003. This is in accordance with the index method (see Fig. 14­16 (left)). The monthly distribution of the drought days could be seen in Fig. 17 (right). The months with the highest number of drought days are July and August. Quantitative comparison of Drought index and DEA methodologies under the constraints r i t j j j =1 i - s ti = to , i + s pi = po , i + s hi = ho , r i p j j j =1 r i h j j j =1 r j j =1 (3) = 1, j 0, j = 1,..., r, s ti 0, s pi 0, s hi 0, i = 1,..., K (= 10) The v's are normalization constants defined by Tab. 1 summarizes extreme events assessed by both methodologies. One can see that 1176 periods were assessed as extreme using alternatively DEA or Drought index methodology and only 383 of them were extreme using both methodologies. Number of DEA extremes (598) is smaller comparing to Drought index extremes (961). One can observe that 383/598 = 64% of DEA extremes are also Drought index extremes and only 383/961 = 39.9% of Drought index extremes are also DEA extremes. The DEA methodology seems to be more conservative for identification of extreme events. Fig. 17. DEA method: number of extreme days in corresponding years (left) and monthly distribution (right). Obr. 17. DEA metóda: pocty extrémnych dní v príslusných rokoch (vavo) a mesacné rozlozenie (vpravo). Tab. 2 contains data for two different periods. The period in the right part of the table (year 1954, days 220­229) was identified as extreme by the DEA methodology and not extreme by the Drought index methodology. The reason is that the Drought index methodology considers only average values of relative humidity, precipitation and temperature from the K = 10-days period. These values are not extreme. On the other hand, one can observe very low humidity, no precipitation and high temperature in the 7-th day of the period. Taking into account these values one may expect a high probability of the wildfire risk and it makes sense to consider this day as extreme. The DEA methodology uses all parameters of all days of the K = 10 days period and therefore is able to identify such an extreme event. T a b l e 1. Numbers of extreme events. T a b u k a 1. Pocty extrémnych udalostí. DEA extremes Drought index extremes DEA and Drought index extremes DEA or Drought index extremes 598 961 383 1176 assessing extreme events in agriculture, where one day extreme events are not so important. T a b l e 2. Example of differently identified extreme events. (H ­ humidity in %, P ­ precipitation totals in tenth of mm, T ­ temperature in °C.) T a b u k a 2. Príklad rozdielne posúdených extrémnych udalostí. (H ­ rel. vlhkos v %, P ­ úhrny zrázok v desatinách mm, T ­ teplota v °C.) Extreme Drought index Extreme DEA 1952, days: 222­231 1954, days: 220­229 Day H 1 62.3 2 58.7 3 53.7 4 56 5 51.7 6 49 7 60.7 8 64.7 9 85 10 64.3 Mean 60.61 P 0 0 0 0 0 0 0 35 2 16 5.3 T 22.2 21.7 25 26 26.8 27.8 25.5 22.1 16.9 21.1 23.51 H P T 58 1 21.6 74.3 0 22.3 74 53 16.4 67.3 0 17.3 65.7 0 17.5 53.3 0 22 38.3 0 26.4 53 136 24.3 82 66 14.6 69.3 0 16.9 63.52 25.6 19.93 Conclusions The period in the left part of Tab. 2 (year 1952, days 222­231) was assessed extreme by the Drought index methodology and not extreme by the DEA methodology. The last row of Tab. 2 indicates that average values of observed parameters are more extreme comparing to the period in the right part of the table (lower humidity and precipitation and higher temperature). Therefore, the period in the right part of Tab. 2 is not considered to be extreme. The average values of humidity, precipitation and temperature are more adequate e.g. when We have studied several methodologies of defining extreme events. The methodologies have been compared using the time series of daily precipitation, humidity and temperature at Hurbanovo from 1901 to 2006. First we have calculated yearly averages of daily precipitation totals. These averages appear to be stationary. Comparing them to the averages over the rainy days only, one can conclude that at the beginning and at the end of the last century the number of rainy days was lower and the rainfalls were heavier. Further we have studied the 261 occurrence of 1­3 days extreme rainfalls. The distribution of heavy rainfalls in the period from 1901 to 2006 is quite uniform. As a second step we have considered the drought index that combines characteristics based on daily temperature, relative air humidity and precipitation. The number of drought days seems to be increasing. Finally, we have used the DEA methodology for the definition of extreme events. To our knowledge this is the first application of the DEA method to the determination of extreme events of any kind rather than efficiency of performance. Unlike the index method, DEA method is not subject to subjective choices of the weights and the threshold but is more computationally involved. The DEA results largely confirm the ones obtained by the drought index: The number of extreme drought periods is increasing over years, their highest occurrence being in July and August. Continuation of this study may contribute to science of disasters (Bunde et al., 2002). Preliminary results indicate that DEA method is more adequate for identifying short-term extreme events, whereas the Drought index methodology should be preferred for long-term ones. Acknowledgments. This research was supported by the European Commission under the 6th framework programme project No. 037063 "GEO-BENE". Some parts of this paper are based on the results of projects VEGA No. 1/4033/07 and observed SHMI data; the authors thank SHMI for making these data series available. pi ­ standard deviation of precipitation totals appertaining t ti wh wp to fixed period i in every year, ­ standard deviation of temperature, ­ standard deviation of temperature appertaining to fixed period i in every year, ­ weight of the humidity factor, ­ weight of the precipitation totals factor, ­ weight of the temperature factor. wt http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Journal of Hydrology and Hydromechanics de Gruyter

Risks Due to Variability of K-Day Extreme Precipitation Totals and Other K-Day Extreme Events

Loading next page...
 
/lp/de-gruyter/risks-due-to-variability-of-k-day-extreme-precipitation-totals-and-kYpwQVfY6w

References (12)

Publisher
de Gruyter
Copyright
Copyright © 2009 by the
ISSN
0042-790X
DOI
10.2478/v10098-009-0022-0
Publisher site
See Article on Publisher Site

Abstract

J. Hydrol. Hydromech., 57, 2009, 4, 250­263 DOI: 10.2478/v10098-009-0022-0 PAVOL BRUNOVSKÝ, MILAN LAPIN, IGOR MELICHERCÍK, JÁN SOMORCÍK, DANIEL SEVCOVIC Faculty of Mathematics, Physics and Informatics, Mlynská dolina, SK-842 48 Bratislava, Slovak Republic; Mailto: brunovsky@fmph.uniba.sk, milan.lapin@fmph.uniba.sk, igor.melichercik@fmph.uniba.sk; jan.somorcik@fmph.uniba.sk, igor.sevcovic@fmph.uniba.sk Several alternative definitions of extreme events are proposed. As the first step a statistical analysis of daily precipitation measurement time series from the Hurbanovo SHMI Observatory and elaboration of potentially dangerous precipitation events is carried out. Then, combined characteristics based on daily temperature, daily air humidity and daily precipitation totals are computed. The drought index based on normalized deviations from long-term averages is defined. Alternatively, to define extreme events "Data envelopment analysis" (DEA) is employed with K-day periods of values of temperature, humidity and precipitation corresponding to decision making units. In this paper we have used the period of K = 10 days for both methodologies for identification of extreme events. The results of all definitions of extreme events are compared. KEY WORDS: Extreme Events, Precipitation, Humidity, Temperature, Drought Index, DEA Analysis. Pavol Brunovský, Milan Lapin, Igor Melichercík, Ján Somorcík, Daniel Sevcovic: RIZIKÁ SPÔSOBENÉ VARIABILITOU K-DENNÝCH EXTRÉMNYCH ÚHRNOV ZRÁZOK A INÝMI EXTRÉMNYMI UDALOSAMI. J. Hydrol. Hydromech., 57, 2009, 4, 11 lit., 17. obr., 2 tab. V clánku navrhujeme niekoko definícií extrémnych udalostí. Ako prvý krok je vypracovaná statistická analýza denných úhrnov zrázok z observatória SHMÚ v Hurbanove, na základe ktorej oznacujeme extrémne udalosti. Následne pocítame kombinované charakteristiky období sucha zalozené na denných údajoch teploty, vlhkosti vzduchu a denných úhrnoch zrázok. Index sucha je zalozený na normalizovaných odchýlkach od dlhodobých priemerov. Alternatívne definujeme extrémne udalosti na základe DEA analýzy, kde K-denné periódy teploty, vlhkosti a zrázok slúzia ako rozhodovacie jednotky. V tomto clánku sme na identifikáciu extrémnych udalostí pre obe metodológie pouzili periódu K = 10 dní. Výsledky vsetkých prístupov nakoniec porovnávame. KÚCOVÉ SLOVÁ: extrémne udalosti, atmosférické zrázky, vlhkos, teplota, index sucha, DEA analýza. Introduction It is commonly accepted that one of the features of the climatic changes in the past is an increasing number of extreme weather events of various kind ­ droughts, floods, windstorms, etc. But what is an extreme event? For instance, drought cannot be fully characterized by a single quantity. Rather, it is a combination of high temperature, low humidity, low precipitation and possibly other quantities like duration. Extreme events are commonly being characterized by drought indices synthesizing several kinds of data (Klementová, Litschman, 2001). The disad250 vantage of this method is that it involves subjective components, e.g. the choice of weights of the data and the choice of the threshold. The main goal of this paper is to introduce a completely new methodology of the identification of extreme events, the Data Envelopment Analysis (DEA). We compare its results with the results of the drought index approach in the case of long term meteorological data observations obtained from the station in Hurbanovo, Slovakia. We recall that DEA has been widely used in the field of operations research as a tool for the evaluation of "decision making units" (see e.g. Cooper et al., 2000, Knox Lovell, Pastor, 1995, Sevcovic et al., 2001). To our knowledge this is the first application of DEA for the definition of extreme events. Extreme precipitation totals We studied the time series of daily precipitation totals at Hurbanovo from 1901 to 2006. Hurbanovo Observatory (115 m a.s.l.) is a representative meteorological station for the Danubian lowland in Slovakia. It is considered to be one of the best meteorological stations in Central Europe with sufficiently long and good-quality observations since 1871 (Melo, 2005; Lapin, Melo, 2004). Method of extreme one-day and 10-day precipitation totals analysis and its application in Slovakia was presented also in Kohnová et al. (2006), Hlavcová et al. (2008) and Szolgay et al. (2009). Complete data series including precipitation, humidity and temperature from 1951 to 2006 were used in our qualitative and quantitative analysis of the Drought index and DEA methodology for identification of extreme events. Initially we computed yearly averages of daily precipitation totals (Fig. 1). The dots stand for yearly averages of precipitation totals over rainy days (with precipitation total > 0.0 mm) whereas triangles represent yearly averages over all days of particular years. The dots lie visibly above the triangles. The U-shape of the solid line suggests that daily precipitation totals over rainy days during the beginning of the 20th century as well as during several last years were higher than in the middle part. The difference of the solid and dashed lines is depicted in Fig. 2. Here the U-shape is even more visible. Since the series of the yearly precipitation totals seems to be stationary (dashed line in Fig. 1) this means that the numbers of rainy days at the beginning of the past century and in the last years were lower than in between, whereas rainfalls were heavier. Note that the results might be affected by lower quality of observations at low precipitation totals (0.1 to 0.4 mm) before 1920. The above formulated conjecture about the trend of the numbers of rainy days based on Figs. 1 and 2 is in perfect agreement with Fig. 3 showing the number of rainy days. Further, we studied the occurrence of extreme rainfalls. For the one-day precipitation totals the limits were set to 40 mm, 50 mm, 60 mm and 70 mm. These limits were exceeded 38-, 10-, 3- and 3times respectively. The distribution of these extreme events between 1901 and 2006 can be seen in Fig. 4. We note that the highest one-day precipitation total was 88.8 mm on August 20, 1918. The study was extended to two- and three-day precipitations totals (see Fig. 5 and 6). The limits were set to 50 mm, 60 mm, 70 mm and 80 mm in case of two-day precipitation totals (exceeded 37-, 14-, 4- and 3-times respectively) and to 60 mm, 70 mm, 80 mm and 90 mm in case of three-day precipitation totals (exceeded 31-, 11-, 5- and 2-times respectively). Fig. 1. Average one-day precipitation totals during rainy days (dot marks) and during all days (triangle marks). Obr. 1. Priemerné jednodové úhrny zrázok pocas zrázkových (bodky) a vsetkých dní (trojuholníky). Fig. 2. Difference between average one-day precipitation totals during rainy days and during all days. Obr. 2. Rozdiel medzi priemernými jednodovými úhrnmi zrázok pocas zrázkových a vsetkých dní. Fig. 3. Number of rainy days. Obr. 3. Pocet zrázkových dní. Figs. 4, 5 and 6 suggest that the distribution of heavy rainfalls in the period from 1901 to 2006 is rather uniform. The effect of a heavy rainfall can become really serious when it continues to rain in the following days. This motivated us to look at the average 252 number of rainy days after one- two- or three-days with precipitation totals over 40 mm, 50 mm or 60 mm respectively (the average is taken through all extreme events in a particular year). The results are in Figs. 7, 8 and 9. No increasing or decreasing trend can be seen there. Fig. 4. Number of one-day precipitation totals exceeding a given threshold value. Obr. 4. Pocet jednodových úhrnov zrázok presahujúcich danú prahovú hodnotu. Fig. 5. Number of two-day precipitation totals exceeding a given threshold value. Obr. 5. Pocet dvojdových úhrnov zrázok presahujúcich danú prahovú hodnotu. Fig. 6. Number of three-day precipitation totals exceeding a given threshold value. Obr. 6. Pocet trojdových úhrnov zrázok presahujúcich danú prahovú hodnotu. Fig. 7. Average number of rainy days after one-day precipitation totals over 40 mm. Obr. 7. Priemerný pocet zrázkových dní po jednodových úhrnoch zrázok viac ako 40 mm. Fig. 8. Average number of rainy days after two-day precipitation totals over 50 mm. Obr. 8. Priemerný pocet zrázkových dní po dvojdových úhrnoch zrázok viac ako 50 mm. Fig. 9. Average number of rainy days after three-day precipitation totals over 60 mm. Obr. 9. Priemerný pocet zrázkových dní po trojdových úhrnoch zrázok viac ako 60 mm. Drought index To begin we investigated the occurrence of series of K days with very low precipitation totals. We searched for series of at least 5 no-rain days, series of at least 15 days with precipitation totals below 1 mm and series of at least 30 days with precipitation totals lower than 5 mm. Their occurrence is shown in Figs. 10, 11 and 12. Note the U-shape in Fig. 10 what is probably due to the lower number of rainy days at the beginning of the last century and in the last years (see Fig. 3). On the other hand, Figs. 11 and 12 do not show any deviation from the stationary trend: the occurrence of longer drought periods does not seem to have changed over the last century. As a second step we have calculated the combined characteristics based on daily temperature, daily air humidity and daily precipitation. The drought index Dij for the K-day period i (in our calculations K = 10 successive days) in the j-th year we defined by 255 Fig. 10. Number of series of at least 5 no-rain days. Obr. 10. Pocet postupností aspo 5 bezzrázkových dní. Fig. 11. Number of series of at least 15 days with precipitation totals under 1 mm. Obr. 11. Pocet postupností aspo 15 dní s úhrnom zrázok do 1 mm. Dij = wt tij - t - wp pij - p - wh hij - h (1) t , p , h ­ long term (period 1951­2005) averages of temperature, precipitation and humidity respectively, t , p , h ­ standard deviations (period 1951­ 2005) of the temperature, precipitation and humidity respectively, wt , wp , wh ­ the weights of the temperature, precipitation and humidity factors fulfilling where tij , pij , hij are the average values of temperature, precipitation and humidity respectively in the period i and year j, wt + wp + wh = 1 . Fig. 12. Number of series of at least 30 days with precipitation totals under 5 mm. Obr. 12. Pocet postupností aspo 30 dní s úhrnom zrázok do 5 mm. Let us note that the definition of the drought index is similar to Klementová, Litschman (2001) where only temperature and precipitation was taken into account. Furthermore, they used the entire growing season as a period. Instead of the original observation data tij , pij , hij , the drought index employs the normalized variables tij - t pij - p hij - h , , with zero mean and standard deviation equal to 1. This is why there was no reason to weight them by other than equal weights wt = w p = wh = 1/ 3 . Let us emphasize that these normalized variables need not be normally distributed. It turned out from our data that skewness (measuring asymmetry of the distribution) is negative for normalized temperature and humidity (­0.22 resp. ­0.23) and is positive for precipitation (1.87). Kurtosis (measuring fatness of distribution tails) is positive for precipitation (4.62) indicating leptokurtic character of precipitation data (fat tails). On the other hand kurtosis for temperature and humidity is negative (­1.01 resp. ­0.61) indicating platykurtic character of data (thin tails). Notice that different weights would not affect deviations of skewness and kurto- sis from the case of normal distribution. To overcome this difficulty one should incorporate higher statistical moments into the definition of our Drought index. Hereafter we identify the K = 10-day period with the end of the period. The day is supposed to be a drought, if the drought index of the corresponding period is higher than the 95% sample quantile of calculated indices (see Fig. 13). The number of drought days in particular years can be seen in Fig. 14 (left). One can observe that this number is slightly increasing. Monthly distributions of the drought days could be seen in Fig. 14 (right). The month with the highest number of drought days is August. Since for agriculture the growing season is more important than the rest of the year we carried out the same study as above but over precipitation, temperature and humidity data from April to the end of September only. One can observe from Fig. 15 that the results are very similar to the results with complete year benchmark. Recall the standardization (1). Another approach to the standardization of the data from the i-th period in the j-th year is to use long-term averages and standard deviation estimates appertaining to the i-th period. The drought index is then defined as Dij = wt tij - ti ti - wp pij - pi pi - wh hij - hi hi (2) Fig. 13. Histogram of drought indices and the sample 95% quantile. Obr. 13. Histogram indexov sucha a výberový 95% kvantil. Fig. 14. Number of drought days in corresponding years (left) and monthly distribution (right). Obr. 14. Pocty suchých dní v príslusných rokoch (vavo) a mesacné rozlozenie (vpravo). Fig. 15. Growing season (April­September): number of drought days in corresponding years (left) and monthly distribution (right). Obr. 15. Vegetacné obdobie (apríl­september): pocty suchých dní v príslusných rokoch (vavo) a mesacné rozlozenie (vpravo). Fig. 16. Alternative method of indication of drought days (growing season April­September): number of drought days in corresponding years (left) and monthly distribution (right). Obr. 16. Alternatívna metóda urcenia suchých dní (vegetacné obdobie apríl­september): pocty suchých dní v príslusných rokoch (vavo) a mesacné rozlozenie (vpravo). We consider this approach as more natural because it compares weather parameters from a particular period with averages for the same period in different years rather than long-term averages of weather parameters of all periods. We tested our approach on the growing season data. The results of year comparisons are depicted in Fig. 16 (left) and are quite similar to those in Fig. 14 (left) and 15 (left). However, the monthly distribution has changed: as one can see in Fig. 16 (right) also in May and June drought periods occur quite frequently (cf. Fig. 14 (right) and 15 (right)). Identification of extreme events by means of the DEA methodology Data Envelopment Analysis (DEA) has its origin in Operations Research. It is a tool for the evaluation of "decision making units" (DMUs) the efficiency of which is measured by several parameters rather than a single quantity. Its basic aim is to single out efficient DMUs. Those are DMUs that lie on the so-called efficient frontier. This is, by definition, a part of the boundary of the (smallest convex) region filled by the parameter vectors of all the DMUs (see e.g. Cooper et al., 2000). Equivalently, a DMU is efficient in case one can find weights under which it maximizes the weighted sum of parameters. This equivalence is a basic result of the mathematical theory of convex sets. To draw the analogy to extremes of meteorological events just replace DMUs by events and efficient DMUs by extreme events. That is, a K-day period is extreme if and only if there are weights such that it maximizes the weighted sum of the string of data (precipitation, temperature, humidity) over all periods it is compared to. The DEA identification method can be best explained by comparing it to the index method of the previous section. Imagine that for an index with fixed weights one would declare as extreme the event with maximal index. Of course, unless a very unlikely coincidence of the values of the indices would take place, such an event would be unique. Therefore, one has to declare as extreme those events the indices of which exceed a certain subjectively chosen threshold. DEA, on the other hand, does not hold the weights fixed for all events. Rather, the weights are chosen individually for each event. An event o is declared extreme if weights can be found such that the value of the index of o is maximal. Of course, in such a case the number of extreme events will become higher than one. Equivalently, one can phrase this in terms of "combined" events. By a combined event we will understand a fictive event the parameter set of which is a convex combination (the coefficients of which may be interpreted as probabilities) of two or more events. An event o is extreme if there is no combined event dominating o, that is, all parameters of which are more extreme (in our example, temperature is higher, precipitation and humidity are lower). Similarly as in the case of an index with chosen weights, an event is trivially extreme if it dominates all other events. However, an event can be declared extreme also if this is not the case (say, there was no day with extreme temperature but the tempera259 ture was high for an extended period and there was little precipitation). So, what is then the difference to the index method? Unlike the index method, DEA is completely free of subjective components. Because weights are automatically computed for every event individually, there is no need of their advance subjective choice. Further, the selection of extreme events is qualitative, not requiring a choice of a quantitative threshold. In our application we chose the period of K = 10 days. Consequently, an event is characterized by 3K(= 30 in our case) parameters. There are several computation methods for DEA, all of them based on linear programming algorithms. We have employed the weighted "additive model" method used in Sevcovic et al. (2001). It was first described by Knox Lovell and Pastor (1995). In what follows, we briefly describe this model for our particular case. As above, we use the notation t, p, h for temperature, precipitation and humidity, respectively and keep the index j to label all the r periods of the pool, the value j = o being used for the period tested for extreme. In addition, we introduce the upper index i = 1,..., K(= 10 in our case) for the particular day of the period. That is, tji will stand for the temperature during the ith day of the period labeled by j etc. In order to evaluate efficiency of the period labeled by o one solves the normalized weighted additive DEA model, which is described by the following linear program: Find multiplicators j and "slacks" sj ti, sj pi, sj hi so as to minimize K K K - vt sti - v p s pi - v h s hi i =1 i =1 i =1 vt = (1/ t ), v p = (1/ p ), v h = (1/ h ) , t , p , h being the sample standard deviations (period 1951­ ­2005) of temperature, precipitation and humidity respectively. The period labeled o is rated as extreme if the optimal value to (3) is zero, or equivalently, all the slacks vanish. It is not immediately obvious that the events with zero slacks are precisely the extreme events introduced above. Intuitively, slacks measure the extent to which an event is dominated by combined events. Zero slacks mean that the event is not dominated at all. A precise formulation requires knowledge of mathematical duality theory. Further note that this computation method does not yield the choice of weights for the tested period o. Their existence is mathematically proven and can be computed. We skip their computation because it is not important for our purpose. Dry and hot extreme periods In accord with the previous section we define the extreme period as a period with high temperature, low precipitation and low humidity. The number of drought days in corresponding years can be seen in Fig. 17 (left). One can observe that this number is slightly increasing. The highest number of drought days (97) was observed in the year 2003. This is in accordance with the index method (see Fig. 14­16 (left)). The monthly distribution of the drought days could be seen in Fig. 17 (right). The months with the highest number of drought days are July and August. Quantitative comparison of Drought index and DEA methodologies under the constraints r i t j j j =1 i - s ti = to , i + s pi = po , i + s hi = ho , r i p j j j =1 r i h j j j =1 r j j =1 (3) = 1, j 0, j = 1,..., r, s ti 0, s pi 0, s hi 0, i = 1,..., K (= 10) The v's are normalization constants defined by Tab. 1 summarizes extreme events assessed by both methodologies. One can see that 1176 periods were assessed as extreme using alternatively DEA or Drought index methodology and only 383 of them were extreme using both methodologies. Number of DEA extremes (598) is smaller comparing to Drought index extremes (961). One can observe that 383/598 = 64% of DEA extremes are also Drought index extremes and only 383/961 = 39.9% of Drought index extremes are also DEA extremes. The DEA methodology seems to be more conservative for identification of extreme events. Fig. 17. DEA method: number of extreme days in corresponding years (left) and monthly distribution (right). Obr. 17. DEA metóda: pocty extrémnych dní v príslusných rokoch (vavo) a mesacné rozlozenie (vpravo). Tab. 2 contains data for two different periods. The period in the right part of the table (year 1954, days 220­229) was identified as extreme by the DEA methodology and not extreme by the Drought index methodology. The reason is that the Drought index methodology considers only average values of relative humidity, precipitation and temperature from the K = 10-days period. These values are not extreme. On the other hand, one can observe very low humidity, no precipitation and high temperature in the 7-th day of the period. Taking into account these values one may expect a high probability of the wildfire risk and it makes sense to consider this day as extreme. The DEA methodology uses all parameters of all days of the K = 10 days period and therefore is able to identify such an extreme event. T a b l e 1. Numbers of extreme events. T a b u k a 1. Pocty extrémnych udalostí. DEA extremes Drought index extremes DEA and Drought index extremes DEA or Drought index extremes 598 961 383 1176 assessing extreme events in agriculture, where one day extreme events are not so important. T a b l e 2. Example of differently identified extreme events. (H ­ humidity in %, P ­ precipitation totals in tenth of mm, T ­ temperature in °C.) T a b u k a 2. Príklad rozdielne posúdených extrémnych udalostí. (H ­ rel. vlhkos v %, P ­ úhrny zrázok v desatinách mm, T ­ teplota v °C.) Extreme Drought index Extreme DEA 1952, days: 222­231 1954, days: 220­229 Day H 1 62.3 2 58.7 3 53.7 4 56 5 51.7 6 49 7 60.7 8 64.7 9 85 10 64.3 Mean 60.61 P 0 0 0 0 0 0 0 35 2 16 5.3 T 22.2 21.7 25 26 26.8 27.8 25.5 22.1 16.9 21.1 23.51 H P T 58 1 21.6 74.3 0 22.3 74 53 16.4 67.3 0 17.3 65.7 0 17.5 53.3 0 22 38.3 0 26.4 53 136 24.3 82 66 14.6 69.3 0 16.9 63.52 25.6 19.93 Conclusions The period in the left part of Tab. 2 (year 1952, days 222­231) was assessed extreme by the Drought index methodology and not extreme by the DEA methodology. The last row of Tab. 2 indicates that average values of observed parameters are more extreme comparing to the period in the right part of the table (lower humidity and precipitation and higher temperature). Therefore, the period in the right part of Tab. 2 is not considered to be extreme. The average values of humidity, precipitation and temperature are more adequate e.g. when We have studied several methodologies of defining extreme events. The methodologies have been compared using the time series of daily precipitation, humidity and temperature at Hurbanovo from 1901 to 2006. First we have calculated yearly averages of daily precipitation totals. These averages appear to be stationary. Comparing them to the averages over the rainy days only, one can conclude that at the beginning and at the end of the last century the number of rainy days was lower and the rainfalls were heavier. Further we have studied the 261 occurrence of 1­3 days extreme rainfalls. The distribution of heavy rainfalls in the period from 1901 to 2006 is quite uniform. As a second step we have considered the drought index that combines characteristics based on daily temperature, relative air humidity and precipitation. The number of drought days seems to be increasing. Finally, we have used the DEA methodology for the definition of extreme events. To our knowledge this is the first application of the DEA method to the determination of extreme events of any kind rather than efficiency of performance. Unlike the index method, DEA method is not subject to subjective choices of the weights and the threshold but is more computationally involved. The DEA results largely confirm the ones obtained by the drought index: The number of extreme drought periods is increasing over years, their highest occurrence being in July and August. Continuation of this study may contribute to science of disasters (Bunde et al., 2002). Preliminary results indicate that DEA method is more adequate for identifying short-term extreme events, whereas the Drought index methodology should be preferred for long-term ones. Acknowledgments. This research was supported by the European Commission under the 6th framework programme project No. 037063 "GEO-BENE". Some parts of this paper are based on the results of projects VEGA No. 1/4033/07 and observed SHMI data; the authors thank SHMI for making these data series available. pi ­ standard deviation of precipitation totals appertaining t ti wh wp to fixed period i in every year, ­ standard deviation of temperature, ­ standard deviation of temperature appertaining to fixed period i in every year, ­ weight of the humidity factor, ­ weight of the precipitation totals factor, ­ weight of the temperature factor. wt

Journal

Journal of Hydrology and Hydromechanicsde Gruyter

Published: Dec 1, 2009

There are no references for this article.