**1. Introduction**

It is a well known fact that temperatures have been increasing during the last 50 years not only at global level but also at specific locations all over the world. In this paper we examine the statistical properties of nine time series corresponding to the snowpack percentages in watersheds in Colorado, US. Using annual data dating back to 1937, we are interested in the long memory feature of the data along with the time trend coefficients to check if the snowpack percentages have been declining in the last eighty years as a consequence of the effects of global warming. In addition, we test this hypothesis under the assumption that the underlying series display a long memory property, a feature that is very common in climatological data. As far as we know there are no previous works dealing with the statistical modeling of snow packs with time series data. Our results, using fractional integration, show no evidence of long memory, and the time trend coefficients of the snow packs are statistically significantly negative in the majority of the series examined, supporting thus the global warming hypothesis.

The standard approach to test for significant trends in time series is to consider a linear regression model of the following form:

$$
\mathbf{y}\_t = \mathbf{a} + \beta \mathbf{t} + \mathbf{x}\_t,\tag{1}
$$

where a significant slope coefficient for β implies the presence of a trend (positive or negative, depending on the sign of the coefficient). However, in order to get consistent estimates of the unknown coefficients in (1), this set-up implicitly assumes that the error term, xt must be well behaved and more specifically, it must be integrated of order 0 or I(0). This is a standard regularity condition and indicates that the infinite sum of its autocovariance values should be finite. This condition, however, is not always satisfied. If that sum is infinite, the series is said to be long memory, a feature widely observed in time series in many different disciplines including geophysical and climatological series, e.g., Beran [1], Percival et al. [2], Gil-Alana [3, 4], Ercan et al. [5], Graves et al. [6], etc.

In this article, this long memory feature is incorporated in our set-up by means of using a fractional integration model, which is described in the following section, and that is used to describe the error term x(t) in (1). Based on that, we test for the presence of significant time trends in the snowpack percentages at Colorado's watersheds.

The rest of the paper is structured as follows. Section 2 briefly describes the main idea of long memory or long range dependence processes and also presents the series under examination. Section 3 is devoted to the empirical results, while Section 4 concludes the manuscript.

## **2. Methodology and data: long memory**

Given a covariance stationary process {x(t), t = 0, �1, … } we say that it is short memory or integrated of order 0 (and denoted as x(t) � I(0)) if the infinite sum of its autocovariances, defined as γ(u) = Cov(x(t), x(t + u)) is finite, i.e.,

$$\sum\_{u = -\infty}^{\infty} \gamma(u) < \infty. \tag{2}$$

Within this category, we can include the white noise process but also the standard stationary AutoRegressive Moving Average (ARMA) type of models. This latter category allows for some type of dependence between the observations and is named "weak" (dependence) in the sense that the autocorrelations decay exponentially fast. However, many time series show higher degrees of dependence and belong to a category denominated as "long memory", characterized because the infinite sum of the autocovariances is infinite, i.e.,

$$\sum\_{\mu=-\infty}^{\infty} \gamma(\mu) = \infty. \tag{3}$$

This long memory feature has been observed in time series data referring to many different disciplines, including economics and finance [7–9], energy [10–13], tourism [14, 15], environmental issues [16] and climatology [3, 17–19] among many others.

A very simple model, very popular among econometricians, and satisfying the above property (3) is the fractionally integrated or I(d, d > 0) model, which is expressed as:

$$\left(\left(1-L\right)^{d}\mathbf{x}(t) = u(t), \quad t = 1, 2, \ldots,\tag{4}$$

where L is the lag-operator, ie., Lk x(t) = x(t-k), d can be any real positive value, and u(t) is I(0) or short memory as defined above. In this context, x(t) displays the property of long memory if d > 0. Using a Binomial expansion, the polynomial in the left-hand side in (4) can be expressed as:

*Time Trends and Persistence in the Snowpack Percentages by Watershed in Colorado DOI: http://dx.doi.org/10.5772/intechopen.95911*

$$(\mathbf{1} - L)^d = \mathbf{1} - dL + \frac{d\left(d - \mathbf{1}\right)}{2}L^2 - \frac{d\left(d - \mathbf{1}\right)\left(d - \mathbf{2}\right)}{6}L^3 + \dots \tag{5}$$

and x(t) in (4) can then be expressed as:

$$\mathbf{x}(t) = d\mathbf{x}(t-1) - \frac{d\left(d-1\right)}{2}\mathbf{x}(t-2) + \frac{d\left(d-1\right)\left(d-2\right)}{6}\mathbf{x}(t-3) + \dots + \mathbf{e}(t) \tag{6}$$

and higher the value of d is, the higher the level of association between the observations is. A wide range of possibilities can be examined depending on the value of d in the real range. Examples are.


This specification is clearly more general than the standard methods used in the literature and that are based only on integer degrees of differentiation, i.e., d = 0 for stationarity and d = 1 for unit root or nonstationarity.

The series examined refer to the snowpack percentages in seven watersheds in Colorado, US, (see, **Figure 1**) namely, Arkansas, Colorado, Gunnison, North Platte, Rio Grande, San Juan-Animas-Dolores-and-San Miguel, South Platte, Yampa &

White and Colorado Statewide, using annual data. The starting year changes from one watershed to another and all of them end in 2019. The source of the data is the USDA (United States Department of Agriculture), Natural Resources Conservation Service Colorado (https://www.nrcs.usda.gov). All values use medians calculated from the most recent normal period between 1981 and 2010 For each basin we use data for the months of March, April and May, which are the months that present complete datasets for the whole period examined.

**Table 1** displays the time series examined along with the acronym and the starting and ending years for each series. We observe that the long series are those referring to North Platte and Colorado Statewide (with data starting in 1937) and followed by South Platte and Colorado (starting in 1938), while the shortest one refers to San Juan, Animas, Dolores and San Miguel, starting in 1973.

The objective in this paper is twofold. First, to determine if long memory holds in the snowpack data examined, and second, to see if there is a decline in the time evolution of the data as a consequence of climate warming. Thus, in order to examine these two issues together, we consider the following model

$$y(t) = a + \beta t + \varkappa(t); \ (1 - L)^d \varkappa(t) = u(t),\tag{7}$$

where y(t) refers to the observed data, the snowpack percentages, α and β are unknown parameters referring respectively to a constant and a linear time trend, and the regressions errors x(t) are supposed to be I(d). Therefore, there are two main parameters in the above specification: β, related with the evolution over time of the series, and d, dealing with the short/memory feature of the data.

In the following section we display the estimated coefficients of d in Eq. (7) (along with their corresponding 95% confidence intervals) under three potential set-ups. The results presented in the second column in the table refer to the case where we impose α = β = 0 in (7); thus, no deterministic terms are included in the model. The results in column 3 refer to the model with an intercept, i.e. imposing β = 0 in (7). Finally, the last column reports the results where α and β are estimated from the data along with d.

We have marked the most appropriate case for each series in **Table 2** in bold. This selection has been made based on the significance of the estimated coefficients in the d-differenced process, noting that the two equations in (7) can be expressed in a single one as:

$$
\tilde{y}(t) = a\tilde{\mathbf{1}}(t) + \beta \tilde{t}(t) + u(t), \tag{8}
$$


#### **Table 1.**

*Time series under examination.*

$$
\tilde{\boldsymbol{y}}(t) = (\mathbf{1} - L)^d \boldsymbol{y}(t), \tag{9}
$$

$$
\tilde{\mathbf{1}}(t) = (\mathbf{1} - L)^d \mathbf{1},\tag{10}
$$

$$
\tilde{t}(t) = (\mathbf{1} - L)^d t,\tag{11}
$$



*The values in parenthesis refer to the 95% confidence bands; in bold, the selected model for each series according to the deterministic terms.*

#### **Table 2.**

*Estimates of the differencing parameter.*


#### **Table 3.**

*Estimated coefficients of the selected models.*

*Time Trends and Persistence in the Snowpack Percentages by Watershed in Colorado DOI: http://dx.doi.org/10.5772/intechopen.95911*

**Table 3** displays the estimated coefficients of the selected models in **Table 2**. If we first focus on the values of d, we observe that there is no evidence of long memory behavior in any single case, since the confidence intervals include the value 0 in all cases examined. There are 21 series where the I(0) hypothesis of short memory cannot be rejected and in the remaining 6, anti-persistence (d < 0) is detected. The series showing anti-persistence are Colorado (April), Rio Grange (March) and San Juan … and Yampa & White (in March and April). Antipersistence is not as frequent as persistence, though some examples can be found in the literature [20, 21]. Anti-persistent behavior exhibits prolonged damped oscillations with the spectral density function showing a zero value at the origin [22].

If we look now at the time trend coefficients the first noticeable feature is that all the significant coefficients are negative, supporting the hypothesis of a decline in the snowpack percentage in a number of cases. The highest coefficients refer to the cases of San Juan, Animas, Dolores and San Miguel in May (with an estimated time trend coefficient of 2.556) and in April (1.005). Other high significant negative coefficients are those of Yampa & White (in May, 0.663, and April, 0.441) and San Juan … in March (0.461). All these cases support the hypothesis of a decreasing trend in the snow packs in various Colorado's watersheds, which might be a consequence of the global warming climate hypothesis.

As a robustness method, we also employ alternative parametric and semiparametric methods of estimating the differencing parameter in the context of fractional integration, including among other Sowell's [23] maximum likelihood estimation method, the classical semiparametric Geweke and Porter-Hudak's [24] approach and the most recent developed method in Shimotsu [25] and the results support our conclusions in all cases, finding evidence of short memory and negative time trend coefficients in most of the series examined.
