Research Article Volume 7 Issue 3
University Francisco de Vitoria, Spain
Correspondence: Prof. Luis A Gil-Alana, Universidad Francisco de Vitoria, UFV, Madrid, Spain
Received: July 01, 2023 | Published: July 17, 2023
Citation: Luque RAC, Gil-Alana LA. Long memory of river streams in the canal of Panama watershed. Int J Hydro. 2023;7(3):130-135. DOI: 10.15406/ijh.2023.07.00348
Study region: It refers to the time series analysis of streamflow patterns in the six principal rivers of the Canal of Panama.
Study focus: Using recent time series modeling techniques in fractional or I(d) frameworks, it offer an insight into the hydrology of the Panama Canal watershed.
Objectives: The main objective is to determine if the river flows at the Canal of Panama watershed display long memory, and based on that, investigate if significant trends are present across time.
Novelty: The use of updated time series techniques in the analysis of long memory and time trends in river flow data.
New hydrological insights for the region: The results first indicate that long memory is present in all cases, with no significant trends whatsoever. While some series display evidence of seasonality, they are all mean-reverting. The implication of the results obtained is that climatological or hydrological shocks affecting these river streamflows may take a longer time to disappear.
Keywords: canal of Panama, river streams, long memory, persistence
JEL Classification: C22; C55; Y90.
MB, Mitochondrial diseases; mtDNA, mitochondrial DNA; MRI, magnetic resonance imaging; GTCS, generalized tonic-clonic seizure; HSV, herpes simplex virus; ENMG, electroneuromyography
In this article we investigate the long range dependence property in six river streams of the Canal of Panama watershed. This property, that produces high levels of dependence across data which are far away in time, has been proven to be present in many hydrological data since the seminal work by the famous hydrologist Harold E. Hurst. Hurst1 examined the Nile river flows for the time period 641–1946, and noticed that long periods of dryness were followed by long periods of yearly returning floods, supporting this long term persistence effect. He then examined 600 time series that covered 75 different geophysical phenomena such as rainfall, atmospheric pressure, temperature etc., observing a similar pattern in all them. Since then, the stochastic modelling of river flows has produced numerous methods to explore the long memory or long range dependence property of the data. This long memory property is important because ignoring it may produce incorrect estimates in the remaining parameters of the model and thus erroneous conclusions about its behaviour. In this article, we focus on the rivers around the Canal of Panama watershed to examine the existence of time trends in the context of strong dependence models.
The Panama Canal is a man-made construction that connects the Atlantic and Pacific oceans, being one of the two most important strategic artificial waterways in the world. The Panama Canal watershed is the most important area of the country for Panama given the multiple uses it provides such as the continuous inter-oceanic navigation and generation of electric energy. The Canal watershed's main sub-basins include the Gatun reservoir and the Alajuela reservoir into which six principal rivers flow. The reservoirs are mainly used to store water needed for consumption by the population and the operation of the Canal. Making them one of the primary sources of water for the country.
There are many research projects which have studied the watershed, mostly with regard to contamination, dynamics of precipitation, physical controls, water management, and many more have focused in the sector of hydrology. Some of these issues are included in the Quadrennial report on regional progress and challenges in relation to the 2030 Agenda for Sustainable Development in Latin America and the Caribbean.2 Some papers that study streamflows inside the Canal watershed utilize a time series method focused on deseasonalized anomalies and they use basic spectral techniques to study their relationship with El-Niño-Southern Oscillation (ENSO) and climate change in the region.3 On the other hand, the Canal of Panama Authority conducted a research based on sequential analysis technique (CUSUM), in which they analyzed the degree of deviation concerning the average of the streamflow series, observing positive tendencies, negative and stable periods from data taken inside the years 1934-2009, concluding no general tendency for the increase or decrease of the streamflows. Other studies reporting issues with respect to the Panama watershed include Harmon et al.,4–8 etc. Nevertheless, none of these studies takes into account the potential long memory feature of the river flows at the Canal, which may produce biased results in terms of these trends.
Focusing on this perspective, we find only scarce literature based on streamflow time series for the case of the Canal Watershed. There are various studies that investigate long-memory streamflow properties for the Taiţa River (Rumania), the River Nile (Africa), and most of Canada’s major hydrologic regions.9–11 Since streamflow is a water volume and velocity variable, there are various methods that deal with the variability of streamflow series exhibiting long-memory,12,11 clearly based on the early works of Hurst1 where many time series data in diverse fields of application, including hydrology, exhibited the property of long memory or long range dependence.
In this research work, we investigate the persistence or long memory feature within monthly streamflows from the six principal rivers of the Canal Watershed, by using fractional integration methods which address the degree of dependence in the data.13 This approach (fractional integration) is only one of the many models that satisfy the long memory feature but is very appropriate since throughout a single parameter (the differencing parameter, d) we can verify if the series of interest display such a property. The study utilizes monthly data provided by the Panama Canal Authority (ACP), between the periods of 1960 and 2020 to investigate the recent long memory dynamics and time trends present in the six main rivers of the Canal Watershed. The long-memory feature refers to a not negligible dependence between distant observations in time12 and is a feature that is present in most of the climatological and geophysical time series.14–17 Furthermore, the purpose of the paper is also to examine if time trends are present in the data using updated methods as proposed in Gil-Alana et al.18 and Awe and Gil-Alana19 among many others. Note that ignoring this long memory property can produce erroneous conclusions about the existence of time trends in the data.
The objectives of the paper are thus first to examine if long memory is present in the data, and based on this, investigate if the data display significant positive or negative trends across time. Our main results can be summarized as follows: evidence of long memory is found in the six rivers examined, however significant time trends were not found in any single case; high levels of strong dependence were found in the cases of Trinidad and Ciri Grande rivers while the lowest correspond to Boqueron and Pequeni. The rest of the paper is structured as follows: In Section 2, we present the materials and methods, including the area of study, the data and the methodology used in the paper. Section 3 displays the main results and the empirical analysis, while Section 4 renders the concluding remarks.
Study area
The Panama Canal Watershed has a surface of 3338 km² (Figure 1), which encompasses the two major sub-basins mentioned earlier, the Chagres, Pequeni, Boqueron, and Indio Este (not included in the study) rivers flow into the Alajuela reservoir with an area of 1026 km², while the rivers Trinidad, Gatun, and Ciri Grande makes their course for the Gatun lake compromising an area of 2312 km². Rio Chagres, Rio Pequeni, Rio Boqueron, Rio Gatun, and Rio Indio Este possess largely forested, steeply-sloping sub-watersheds on the north side of the Canal, while Rio Trinidad and Rio Ciri Grande are located at the south-west side of the watershed with mostly deforested areas.7 These river flows are affected especially by precipitation increases, the rainy season of Panama has a long duration, from May to November20,21 and the dry season typically occurs from December to April,22,23 precipitation is twice as high on the Atlantic coast side (Bahia de Panama) as on the Pacific coast (Mar Caribe) side with a difference of 4,000mm/year compared to 1,800mm/year.7
Figure 1 Canal of Panama Watershed area highlighted with a red line. While each river basin is highlighted with a specific color seen above. (Source: Hydrological Report, ACP, 2019)
Data
The data employed in this work were provided by the Panama Canal Authority through the department of Hydric Resources, which include monthly streamflows from six hydrometric stations located on the watershed principal rivers Rio Chagres, Pequeni, Boqueron, Gatun, Trinidad, and the Rio Ciri Grande. The data from January 1960 and onwards from the streams is measured in cubic meters per second (m³/s), which is acceptable for most practical purposes when calculating water flow. As the data are raw, there is a possible appearance of a trend and seasonal effect, which can be explained because the rivers volumetric flow rate is affected by several regional hydrological cycles that are determined by the climate conditions, such as water supply (precipitation) and energy availability (potential evapotranspiration), and regulated by corresponding underlying characterizations, such as vegetation, soil and geography.24 Figure 2 displays plots of the six time series in which we can observe a clear seasonal pattern, while we can also observe a stationary pattern for most of the streamflow series.
Figure 2 Time series plots for the streamflow data (m³/s). Vertical axis of the plat is the amount of water in cubic meters per second, while horizontal axis is the years in monthly time frequency. The horizontal axe refers to the years, while the vertical axe refers to cubic meters per second.
Methodology
Long memory is defined in the time domain in terms of decay rates of long-lag autocorrelations; in other words, it is a feature that indicates a high level of dependence between observations that are far apart in time. As mentioned before, we will use a long range dependence approach that is based on integrational processes or I(d) models, which have been widely used in the analysis of hydrological and climatological data.25–27,18,19 Long memory structure in a covariance stationary process may be characterized via an autocovariance sequence, or by its spectral density function. More in detail, we say that Yt is a stationary long memory process if there exists a real number -1 < β < 0 and a constant Cs > 0 such that
(1)
being the autocovariance sequence and the differences in time.281 Various methods are available to detect the existence of long-memory. Some of them are parametric, while others are semiparametric or even non-parametric. In this work we use a parametric approach based on fractional integration, estimating the fractional differencing parameter d, an approach widely used when modeling persistence in hydrological time series.
An I(0) process ut, t = 0, ± 1, … , is defined as a covariance stationary process with spectral density that is finite and positive at the zero frequency. Then a process, {xt , t = 0, ±1, … , } which is integrated of order d, and expressed as I(d) is described by the following equation:
(2)
where is an I(0) process; is the backshift- (or lag-) operator, while d is any real number playing a very important role in determining the level of association between and its lagged values. By allowing d to be any real value, it can also be fractional and cover the standard cases of nonstationarity unit roots I(1) and stationarity I(0) if the differencing parameters are equal to 1 and 0 respectively. The polynomial in Eq. (2) can be expanded in terms of a binomial representation, such that
(3)
and thus
(4)
implying that Eq. (1) can be represented as
. (5)
Remembering that d is an indicator of the level of dependence of the data, in the sense that the higher the value of d is, the higher the degree of the association will be between the observations. When 0 < d < 0.5, the spectral density function has a singularity at 0, displaying the property of “long memory”, because the process exhibits slowly decaying autocovariance as originally introduced by Granger and Joyeux.29 These processes have been widely employed in recent years in the analysis of hydrological data.30–33,12
We estimate the differencing parameter d in equation (2) using the Whittle function, which is basically an approximation to the likelihood function and use a frequency domain version of it as in Dahlhaus34 alongside a testing procedure developed by Robinson35 which is based on the Lagrange Multiplier (LM) principle. This method tests the null hypothesis,
. (6)
for any real value do, in the following set-up:
(7)
where zt is a set of non-stochastic terms and xt are the errors in the regression model which are supposed to be I(d). Robinson35 showed that the test statistic follows asymptotically a N(0,1) distribution, independently of the value of do and the choice of the deterministic terms. The specific functional form of this method can be found in Gil-Alana and Robinson.36
1Alternatively, long memory is defined in the frequency domain if its spectral density function goes to infinity at some frequencies in the interval [0, π).
We consider the model described by equation (7) with zt = (1, t)T to allow for a constant and a linear time trend, so the estimated model is the following one:
(8)
where refers to each time series we observe, β0 and β1 refer respectively to the constant and a linear time trend, and the regression errors, are integrated of order d or I(d) such that the short memory errors follow a seasonal (monthly) AR(1) model, with as a white noise process. Anticipating the results, the time trend is not required in any single case, the intercept being sufficient to describe the non-stochastic part. The estimates of d are all in the (0,1) interval, implying long range dependence (d > 0) and fractional integration with a mean reverting pattern.
Table 1 reports the estimated values of d in three scenarios: with no regressors, with an intercept, and with an intercept and a linear time trend. Bold numbers in the table indicate the most significant model according to these non-stochastic terms. It is observed that evidence of stationarity (d < 0.5) is found in the cases of the Boqueron, Pequeni, Chagres and Gatun rivers, while for the Ciri grande and Trinidad, the confidence bands include both stationary and nonstationary values. While the time trend is found to be statistically insignificantly different from zero in all cases, we can see that in all cases that there is evidence of mean reversion (d < 1). We observe similar patterns in all cases for the six principal rivers, where long memory is present, given that the estimated value of d is significantly positive, rejecting the I(0) null hypothesis of short memory (d = 0). Table 2 displays the coefficients for the selected estimated model in each river streamflow, showing that the seasonal component is higher for the cases of the Trinidad and Gatun, followed by the Ciri Grande. Nevertheless, the coefficients and test statistics within the Chagres river should be taken with caution given that the river streamflows are affected by the Gatun Dam, which can alter the low flow regime and reduce peak flows.37
Series |
No terms |
An intercept |
A linear time trend |
Chagres |
0.35 (0.26, 0.44) |
0.32 (0.24, 0.42) |
0.32 (0.24, 0.42) |
Pequeni |
0.21 (0.12, 0.31) |
0.18 (0.10, 0.27) |
0.18 (0.10, 0.27) |
Boqueron |
0.18 (0.09, 0.28) |
0.16 (0.08, 0.25) |
0.16 (0.08, 0.25) |
Gatun |
0.37 (0.28, 0.48) |
0.36 (0.27, 0.46) |
0.36 (0.26, 0.46) |
Trinidad |
0.52 (0.44, 0.63) |
0.51 (0.42, 0.62) |
0.51 (0.42, 0.62) |
Ciri grande |
0.48 (0.39, 0.58) |
0.46 (0.37, 0.56) |
0.46 (0.37, 0.56) |
Table 1 Estimated values of the differencing parameter d
The values in parenthesis are the 95% confidence band of the non-rejection values of d. In bold, the selected specification for each series according to the deterministic terms. Column 2 refers to the case with no deterministic terms. In Column 3 we report the results in a model including an intercept, while in Column 4, the model includes a constant and a linear time trend.
Series |
d (95% band) |
Intercept (tvalue) |
Seasonal AR coef. |
Chagres |
0.32 (0.24, 0.42) |
30.8487 (6.94) |
0.378 |
Pequeni |
0.18 (0.10, 0.27) |
13.5342 (14.15) |
0.347 |
Boqueron |
0.16 (0.08, 0.25) |
7.5011 (13.82) |
0.32 |
Gatun |
0.36 (0.27, 0.46) |
6.8055 (3.99) |
0.523 |
Trinidad |
0.51 (0.42, 0.62) |
5.8822 (2.41) |
0.524 |
Ciri grande |
0.46 (0.37, 0.56) |
8.6601 (3.06) |
0.515 |
Table 2 Estimated coefficients of the selected model for each series
The second column reports the estimated values of the differencing parameter (and its 95% confidence band) under the most appropriate specification; column 3 reports the intercept and the corresponding t-values, while column 4 displays the seasonal AR coefficient.
As mentioned before, streamflow is mainly affected by precipitation and evapotranspiration, creating an extensive effect on water flows of these rivers. The department of Hydric Resources and Water Quality division is responsible of monitoring and controlling the water flows in these regions. Furthermore, The ACP research on the study behavior of these streamflows in 2009 composed data from our six stations that included years from 1934-2009. Identifying certain tendencies in the stations, separating them between different periods in time (e.g., 1934-1956; 1957-1965) concluding that conducting a study between certain periods in time will lead to a positive, negative, and neutral tendencies inside those estimated periods. They applied statistical differences with the following simple formula:
(9)
being the average annual streamflow with respect to year t, representing the annual streamflow historic average, while refers to the difference in streamflow contribution in contrast to the annual streamflow historic average.
Table 3 is a descriptive table (taken and updated from ACP Water and Jungle report, pg.12, 2009), in which we can see an updated summary from tendencies and differences in annual streamflow contributions from the six river flow stations.
Station |
Average Anual streamflow, m3/s (1934- 2020) |
Period |
Average Anual Streamflow, m³/s |
Difference of contribution (%) |
Tendency |
Los Cañones (Rio Ciri Grande) |
9.58 |
1934-1956 |
10.65 |
+11.2 |
|
1957-1965 |
8.33 |
-13 |
Increasing |
||
1966-1981 |
10.35 |
+8.1 |
Decreasing |
||
1982-1997 |
8.32 |
-13.2 |
Increasing |
||
1998-2009 |
9.48 |
-1 |
Neutral |
||
2010-2020 |
9.17 |
-4.3 |
Decreasing |
||
El Chorro (Rio Trinidad) |
6.77 |
1934-1956 |
7.51 |
+10.9 |
Increasing |
|
|
1957-1965 |
5.64 |
-16.8 |
Decreasing |
1966-1981 |
7.23 |
+6.7 |
Increasing |
||
1982-1997 |
5.97 |
-11.9 |
Decreasing |
||
1998-2009 |
6.83 |
+0.9 |
Neutral |
||
2010-2020 |
6.6 |
-2.6 |
Neutral |
||
Ciento (Rio Gatún) |
6.96 |
1934-1956 |
7.18 |
+3.2 |
Neutral |
1957-1965 |
8.25 |
+18.4 |
Increasing |
||
1966-1981 |
7.04 |
+1.1 |
Neutral |
||
1982-1997 |
6.63 |
-4.8 |
Neutral |
||
1998-2009 |
6.45 |
-7.4 |
Decreasing |
||
2010-2020 |
6.39 |
-8.3 |
Decreasing |
||
Peluca (Rio Boqueron) |
7.64 |
1934-1956 |
8.05 |
5.5 |
Increasing |
1957-1965 |
7.24 |
-5.2 |
Decreasing |
||
1966-1981 |
7.35 |
-3.8 |
Neutral |
||
1982-1997 |
7.64 |
+0.1 |
Neutral |
||
1998-2009 |
7.74 |
+1.3 |
Neutral |
||
2010-2020 |
7.4 |
-3.2 |
Neutral |
||
Candelaria (Rio Pequeni)* |
13.55 |
1960-1973 |
14.16 |
+5 |
Increasing |
1974-1991 |
12.91 |
-5 |
Decreasing |
||
1992-2009 |
13.95 |
+3 |
Neutral |
||
2010-2020 |
13.17 |
-3 |
Neutral |
||
Chico (Rio Chagres) |
30.91 |
1934-1956 |
32.6 |
+5.5 |
Increasing |
1957-1965 |
26.43 |
-14.5 |
Decreasing |
||
1966-1981 |
29.68 |
-4 |
Neutral |
||
1982-1997 |
30.04 |
-2.8 |
Neutral |
||
1998-2009 |
33.58 |
+8.7 |
Increasing |
||
2010-2020 |
31.18 |
+0.9 |
Neutral |
Table 3 Descriptive statistics and summary of tendencies observed
From Table 3 we observe strong patterns inside certain specific periods, for Rio Ciri Grande and Trinidad we have an increasing, decreasing, increasing, and a neutral tendency for specified periods in time, which can be explained due to seasonality as the previous statistic model suggested. As long memory is present in all six river stations, there is strong persistence (long memory) that can affect the categorized tendencies. This is very important because the discharge of these rivers affects the water levels of the Gatun and Alajuela lakes, whose main use is for the transit of commercial ships (Gatun) and provision of drinking water for 55% of the population in Panama (Alajuela) (IDB, 2020). Gatun Lake receives its water from precipitation and streamflows of the Trinidad, Gatun, and Ciri Grande rivers, which suffer from the highest seasonal components and long memory, meaning that shocks in these three river flows may take a longer time to disappear even though their streamflows are mean-reverting. On the other hand, streamflows of the Chagres, Pequeni, and Boqueron rivers end up in the Alajuela lake, whose annual water contributions are lower than Gatun lake, which is explained logically because of their huge difference in surface area and the Madden Dam that impounds the Chagres river. Although the rivers on this side of the watershed tend to have lower water contributions, seasonality and long memory also affect them, with strong implications especially for the Chagres river which is dammed twice, resulting in the reservoir to Gatun Lake and Alajuela, forming one of the principal sources of regulation for navigation runoff for the operation of the Panama Canal.
Therefore, if a climatological shock like the phenomenon of El Niño-Southern Oscillation (ENSO)2 is present, it can have a major effect on the streamflow of the rivers with a higher degree of persistency, and a lower effect for those which possess short memory (d = 0). Even though all our river flows are mean-reverting (d < 1), the annual average streamflows with respect to the historic average annual streamflow from 2015 has been negative for most of these rivers (Appendix), which is no surprise as long memory is present (d > 0) in all of our studied rivers implying that shocks caused by climatological and hydrological factors may take longer to disappear completely.38–44
2The ENSO is a warming of the ocean surface, or above-average sea surface temperatures in the central and eastern Tropical Pacific Ocean. This affects global climate and can lead to intense storms in some places and droughts in others. Source: https://www.weather.gov/mhx/ensowhat
The fractional integrated techniques introduced here provide a method to detect the presence of long memory in time series. The model allows for both long-memory and short-memory non-seasonal and seasonal components such as persistence, time trends, and seasonality. In this article, the long-memory properties in streamflows at the Canal Watershed have been investigated in six river stations, providing streamflow data from the Chagres, Pequeni, Boqueron, Ciri Grande, Trinidad, and Gatun rivers. We observe that the distribution of streamflows for Trinidad, Gatun, and Ciri Grande offers the highest seasonality, nevertheless, there is no time trend found in any of the series. Dealing with the order of integration of the series, the results indicate that long memory or high persistence is found in all cases, with Gatun, Chagres, Trinidad, and Ciri Grande being the highest while short memory or I(0) cannot be found in any case, all series displaying mean-reverting patterns. Although no time trends were found in our study, there exist certain tendencies for specific periods in time. Thus, for example,
The instruments utilized in this study can provide explicit information to policymakers regarding changes in future hydrological patterns in these regions. Finally, other lines of research still based on long memory processes can be implemented in these series. Thus, for example, the possibility of non-linear structures or structural breaks are relevant issues that may be worthy of our attention since several authors have argued that fractional integration may be an artificial artifact generated by the presence of breaks which have not been taken into account. Non-linear deterministic structures like those based on Chebyshev’s polynomials in time proposed by Cuestas and Gil-Alana (2016) or Fourier transforms (Gil-Alana and Yaya, 2020) or even neural networks as in Yaya et al. (2021) can be implemented in these and other data.
The authors gratefully acknowledge the provision and support of data by the Panama Canal Administration.
The author declares there is no conflict of interest.
©2023 Luque, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.