Application of a Long Short-Term Memory neural network for forecasting meteorological drought in Al Hoceima Province, Morocco

Ahmed Zian; Karim EL Moutaouakil; Lahcen Malghi; Abderrahim Boulanouar; Ahmed El Bakori

doi:10.26491/mhwm/218885

Upcoming papers

Stats

CC BY-NC 4.0

Get citation

ORIGINAL PAPER

Application of a Long Short-Term Memory neural network for forecasting meteorological drought in Al Hoceima Province, Morocco

Ahmed Zian ¹

Karim EL Moutaouakil ²

Lahcen Malghi ²

Abderrahim Boulanouar ³

Ahmed El Bakori ⁴

More details

Hide details

Laboratory of Engineering Sciences and Applications, Department of Civil Engineering, Water and Environment, Energy and Renewable Energy, National School of Applied Sciences of Al Hoceima, BP 03, Ajdir Al-Hoceima, Abdelmalek Essaadi University, Morocco

Laboratory of Engineering Science, Mathematics Department, Polydisciplinary Faculty of Taza

Laboratory of Applied Sciences Modeling, Optimization and Structural Dynamics in Civil Engineering, National School of Applied Sciences of Al Hoceima, Abdelmalek Essaadi University, Morocco

Laboratory of Natural Resources and Sustainable Development (RN2D), Faculty of Sciences, Ibn Tofail University, Kenitra, Morocco

Corresponding author

Ahmed Zian

DOI: https://doi.org/10.26491/mhwm/218885

Article (PDF)

References (31)

KEYWORDS

meteorological drought

TOPICS

ABSTRACT

Drought assessment and forecasting are critical components of water resource management. Few studies have assessed and forecast meteorological drought in the Al Hoceima province. Given the region’s infrequent rainfall, where precipitation deficit is the main driver of water scarcity, the Standardized Precipitation Index (SPI) provides a sufficient basis for analysis. Analysis of the SPI index shows that there were more than 30 years of drought during the 42-year period (1975-2016), when the SPI index was below –1. The 1980s were the most severely affected by drought. However, the durations of extreme drought varied, with 10, 9, and 7 years recorded at all stations except for the Targuist station, which recorded only 4 years of drought. This result indicates an irregularity of drought in the region. A Long Short-Term Memory (LSTM) deep neural network was used in this study to forecast drought patterns based on long-term meteorological data from 1975 to 2016, using historical precipitation data recorded at five synoptic stations. The model provided short-term predictions, and there was a strong correlation between observed and expected data. The forecast results for the 100 months succeeding the analyzed time series data indicate that arid conditions will persist, similar to trends observed in the previous decade. These findings demonstrate that the LSTM model is a valuable tool for predicting meteorological drought, with particular utility for Mediterranean regions characterized by complex topography and small basins.

1. Introduction

Drought is a natural phenomenon characterized by an extended period of exceptionally low precipitation leading to a deficiency in water resources with serious consequences for agriculture, water supply, ecosystems, and public health (Wilhite, Glantz 1985; Haile et al. 2020). Nowhere is this challenge more acute than in the Mediterranean basin, where recurrent droughts significantly threaten water security and socio-economic stability (Tramblay et al. 2020). The Al Hoceima region in northern Morocco, with its complex mountainous topography and Mediterranean climate, is particularly vulnerable to precipitation variability and extended dry periods (Benassi 2008). Consequently, effective drought risk management here depends not only on robust monitoring but, crucially, on reliable forecasting to enable proactive adaptation.

For monitoring, the Standardized Precipitation Index (SPI) has emerged as a key metric due to its statistical robustness and reliance solely on precipitation data, making it widely applicable and recommended (WMO 2012). It effectively identifies and quantifies drought severity and duration. Although the SPI excels at diagnosing past and current conditions, it is inherently descriptive, not predictive. Translating monitoring into actionable forecasting requires complementary modeling approaches.

To this end, a wide array of models has been employed, from statistical time-series analyses to conceptual hydrological models (Fung et al. 2020). More recently, machine learning (ML) and deep learning models have shown exceptional promise in capturing complex, non-linear patterns in hydrological data (Kratzert et al. 2018). Among these, Long Short-Term Memory (LSTM) neural networks are especially suited for sequential data such as precipitation time series, due to their ability to learn long-term dependencies.

Despite this potential, a clear gap persists. The application of advanced deep learning frameworks, such as LSTM, for operational, medium-term meteorological drought forecasting remains underexplored in the data-scarce, topographically complex regions of North Africa, specifically in the Al Hoceima province. Most existing regional studies focus on retrospective drought assessment using traditional indices or models rather than on predictive modeling with state-of-the-art tools. Furthermore, the comparative performance of such data-driven models against simpler benchmarks in this specific climatic context is not well established.

Therefore, based on 42 years of precipitation data from five synoptic stations, this study aims to bridge this gap by: (1) conducting a comprehensive historical analysis of meteorological drought (1975-2016) in Al Hoceima using the SPI -12 index, and (2) developing and validating an LSTM-based model to forecast drought conditions. A specific objective is to evaluate the LSTM’s forecast skill against a simple linear dynamical model, thereby critically assessing the added value of a complex neural network approach for early warning in this vulnerable region.

2. Materials and methods

2.1. Geographical location

The Al Hoceima basin, located in the northern part of the Rif Mountains in Morocco (46°02’ – 51°08’N and 57°07’ – 64°06’E), covers a drainage area of 2315 km2 (Fig. 1). The river basin is divided into two major subbasins, Nekkour and Rhys, and three minor basins, El Ansar, Bni Boufrah, Feddal, and Ouringa. To collect meteorological data, five stations were strategically located to cover the whole region. Table 1 provides specific information on those stations, while Figure 1 shows the precise location of the study area and the hydrometric stations.

Table 1.

Rainfall stations in the Al Hoceima area.

Station	X	Y	River name	Period of data
Tamassint	626 550	495650	Ghiss	1975-2016
Tamallaht	645 050	488950	Nekkour	1975-2016
Beni Boufrah	598 200	506000	Bni Boufrah	1975-2016
Targhist	605 800	477500	Ghys	1975-2016
Al Hoceima	634 000	516 800	-	1975-2016

Fig. 1.

Study area: the Al Hoceima province in Morocco, with a detailed map showing topography, river network, and rainfall stations.

https://www.mhwm.pl/f/fulltexts/218885/MHWM-13-0008-g001_min.jpg

2.2. General climate of the Al Hoceima region

The Mediterranean climatic regime in the Al Hoceima basin is characterized by periods of abundant, low-frequency rainfall throughout the fall and winter seasons. On the other hand, the dry season is characterized by low flows and insufficient precipitation for an extended period of the year. Three factors contribute to the distinct climate of the al Hoceima area (Agharroud et al. 2023): (1) the amount of surface solar radiation is influenced by its position, which is between 46° and 51° north latitude, (2) its remarkable topographic variety, ranging from sea level to 2036 meters, which blocks rainfall from Atlantic Ocean winter cyclonic depressions, and (3) because the region is located in northern Morocco, it receives a higher amount of precipitation from the northern flows, which have been relatively low in recent years.

This study used precipitation data from 1975 to 2016. Analysis indicates that annual rainfall in the Al Hoceima basin varies based on location. The average annual rainfall at the Beni Boufrah station, in the northwestern part of the region, is 243 mm, whereas the Targuist station, in the eastern part, has a higher average of 463 mm.

2.3. Analytical workflow

2.3.1. Methodological sequence

The overall methodology of this study follows a sequential analytical workflow, designed to progress from data preparation through model forecasting and validation. The key steps are summarized in Figure 2 and detailed in the subsequent subsections:

Fig. 2.

Analytical framework: key steps.

https://www.mhwm.pl/f/fulltexts/218885/MHWM-13-0008-g002_min.jpg

Data Collection and Preparation: Historical monthly precipitation data (1975-2016) from five synoptic stations were compiled. The data underwent quality control, including the imputation of missing values and standardization.
Drought Index Calculation: The Standardized Precipitation Index for a 12-month timescale (SPI-12) was computed for each station to quantify historical meteorological drought.
Model Development and Training: A long short-term memory (LSTM) neural network was configured and trained on 80% of the SPI-12 time series to learn the temporal patterns of drought.
Model Testing and Validation: The trained LSTM model was evaluated on the withheld 20% of the data using statistical metrics (RMSE, Loss). Its architecture (e.g., number of neurons) was optimized during this phase.
Forecasting and Benchmarking: The validated model was used to generate SPI forecasts for 100 future months. To contextualize its performance, the LSTM’s forecasts were compared against those from a simple linear dynamical model.

2.3.2. Standardized precipitation index (SPI)

The Standardized Precipitation Index (SPI) quantifies precipitation anomalies over a user-defined accumulation period (McKee et al. 1993). For a given time scale (e.g., 12 months), the procedure involves fitting a long-term series of precipitation totals to a probability distribution and then transforming it to the standard normal distribution.

Step 1. Data Aggregation

For a chosen time scale k (where k = 12 months in this study), the precipitation series P_(i, j) for month I and year j is aggregated into a rolling sum:

(1)

Xi,j(k)=∑l=0k−1Pi−l,j*

where j^* adjusts for year cross-over. This creates a series X^(k) of k-month precipitation totals.

Step 2. Gamma Distribution Fitting

The distribution of precipitation totals X (where X>0) is well modeled by a two-parameter gamma distribution, with a probability density function (PDF):

(2)

g(x)=1βαΓ(α)xα−1e−x/β, for x>0

where: α>0 is the shape parameter, β>0 is the scale parameter, Γ(α) is the gamma function.

The parameters α and β are estimated for each calendar month and time scale k from the n-year historical record using the method of maximum likelihood (Thom 1966; Edwards McKee 1997):

(3)

α=14a1+1+4A3,β=X¯α

With:

(4)

A=ln(X¯)−1n∑i=1nln(Xi)

Where (X¯) is the mean precipitation total.

Step 3. Cumulative Probability and Zero Precipitation Adjustment

The cumulative probability G(x) for an observed precipitation total x is given by the incomplete gamma function:

(5)

G(x)=∫0xg(t)dt

Since the gamma distribution is undefined for x = 0 and precipitation datasets may contain zeros, the cumulative probability H(x) is adjusted as:

H(x) = q + (1 − q)G(x) (6)

where q is the probability of zero precipitation, estimated as m/n, with “m” being the number of zeros in the sample of size “n”. For the 12-month time scale used here, q was effectively zero.

Step 4. Transformation to a normal distribution (SPI)

Finally, the SPI value is obtained by converting the adjusted cumulative probability H(x) to the standard normal variate Z (with mean = 0 and standard deviation = 1), such that:

(7)

SPI=Z, where H(x)=12π ∫−∞Ze−t2/2dt

This transformation ensures that negative SPI values indicate drier than median conditions, and positive values indicate wetter conditions, with the magnitude representing the severity of the anomaly (see Table 2 for classification).

Table 2.

Drought classes for SPI index (McKee 1995)

Range SPI values	Drought category
More than 2	extremely wet
1.5 ~ 1.99	very wet
1.00 ~1.49	moderately wet
–0.99 ~ 0.99	near normal
–1.49 ~ –1.00	moderately dry
–1.99 ~ –1.50	severely dry
Less than 2.00	extremely dry

In this study, the SPI for the 12-month time scale (SPI-12) was computed for each of the five stations using the full 1975-2016 monthly precipitation record. The calculations were performed using the RDIT – Drought Indices Calculator software (AgriMetSoft), which implements the standard procedure as described.

2.3.3. LSTM model architecture and hyperparameter tuning

The LSTM networks are a type of recurrent neural network used in deep learning, particularly for time-series prediction in various fields (environment, medical, autonomous driving, economic, etc.) (Kratzert et al. 2018). These models are increasingly used in hydrology for their ability to model complex temporal sequences and are also applicable to complex basins such as the region studied here, which is characterized by mountain basins with a nival-rainfall regime. Thus, the core forecasting model employed in this study is an LSTM recurrent neural network. A single LSTM layer was used to capture the temporal dependencies within the SPI-12 time series. To determine the optimal network architecture and learning parameters, a systematic hyperparameter tuning process was conducted, following established best practices in machine learning for hydrological time series (Kratzert et al. 2018; Fang et al. 2022).

Hyperparameter search and selection rationale

The following key hyperparameters were optimized via a grid search, with model performance evaluated on a held-out validation set (20% of the training data) using root mean square error (RMSE) and Loss as the primary metric:

− Number of hidden neurons: Values of {50, 100, 150, 200, 250, 300} were tested. Smaller networks (50-200 neurons) showed higher and more volatile validation loss, indicating insufficient capacity to model the complexity of the multi-decadal drought signal. Networks with 250 and 300 neurons yielded significantly lower and more stable RMSE. The final choice of 300 neurons provided the best performance without signs of overfitting, as evidenced by the close convergence of training and validation loss curves (see Fig. 5).
− Learning rate and optimization: The Adam optimizer (Kingma, Ba 2014) was selected for its adaptive learning rate capabilities, which are well-suited for noisy, non-stationary time series like drought indices. An initial learning rate of 0.005 was chosen after experimentation; this value provided a balance between fast convergence and stability.
− Training duration: A maximum of 800 epochs was set, with an early stopping callback (Prechelt 1998) monitoring validation loss with a patience of 50 epochs. This procedure prevented overfitting and ensured efficient training.

Final Architecture and Training

The final LSTM model configuration is summarized in Table 3. The model was implemented using the Keras/TensorFlow framework. The input sequence length (look-back period) was set to 12 time-steps (one year) to align with the annual cycle inherent in the SPI-12 data. The model was trained on the standardized SPI-12 series from January 1975 to October 2012 (80% of the data), using mean squared error (MSE) as the loss function.

Table 3.

Final hyperparameter configuration for the LSTM model after the tuning process.

LSTM Propriety	Learning method	Max Epochs	Gradient Threshold	ILR	LRS	LRDP	LRDF	Verbose
Value	Adam	Min = 500 Max = 800	1	0.005	Piece wise	125	0.2	1

[i] Abbreviation: LRDP – learning rate drop period, LRDF – rate drop factor, LRS – learning rate schedule, ILR – initial learning rate.

To measure the performance of the built LSTM and the quality of its predictions for the test period, the RMSE and loss were evaluated. For the test data, let O_t be the observed values and P_t the predicted values;

Loss (cross entropy): cross entropy quantifies the difference between probability distributions. The considered loss is given by:

(8)

loss=−∑i∈TEST(Ot,ilog log(Pt,i)+(1−Ot,i)loglog((1−Pt,i))

Root Mean Square Error (RMSE): For a given period T, the RMSE is given by:

RMSE = ‖Q_t − P_t‖ (9)

To evaluate the model’s performance, the results of the LSTM model were compared with those from a linear model as described in section 2.3.4.

2.3.4. Linear model (benchmark)

To thoroughly evaluate the performance of the LSTM network for drought determination, we compared it against a simple linear dynamical model. This provided a baseline for comparison. We hypothesize that the SPI dynamics at each site can be described by a first-order linear differential equation, as defined in model “Estat” (10).

(10)

(Estat)dSPI(t)dt=astationSPI(t)+bstationSPI(0)=SPI0(user defined initial condition)

Here, “station” refers to one of the five stations under study (Boufrah, Targuist, Tamellaht, Tamassit, and Al Hoceima).

To estimate the parameters astation and bstation, we discretize equation 10 using the Euler-Cauchy method (Hopfield). This results in a discrete linear system that relates the SPI at time t and t+1, SPIt and SPIt+1 for a given time step δt. The optimal parameters are found by solving a quadratic optimization problem (El Ouissari et al. 2022), presented as equation E2:

(11)

(E2)Min∑i=1N(SPIt+1−(1+astation)SPIt−bstation)2astation,bstation∈IR

The optimal parameters astation and bstation are those that minimize the sum of squared errors for all data points from a given station. The SPIt values were taken from the training covering the years 1975-2008. We solved this optimization problem using a genetic algorithm (Ahourag et al. 2023) configured with the parameters listed in Table 4:

Table 4.

Configuration of the genetic algorithm.

Option	Value
Cross-over function	Cross-over single point
Cross-over fraction	0.8
Initial population range	Random
Max generations	200
Population size	50
Selection function	Selection tournament
Mutation function	Mutation uniform

2.3.5. Data preprocessing and missing value imputation

The historical monthly precipitation dataset (1975-2016) from the five stations was first subjected to quality control. A small percentage of monthly records (<2% of the total dataset) were missing.

To address this, missing values were imputed using the mean of the 10 nearest neighboring stations for the same month and year. This spatial imputation method was chosen over temporal interpolation (e.g., using preceding and following months) for two principal reasons aligned with the region’s climatology: (1) preservation of spatial coherence: in mountainous regions such as Al Hoceima, precipitation patterns are highly influenced by topography. A missing value at one station is more likely related to the spatial precipitation field recorded at neighboring stations in the same synoptic event than to the temporal sequence at a single point. Using neighboring stations helps maintain the spatial structure of the data, which is crucial for calculating a regionally consistent SPI and (2) minimization of temporal autocorrelation bias: simple temporal interpolation (e.g., linear or seasonal averaging) can artificially reduce the variance and alter the autocorrelation structure of the time series, which is detrimental for both SPI calculation (which relies on the distribution’s shape) and for training time-series models like LSTM that learn from temporal dependencies. The impact of this imputation on the final analysis is considered negligible for the following reasons:

− The proportion of missing data was very low (< 2%).
− Sensitivity tests were conducted by comparing key statistics (mean, standard deviation, skewness) of the original series with gaps against the imputed series. The differences were statistically insignificant (p > 0.05), confirming that the gamma distribution parameters for SPI calculation remained stable.
− The LSTM model’s training stability was verified by monitoring the loss function convergence; no instabilities or anomalies attributable to the imputed values were observed.

Therefore, the chosen imputation method is judged to be appropriate and introduces no substantial bias to the subsequent drought index calculation or forecasting model training.

3. Results

3.1. Analysis of drought conditions in the Al Hoceima region

Figure 3 illustrates the temporal progression of SPI-12 (standardized precipitation index calculated over 12 months) across the five synoptic sites between 1975 and 2016. Table 5 presents an overview of drought characteristics over 12 months, based on the drought severity classification in Table 2. Multiple drought periods occurred throughout the measurement period (1975-2016), with the SPI below –1 for more than 30 years at all stations. Two significant drought periods were 1978-1984 and 1990-2005. The timing of these major drought episodes aligns with known phases of large-scale atmospheric circulation. The early 1980s droughts coincide with a persistently positive phase of the North Atlantic Oscillation (NAO), which is associated with reduced winter precipitation across the western Mediterranean (Hurrell 1995; Trigo et al. 2002). Similarly, the prolonged dry period in the 1990s and early 2000s corresponds with a documented multi-decadal shift toward drier conditions in the Mediterranean basin, influenced by both NAO variability and broader warming trends (Hoerling et al. 2012). According to Table 5, among the three locations (Tamellaht, Bni Boufrah, and Tamassit), 1999 was the most severe drought year, while 1982 was the most severe drought year for the other stations (Al Hoceima and Targuist). There were 10-7 years recognized as periods of extreme drought at all stations except Targuist, which recorded just four extreme drought years. This shows the irregularity of drought in the region. In particular, the Bni Boufrah and Tamassit stations had the lowest SPI index results, exceeding –3. Within the study’s timeframe, Table 6 displays the correlation of drought years between locations, significant at the 0.01 level, which shows a strong connection between the eastern, western, and central regions of the northern basin. The stations of Al Hoceima, Bni Boufrah, and Tamassit exhibit correlation coefficients of 0.97, 0.85, and 0.82, respectively, indicating a strong relationship between these areas. The spatial heterogeneity in drought severity – with stations like Targuist experiencing fewer extreme droughts than Tamellaht – likely reflects the pronounced topographic complexity of the Rif Mountains. Orographic effects create strong precipitation gradients and rain shadows, leading to microclimates where proximity to moisture sources (the Mediterranean Sea) and elevation critically modulate local rainfall deficits (Knippertz et al. 2003).

Table 5.

Characteristics of droughts identified within the annual time scale.

Station name	Highlight severe drought			Drought years number
Station name	SPI	Year	Season	Moderate drought	Severe drought	Extreme drought	Total
Tamassit	–3.1	1982	Winter	14	12	07	33
Tamellaht	–2.979	1999	Winter	16	09	10	35
Bni Boufrah	–3.162	1999	Winter	13	11	07	31
Targhist	–2.448	1999	Spring	10	18	4	32
Al Hoceima	–3.066	1982	Winter	13	11	9	33

Table 6.

The Pearson correlation matrix between the five synoptic stations.

	Bni Boufrah	Targuist	Tamellaht	Tamassit	Al Hoceima
Bni Boufrah	1	0.727	0.407	0.853	0.971
Targuist	0.727	1	0.349	0.620	0.718
Tamellaht	0.407	0.349	1	0.294	0.400
Tamassint	0.853	0.620	0.294	1	0.825
Al Hoceima	0.971	0.718	0.400	0.825	1

Fig. 3.

SPI-12 time series (1975-2016) for the five synoptic stations.

https://www.mhwm.pl/f/fulltexts/218885/MHWM-13-0008-g003_min.jpg

3.2. Testing phase of the LSTM model

The SPI-12 series for all synoptic stations was predicted using the LSTM model. Twenty percent of the historical data (the test set from November 2012 to December 2016) was used to evaluate the LSTM model’s capability to generalize to unseen data. Predictions from the LSTM are presented in Figure 4. The red curve represents the forecast data, and the blue curve represents the observed data. We note that these curves are remarkably close to each other.

Fig. 4.

Observed (blue) vs. LSTM-predicted (red) SPI-12 values during the test period.

https://www.mhwm.pl/f/fulltexts/218885/MHWM-13-0008-g004_min.jpg

Note that Tamellaht and Al Hoceima stations have forecast SPI values below –1.5, indicating a forecasted severe drought season between January 2017 and March 2025. However, the other stations were forecast to experience moderate drought, with SPI values above –1 for the same prediction period. These forecasts have been very close to reality, considering the drought years that have occurred recently.

3.3. Training phase of the LSTM model

The dataset from January 1975 to October 2012 was used to train the LSTM model with 80% of historical data (455 months). The LSTM’s training behavior is presented in terms of RMSE and loss criteria (Fig. 5). Because of its recurrent nature, the model learned from the historical data within a few epochs, achieving good performance over 250 months.

Fig. 5.

LSTM model training performance: (a) RMSE and (b) loss convergence over epochs.

https://www.mhwm.pl/f/fulltexts/218885/MHWM-13-0008-g005_min.jpg

3.4. LSTM model criteria validation

To validate our visual results, we measured the RMSE for the test data. It was found that the RMSE ranges from 0.86438 (Tamassint station) to 1.3091 (Targuist station), which is acceptable. The trained LSTM model may therefore be employed to forecast drought conditions, based on the estimated Standardized Precipitation Index (SPI), for the forthcoming months.

Several tests were conducted with varying numbers of hidden neurons (50, 100, 150, 200, 250, and 300) to find the optimal number of LSTM hidden layers. Values for loss and RMSE reveal that these metrics decrease with an increasing number of epochs for each neuron configuration, yet remain relatively high for 50, 100, 150, and 200 neurons. For 250 and 300 neurons, these performance measures become very small, indicating a significant improvement in LSTM quality. Lower RMSE values indicate higher forecast accuracy, which is crucial for reliable decisions in agriculture and water resource management. When comparing the observable-SPI vs. predicted-SPI and RMSE errors, the observable-SPI and predicted-SPI are initially far apart; however, as the number of neurons increases (up to 250), the curves become much closer (Fig. 6). Additionally, the RMSE was quite large initially but decreased significantly as the number of neurons increased (above 250).

Fig. 6.

Effect of hidden neuron count on LSTM performance: test RMSE (bars) and forecast-observation gap (line).

https://www.mhwm.pl/f/fulltexts/218885/MHWM-13-0008-g006_min.jpg

It is possible to enhance prediction quality by using LSTM with a large number of hidden neurons; however, to prevent overfitting, we use 300 neurons, which produces excellent predictions and was used for forecasting the coming years.

3.5. SPI -12 time series prediction

After obtaining satisfactory predictions using the test series, we validated the model on the entire data series. Figure 7 shows the error curve between the observed and predicted data. Acceptable errors are observed for all data from the five stations. The mean error ranges from 0.849 at the Tamassint station to 1.073 at the Tamellaht station, with a low standard deviation of 0.006 (El Hoceima) and not exceeding 0.802 (Tamellaht) (Table 7). The high correlation suggests that the model effectively captures the underlying patterns in the data.

Table 7.

Summary table of the statistical parameters of the error between the observed and the simulated data.

Stations	Mean	Standard deviation
Bni Boufrah	0.994	0.762
Targuist	1.056	0.797
Tamellaht	1.073	0.802
Tamassint	0.849	0.649
Al Hoceima	0.909	0.006

Fig. 7.

Prediction error (observed – predicted) for each station across the entire dataset.

https://www.mhwm.pl/f/fulltexts/218885/MHWM-13-0008-g007_min.jpg

The LSTM’s strong performance in capturing both inter-annual variability and decadal trends suggests that the primary drivers of meteorological drought in Al Hoceima are embedded within the temporal structure of the precipitation series itself. This implies that while external climatic forcings (e.g., NAO) set the background state, the sequence and memory of precipitation deficits, which the LSTM excels at modeling, are key to successful short-to-medium-term forecasting in this region.

The model’s projection of continued negative SPI values indicates a high probability that current atmospheric patterns influencing precipitation in Al Hoceima will persist in the near term. This persistence forecast is consistent with the LSTM identifying a strong autocorrelation structure in the SPI-12 series, a common feature in Mediterranean climates where drought conditions tend to exhibit multi-year memory due to land-atmosphere feedback and slowly varying sea surface temperature anomalies (SSTs) in the Atlantic and Mediterranean basins (García-Herrera et al. 2019).

3.6. Application of linear model

The estimated parameters for the “Estate 2” model (equation 11) for each station, along with the corresponding training and test data, are presented in Table 8.

Table 8.

Linear model parameters applied to the SPI training and test data .

Station	a_station	b_station	Train error	Test error
Boufrah	–0.8404	–0.0619	18.92	9.41
Targuist	–0.8476	0.03042	18.67	9.92
Tamellaht	–0.8505	–0.0255	19.29	9.18
Tamassit	–0.8609	–0.0408	18.93	9.35
Al Hoceima	–0.8792	–0.0556	19.08	9.40

Table 9.

Summary table of the critical months according to the 2024-2025 predicted years data for all stations (SPI values less than –0.5)

Station	Critical months	SPI value
Bni Boufrah	December 2024	–0.81
Bni Boufrah	April 2025	–0.94
Targuist	December 2024	–0.95
Targuist	April 20025	–1.64
Tamellaht	January 2025	–1.94
Tamellaht	June 2025	–0.64
Tamassint	-	<–0.5
Al Hoceima	July 2025	–0.5

Using the values from Table 8, we can write the specific model for each station:

(EBoufrah)dSPI(t)dt=−0,8408SPI(t)−0,0619SPI(0)=−0,702(ETarguist)dSPI(t)dt=−0,8476*SPI(t)+0,03042SPI(0)=−0,17(ETamellaht)dSPI(t)dt=−0,8505*SPI(t)+0,0255SPI(0)=−0,613(ETamassit)dSPI(t)dt=−0,8609*SPI(t)−0,0408SPI(0)=−0,596

A numerical comparison shows that the parameters for all stations are negative and range between –1 and 1. However, linear dynamic models are highly sensitive to their parameters; even small differences can yield significantly different solutions. Furthermore, the models exhibit similarly high errors in both the training and testing phases (Table 8). It is important to note that these error values are substantially higher than those of the LSTM model (c.f. Section 3.4). Consequently, we do not recommend using this linear model for forecasting SPI in this study region. The poor performance is visually confirmed by Figure 8, which shows a clear and significant discrepancy between the observed data (blue line) and the predictions from the linear model (red line).

Fig. 8.

Linear model performance: observed SPI (blue) vs. model simulations (red) for training (left) and testing (right) periods.

https://www.mhwm.pl/f/fulltexts/218885/MHWM-13-0008-g008_min.jpg

4. Discussion

The strong predictive skill of the LSTM model, evidenced by low RMSE and high correlation, underscores its suitability for drought forecasting in the climatically complex Al Hoceima region. This performance can be attributed to the model’s capacity to capture the non-linear temporal dependencies and long-term memory inherent in Mediterranean precipitation series. These features are influenced by persistent atmospheric patterns, such as phases of the North Atlantic Oscillation, and are exacerbated by the topographic variability of the Rif Mountains. The model’s projection of continued drought conditions is mechanistically consistent with this setting; the forecast of persistent negative SPI values reflects a strong autocorrelation structure in the SPI-12 series, a hallmark of Mediterranean climates where drought exhibits multi-year memory due to land-atmosphere feedback and slowly varying sea surface temperature anomalies (García-Herrera et al. 2019). The stark failure of the simpler linear model, in contrast, highlights that drought dynamics here are inherently non-linear and state-dependent, justifying the use of sophisticated data-driven approaches.

The practical utility of this forecast is twofold, providing both strategic and operational guidance for water resource management. At a strategic level, the projection of persistent aridity over the next 100 months provides critical evidence for shifting from reactive crisis response to proactive resilience building. This long-term insight supports the case for investing in water-efficient infrastructure, diversifying water sources, and adapting agricultural policies. Operationally, the model’s spatially and temporally specific forecasts enable targeted interventions. For instance, predictions of moderate drought during critical sowing periods in agricultural zones like Bni Boufrah allow for proactive measures such as switching to drought-resistant cultivars or optimizing irrigation schedules. Similarly, forecasting a severely dry January, a typically wet month, for the urban center of Al Hoceima offers water authorities crucial lead time to implement conservation measures and manage reservoir storage.

While the LSTM model provides a powerful forecasting tool, its current implementation, based solely on precipitation via the SPI-12 index, presents a limitation, particularly under a warming climate where increased evapotranspiration can intensify drought. Future research should therefore integrate temperature and radiation data using indices like the standardized precipitation-evapotranspiration index (SPEI) to better capture thermodynamic drivers. Furthermore, developing hybrid models that couple the pattern-recognition strength of LSTM with foundational physical principles (e.g., water balance constraints) could enhance extrapolation reliability and process interpretability. Establishing a framework for real-time data assimilation will be the essential final step toward operationalizing this forecasting approach for end-users in the future of agriculture and water governance.

5. Conclusion

This study presents a comprehensive analysis and forecast of meteorological drought in the Al Hoceima region of northern Morocco. Historical analysis of the SPI-12 index (1975-2016) revealed a severe and persistent drought risk, with more than 71% of years experiencing water deficits and two major drought periods identified (1978-1984 and 1990-2005). To forecast future conditions, an LSTM neural network was successfully applied. The model demonstrated high predictive accuracy, outperforming a simple linear benchmark, and projected continuing arid conditions over the next 100 months. This projection of persistent drying is consistent with recent hydrological analyses of the Nekor basin, a major sub -basin within the greater Al Hoceima study area, which has also documented a clear trend toward increasing drought severity and water resource vulnerability (Machrafi et al. 2022).

The primary methodological contribution of this work is the successful application of a deep learning framework for drought forecasting in a data-scarce, topographically complex region where traditional hydrological models are often difficult to calibrate. A notable limitation, however, is the reliance on a single meteorological variable (precipitation) via the SPI. Future climate warming necessitates the incorporation of temperature effects to fully capture drought intensity.

Consequently, future research should prioritize the integration of temperature data using more comprehensive indices, such as the Standardized Precipitation-Evapotranspiration Index (SPEI). Further validation with post-2016 data and the development of spatio-temporal forecasting frameworks will be crucial steps toward implementing a robust early warning system to enhance climate resilience for water resources and agriculture in the Al Hoceima province and similar vulnerable regions.

REFERENCES (31)

Agharroud K., Puddu M., Ivčević A., Satta A., Kolker A.S., Snoussi M., 2023, Climate risk assessment of the Tangier-Tetouan-Al Hoceima coastal region (Morocco), Frontiers in Marine Science, 10, DOI: 10.3389/fmars.2023.1176350.

CrossRef

Google Scholar

Ahourag A., El Moutaouakil K., Cheggour M., Chellak S., Baizri H., 2023, Multiobjective optimization to optimal moroccan diet using genetic algorithm, International Journal for Engineering Modelling, 36 (1), 67-79, DOI: 10.31534/engmod.2023.1.ri.05a.

CrossRef

Google Scholar

Benassi M., 2008, Drought and climate change in Morocco. Analysis of precipitation field and water supply, Options Méditerranéennes, 80, 83-87.

Google Scholar

Dai A., 2021, Hydroclimatic trends during 1950-2018 over global land, Climate Dynamics, 56 (11), 4027-4049, DOI: 10.1007/s00382-021-05684-1.

CrossRef

Google Scholar

Edwards D.C., McKee T.B., 1997, Characteristics of 20th century drought in the United States at multiple time scales, Climatology Report No. 97-2, Paper No. 634, Colorado State University, 155 pp.

Google Scholar

El Ouissari A., El Moutaouakil K., Baizri H., Chellak S., 2022, Intelligent local search for an optimal control of diabetic population dynamics, Mathematical Models and Computer Simulations, 14 (6), 1051-1071, DOI: 10.1134/S2070048222060047.

CrossRef

Google Scholar

Fang K., Kifer D., Lawson K., Feng D., Shen C., 2022, The data synergy effects of time-series deep learning models in hydrology, Water Resources Research, 58 (4), DOI: 10.1029/2021WR029583.

CrossRef

Google Scholar

Fung K.F., Huang Y.F., Koo C.H., Soh Y.W., 2020, Drought forecasting: A review of modelling approaches 2007-2017, Journal of Water and Climate Change, 11 (3), 771-799, DOI: 10.2166/wcc.2019.236.

CrossRef

Google Scholar

García-Herrera R., Garrido-Perez J.M., Barriopedro D., Ordóñez C., Vicente-Serrano S.M., Nieto R., Gimeno L., Sorí R., Yiou P., 2019, The European 2016/17 drought, Journal of Climate, 32 (11), 3169-3187, DOI: 10.1175/JCLI-D-18-0331.1.

CrossRef

Google Scholar

10.

Haile G.G., Tang Q., Li W., Liu X., Zhang X., 2020, Drought: Progress in broadening its understanding, WIREs Water, 7 (2), DOI: 10.1002/wat2.1407.

CrossRef

Google Scholar

11.

Hoerling M., Eischeid J., Perlwitz J., Quan X., Zhang T., Pegion P., 2012, On the increased frequency of Mediterranean drought, Journal of Climate, 25 (6), 2146-2161, DOI: 10.1175/JCLI-D-11-00296.1.

CrossRef

Google Scholar

12.

Hurrell J.W., 1995, Decadal trends in the North Atlantic Oscillation: Regional temperatures and precipitation, Science, 269 (5224), 676-679, DOI: 10.1126/science.269.5224.676.

CrossRef

Google Scholar

13.

Kingma D.P., Ba J., 2014, Adam: A method for stochastic optimization, available online at https://arxiv.org/abs/1412.6980 (data access 06.03.2026).

WWW

Google Scholar

14.

Knippertz P., Christoph M., Speth P., 2003, Long-term precipitation variability in Morocco and the link to the large-scale circulation in recent and future climates, Meteorology and Atmospheric Physics, 83 (1), 67-88, DOI: 10.1007/s00703-002-0561-y.

CrossRef

Google Scholar

15.

Kratzert F., Klotz D., Brenner C., Schulz K., Herrnegger M., 2018, Rainfall-runoff modelling using long short-term memory (LSTM) networks, Hydrology and Earth System Sciences, 22 (11), 6005-6022, DOI: 10.5194/hess-22-6005-2018.

CrossRef

Google Scholar

16.

Liu X., Ren L., Yuan F., Yang B., 2009, Meteorological drought forecasting using Markov Chain model, [in:] 2009 International Conference on Environmental Science and Information Application Technology, 2, 23-26, DOI: 10.1109/ESIAT.2009.19.

CrossRef

Google Scholar

17.

Machrafi O., Sguigaa A., Attou A., Sabir M., Naimi M., Chikhaoui M., 2022, Analysis of the water management system in a mountain territory, the case of the Nekor Watershed, Rif, Morocco, Open Journal of Modern Hydrology, 12 (4), 125-154, DOI: 10.4236/ojmh.2022.124008.

CrossRef

Google Scholar

18.

McKee T.B., 1995, Drought monitoring with multiple time scales, [in:] Proceedings of 9th Conference on Applied Climatology, Dallas, TX, American Meteorological Society, 233-236.

Google Scholar

19.

McKee T.B., Doesken N.J, Kleist J., 1993, The relationship of drought frequency and duration to time scales, [in:] Proceedings of the 8th Conference on Applied Climatology, 179-183.

Google Scholar

20.

Muthuvel D., Sivakumar B., Mahesha A., 2023, Future global concurrent droughts and their effects on maize yield, Science of the Total Environment, 855, DOI: 10.1016/j.scitotenv.2022.158860.

CrossRef

Google Scholar

21.

Naumann G., Alfieri L., Wyser K., Mentaschi L., Betts R.A., Carrao H., Spinoni J., Vogt J., Feyen L., 2018, Global changes in drought conditions under different levels of warming, Geophysical Research Letters, 45 (7), 3285-3296, DOI: 10.1002/2017GL076521.

CrossRef

Google Scholar

22.

Pascanu R., Gulcehre C., Cho K., Bengio Y., 2013, How to construct deep recurrent neural networks, DOI: 10.48550/arXiv.1312.6026.

CrossRef

Google Scholar

23.

Prechelt L., 1998, Automatic early stopping using cross validation: quantifying the criteria, Neural Networks, 11 (4), 761-767, DOI: 10.1016/S0893-6080(98)00010-0.

CrossRef

Google Scholar

24.

Spinoni J., Barbosa P., De Jager A., McCormick N., Naumann G., Vogt J.V., Magni D., Masante D., Mazzeschi M., 2019, A new global database of meteorological drought events from 1951 to 2016, Journal of Hydrology: Regional Studies, 22, DOI: 10.1016/j.ejrh.2019.100593.

CrossRef

Google Scholar

25.

Thom H.C.S., 1958, A note on the gamma distribution, Monthly Weather Review, 86, 117-122, DOI: 10.1175/1520-0493(1958)086<0117:ANOTGD>2.0.CO;2.

CrossRef

Google Scholar

26.

Thom H.C.S., 1966, Some Methods of Climatological Analysis, Technical Note No. 71, World Meteorological Organization, 53 pp.

Google Scholar

27.

Tramblay Y., Koutroulis A., Samaniego L., Vicente-Serrano S.M., Volaire F., Boone A., Le Page M., Llasat M.C., Albergel C., Burak S., Cailleret M., Kalin K.C., Davi H., Dupuy J.-L., Greve P., Grillakis M., Hanich L., Jarlan L., Martin-StPaul N., Martinez-Vilalta J., Mouillot F., Pulido-Velazquez D., Quintana-Vilalta J., Renard D., Turco M., Tukres M., Trigo R., Vidal J.-P., Vilagrosa A., Zribi M., Polcher J., 2020, Challenges for drought assessment in the Mediterranean region under future climate scenarios, Earth-Science Reviews, 210, DOI: 10.1016/j.earscirev.2020.103348.

CrossRef

Google Scholar

28.

Trigo R.M., Osborn T.J., Corte-Real J.M., 2002, The North Atlantic Oscillation influence on Europe: climate impacts and associated physical mechanisms, Climate Research, 20 (1), 9-17.

Google Scholar

29.

Weng P., Tian Y., Liu Y., Zheng Y., 2023, Time-series generative adversarial networks for flood forecasting, Journal of Hydrology, 622, DOI: 10.1016/j.jhydrol.2023.129702.

CrossRef

Google Scholar

30.

Wilhite D.A., Glantz M.H., 1985, Understanding: the drought phenomenon: The role of definitions, Water International, 10 (3), 111-120, DOI: 10.1080/02508068508686328.

CrossRef

Google Scholar

31.

WMO, 2012, Standardized Precipitation Index User Guide, WMO-No. 1090. World Meteorological Organization, Geneva, Switzerland.

Google Scholar

Submit your paper

Instructions for Authors