Journal of Natural Resources and Environmental Management 12. : 587Ae598. http://dx. org/10. 29244/jpsl. 587Ae598 E-ISSN: 2460-5824 http://journal. id/index. php/jpsl An application of deep learning technique to improve S2S Rainfall Forecast over Java Island. Indonesia Adyaksa Budi Raharjaab. Akhmad Faqihb. Amsari Mudzakir Setiawana a Center for Climate Change Information. Meteorological Climatological and Geophysical Agency (BMKG). Kemayoran. Central Jakarta, 10610. Indonesia b Department of Geophysics and Meteorology. Faculty of Mathematics and Natural Sciences. IPB University. IPB Dramaga Campus. Bogor, 16680. Indonesia Article Info: Received: 13 - 01 - 2022 Accepted: 12 - 08 - 2022 Keywords: Bias correction, convolutional autoencoders, rainfall forecasts, sub-seasonal to seasonal Corresponding Author: Akhmad Faqih Department of Geophysics and Meteorology. Faculty of Mathematics and Natural Sciences. IPB University. Email: akhmadfa@apps. Abstract. Subseasonal to seasonal (S2S) rainfall forecast can benefit several sectors, such as water resources, hazard management, and agriculture. However, the forecast remains challenging due to its lack of skill. This study applies Convolutional AutoEncoders (ConvAE), a deep learning technique, to improve the quality of the S2S rainfall forecast. Seven S2S model outputs incorporated with Subseasonal Experiments Projects (SubX), including CCSM4. CFSv2. FIMr1p1. GEFS. GEOS_v2p1. GEPS6, and NESM, are corrected using the ConvAE approach. We combine 407 ground observations and the CHIRPS dataset using regression kriging methods producing gridded daily precipitation data with 0,05A spatial resolution. utilize this dataset as a label to train ConvAE models and to perform bias corrections to all members of the SubX forecast data. The results show that ConvAE is able to increase the quality of weekly S2S rainfall forecasts over Java. Indonesia. The Correlation Coefficient for 1Ae4 weeks lead time are improved from: 0,76. 0,715. 0,692 and 0,722 towards 0,809. 0,751. 0,719 and 0,74. Furthermore, the average CRPSS improves between 20-30% for all lead times. How to cite (CSE Style 8th Editio. Raharja AB. Faqih A. Setiawan AM. An application of deep learning technique to improve S2S Rainfall Forecast over Java Island. Indonesia. JPSL 12. : 587Ae598. http://dx. org/10. 29244/jpsl. 587Ae598. INTRODUCTION Subseasonal to Seasonal forecasting (S2S) is a forecast that scales around two weeks to a season in advance, bridging weather and climate forecasting (Robertson and Vitart 2. S2S was recognized as a challenging time scale compared to other time scales and is often known as a Audesert of predictabilityAy due to its lack of skill (Vitart et al. However, its advantages to several sectors, such as public health, humanitarian, energy, water resource, hazard management, and agricultural field, lead to increasing demand for forecasts at this time scale (White et al. Vuillaume et al. Therefore, the operational, scientific, and application communities are becoming increasingly interested in developing (Robertson et al. and improving the quality of S2S forecasts (Kolachian and Saghafian 2019. Li et al. Wang et al. Raharja AB. Faqih A. Setiawan AM There are at least two approaches that can be used to improve the skills of S2S climate forecasting. The first approach is improving the quality of the dynamic climate model or the Earth system model itself, and the second one is postprocessing the climate modelAos output (Baker et al. Statistical postprocessing is aimed at correcting the biases and dispersion errors of the dynamic forecast model output. Its methods span from traditional bias correction to more sophisticated methods (Ratri et al. , including machine Recently, machine learning has received renewed attention in various fields over the last decade, owing to significant breakthroughs made with Deep Learning (DL) models. Because of its . layered structure. DL can hierarchically extract high-level feature illustrations. Convolutional Neural Network (CNN) has gotten much attention in spatiotemporal data sets because of their ability to learn spatial pattern from data (Bayo-Medina et al. Han et al. implemented CU-net, a CNN-based deep learning technique, in order to bias correct ECMWF 1-10 days forecasts. The results show an improvement in bias correction, coefficient correlation, root mean squared error and Mean Absolute Error (MAE). Le et al. corrected the grid-by-grid bias in satellite precipitation estimates using the Convolutional autoencoder (ConvAE), a CNN-based bias correction approach. The ConvAE model, according to Le et al. , performed admirably in capturing spatial patterns, trends, and extremes. proves the ability of the ConvAE model to bias correct satellite precipitation data. The same approach but with a different architecture was used to develop a Statistical Downscaling Model (SDM) for precipitation estimates over Northern Africa, demonstrating excellent reproducibility of the temporal rainfall projections (Babaousmail et al. To our knowledge. CNN has never been applied in correcting S2S forecasts in Indonesia. Nevertheless, a comparable method, one-dimensional CNN, has been used to predict the air temperature in Padang and produce satisfying results compared to Multilayer Perceptron (MLP) (Kurniawan et al. Despite error and dispersion problems, the dynamic forecast model output will always exhibit uncertainty problems (Muharsyah et al. One source of this uncertainty is the limited information regarding the initial conditions of the climate system (Monier et al. However, this problem can be addressed using an ensemble prediction system or Multimodel Ensemble (MME) approach (Krasnopolsky and Lin 2. This study aims to improve the quality of S2S rainfall prediction by applying ConvAE, a CNN-based bias correction technique, and employing MME in its corrected forecasts. We train historical S2S forecast data from The Subseasonal Experiment Projects (SubX) using gridded precipitation data that we produced using multiple linear regression residual kriging to develop a ConvAE trained model. Furthermore, we use the ConvAE trained model to bias correct the SubX S2S forecasts and acquire improved S2S forecasts. After that, we performed the evaluation using RMSE and Correlation Coefficient, also Continuous Ranked Probalistic Skill Score (CRPSS) for deterministic and probabilistic forecasts, respectively. The evaluation was performed for every single model and multimodel ensemble on a weekly time scale. METHODOLOGY Data Combined satellite and ground observation with spatial resolution 0,05A x 0,05A at daily time scales have been used in this study. Following Misnawati et al. , we combine 378 ground observations obtained from BMKG . ind blue dots in Figure . and CHIRPS satellite estimation products (Funk et al. using multiple linear regression and residual kriging, while 99 stations . ind black dots in Figure . have been used to evaluate this method. This approach has been exposed to perform well in correcting CHIRPS data over Java and introducing the best agreement to the ground observation data (Misnawati et al. This combined dataset was then used as a label in the deep learning technique to correct S2S forecasts in the next stage. Jurnal Pengelolaan Sumber Daya Alam dan Lingkungan 12. : 587Ae598 This study also utilized seven S2S model outputs incorporated with Subseasonal Experiment Projects (SubX) consisting of CCSM4. CFSv2. FIMr1p1. GEFS GEOS_v2p1. GEPS6, and NESM. The SubX model covers the global domain at 1A x 1A spatial resolution and the daily time scale with A18 years of hindcasts data and A18 months of real-time forecasts (Pegion et al. This dataset was made publicly available through the IRI Data Library and might be accessed via the following link: https://iridl. edu/SOURCES/. Models/. SubX/. More detailed information regarding the modelAos institution, ensemble member, update interval, lead time, and time-coverage of hindcasts and forecasts of the utilized SubX data (Kirtman et al. can be found in Table 1. Figure 1 The topography of Java Island and the distribution of ground observation Table 1 SubX model data was used in this study Hindcasts Institution Models RSMASCCSM4 NCEPCFSv2 ESRLFIMr1p1 EMC-GEFS EcGEPS6 GMAOGEOS_v2p1 NRL-NESM Lead EnsMember er Initinterval . Forecasts Time EnsMember er Init-update Start dates 1999 - 2016 Sunday 12/31/2017 Ae 12/29/2019 1999 - 2016 Everyday 1999 - 2016 Wednesday 1999 - 2016 Wednesday 1999 - 2016 Thursday 1999 - 2016 4-every 1999 - 2016 SaturdayTuesday 12/31/2017 Ae 12/31/2019 12/27/2017 Ae 12/25/2019 12/27/2017 Ae 12/25/2019 12/28/2017 Ae 12/26/2019 12/27/2017 Ae 12/27/2019 12/30/2017 Ae 12/31/2019 Note: *initialized four times a day, but we only used the data that initialized at 00:00Z. **updated four times a week. Therefore, we used all init-update Raharja AB. Faqih A. Setiawan AM From Table 1, we noticed that each model have different ensemble member, interval, and start dates in the hindcast and forecast period. Therefore, we used the model ensemble mean in the hindcast period and averaged it to all start dates available for the following training process described in Figure 2. Furthermore, we choose the latest issues closest to the 1-4 weeks target forecasts to handle the different forecast start All SubX data were then re-gridded to 0,05A applying bilinear interpolation to match the spatial resolution of the observation dataset. This re-gridded dataset was then fed to the deep-learning models in the next step. Ensemble Mean of SubX ModelAos Hindcast . Combined CHIRPSGround Observation Training . SubX ModelAos Forecasts . Trained ConvAE Corrected SubX Forecasts . Figure 2 ConvAE flowchart: . Model training, . Model application Convolutional Auto Encoders (ConvAE) This study applied the ConvAE architecture to correct the bias of SubX model output. ConvAE merges the architecture of CNN with autoencoder (AE), which consists of an encoder and decoder. Encoders employ an effective learning process in compressing and encoding data by reducing data dimensions and letting the noise pass through while the decoder reconstructs the encoded data as similar as possible to the input data (Le et al. ConvAE is commonly used in image noise reduction, hence it could also be applied in statistical downscaling or bias correction. Initially, we train the ensemble mean of hindcast data from seven SubX models using ConvAE In the training process, we split the hindcast data from 1999-2016 into proportion 2:1 as training and validation sets, respectively. The ConvAE training has been carried out with 100 epochs, a filter size of 3 x 3 kernels, and a batch size of 32. The reluactivation function was applied in all convolution layers, while the linear activation function has applied in the MaxPooling and Upsampling layers. The MSE loss function with a learning rate of 0,00001 and Adam function as an optimizer was used to train the model. Table 2 explains the detailed architecture of ConvAE used in this study. The training processes were resulting seven trained ConvAE models. These trained ConvAE models were then used to correct the raw SubX modelAos output. We have conducted the bias correction to all members of the SubX modelAos output . members in tota. at daily time scales for 2018-2019. The general process of the deep learning technique that has been applied is described in Figure 2. Jurnal Pengelolaan Sumber Daya Alam dan Lingkungan 12. : 587Ae598 Table 2 Detailed architecture of ConvAE used in this study Layer . Output Shape Parameter # Connected to input 1 (InputLaye. (None, 224, 224, . conv2d 37 (Conv 2D) (None, 224, 224, . input 1 . max pooling 2d 37 (None, 112, 112, . conv2d 37 . (MaxPooling2D) conv2d 38 (Conv2D) (None, 112, 112, . max pooling2d 17. max pooling 2d 38 (None, 56, 56, . conv2d 38 . (MaxPooling2D) conv2d 39 (Conv2D) (None, 56, 56, . max pooling2d 38 . max pooling 2d 39 (None, 28, 28, . conv2d 39 . (MaxPooling2D) conv2d 40 (Conv2D) (None, 28, 28, . max pooling2d 39 . max pooling 2d 40 (None, 14, 14, . conv2d 40 . (MaxPooling2D) conv2d 41 (Conv2D) (None, 14, 14, . max pooling2d 40 . conv2d 42 (Conv2D) (None, 14, 14, . conv2d 41 . up sampling 2d 42 (None, 28, 28, . conv2d 42 . (UpSampling2D) conv2d 43 (Conv2D) (None, 28, 28, . up sampling2d 42 . up sampling 2d 43 (None, 56, 56, . conv2d 43 . (UpSampling2D) conv2d 44 (Conv2D) (None, 56, 56, . up sampling2d 43 . up sampling 2d 44 (None, 112, 112, . conv2d 44 . (UpSampling2D) conv2d 45 (Conv2D) (None, 112, 112, . up sampling2d 44 . up sampling 2d 45 (None, 224, 224, . conv2d 45 . (UpSampling2D) conv2d 46 (Conv2D) (None, 224, 224, . up sampling2d 45 . conv2d 46 . add (Ad. (None, 224, 224, . input 1 . Note: Total parameters: 481. Trainable parameters: 481. Non-trainable parameters: 0. Performance Evaluation The performance evaluation has been conducted to the raw and ConvAE corrected for lead 1Ae4 weeks In this study, we evaluated the multimodel ensemble mean forecasts using coefficient correlation (CC) and RMSE. Furthermore, probabilistic forecasts were assessed using Continuous-Ranked Probabilistic Skill Score (CRPSS). These indicators were calculated grid-by-grid of raw and ConvAE corrected forecasts at a weekly time scale. Root Mean Square Error (RMSE) RMSE was calculated to assess deterministic forecast performances. RMSE measured the differences between the prediction of a model as an estimate of the observed values. The formulation of RMSE is defined as follows: ycIycAycIya = Ooycu Ocycuycn=1. cuycn Oe yceycn )2 Raharja AB. Faqih A. Setiawan AM Here ycuycn denotes the observed weekly precipitation records, and yceycn denotes the weekly precipitation The RMSE values displayed in Figure 3 were calculated over 2018-2019 and averaged over all Correlation Coefficients (CC) The correlation coefficient . between forecasts and observed weekly rainfall was also used for measuring model performance and defined as follows: yc = Ocycu ceycn Oeyce ) . cuycn Oeycu I 2 I)2 ocycu ycn . ceycn Oeyc. cuycuycn Oeycu where ycuI and yce I denotes the average of observed and forecasts weekly precipitation, respectively. Continuous-Ranked Probabilistic Skill Score (CRPSS) CRPSS measured skills of the probabilistic prediction relative to the forecast reference. CRPSS compares CRPS from the predicted rainfall . aw or correcte. against the climatological mean prediction CRPS can be formulated as follows (Ratri et al. CRPS = OOeO. Oe yaycu . ]2 where F. is the CDF of the predicted ensemble and Fo . is a Heaviside function that has a value of 0 if the forecasts value < observed value and a value of 1 if the forecasts value Ou observed value. CRPSS can be defined as follows: CRPSS = 1- CRPSfcst CRPSref where CRPSfcst is the ensemble forecast CRPS, while CRPSref is the reference . forecast CRPS. The CRPSS value spans from -O to 1. A positive result implies that the prediction outperforms the prediction reference. 1 is the perfect score of CRPSS. RESULT AND DISCUSSION Area Averaged Forecasts Figures 3 and 4 illustrate the weekly area-averaged rainfall forecasts of seven raw and ConvAE corrected the SubX modelAos output for lead time 1Ae4 weeks, respectively. The thick black and blue lines indicate observation and SubX MME, while other colors indicate the ensemble mean of seven single models incorporated in SubX. Moreover, the blue shades in these figures explain the spreads of all 67 SubXAos ensemble members. Figure 3 shows that all raw models in all lead times have better represented the actual observation at the dry season (JulAeOc. , indicated by all model lines that almost coincide with the observation. Nevertheless, almost all models have shown a large gap with the observation at the wet season (DecAeMa. Regarding lead time. Figure 3 shows a slightly decreasing predictive quality as the increase of lead time forecasts, especially during the wet season. Related to Figure 3. Figure 4 shows an improvement in prediction quality compared to the raw ones. The gap between SubX MME and observation is narrowing in all lead times and all models. Besides, the ensemble spread of the ConvAE corrected forecast was also smaller compared to the raw It theoretically indicates higher forecast accuracy since it has smaller uncertainty (Barker 1. Jurnal Pengelolaan Sumber Daya Alam dan Lingkungan 12. : 587Ae598 Figure 3 Weekly area-averaged rainfall forecasts of the raw SubX modelAos output over Java. Indonesia for lead time 1Ae4 weeks in 2018-2019 Figure 4 Weekly area-averaged ConvAE corrected rainfall forecasts over Java. Indonesia for lead time 1 Ae4 weeks in 2018Ae2019 Correlation Coefficient (CC) CC assesses the linear relationship between weekly observed and rainfall forecast for all grid points in the 2018Ae2019 periods. Figure 5 clarifies CC improvement of ConvAE corrected forecasts compared to SubX raw forecasts at weekly time scales, except for three weeks lead time. The CC for lead 1,2 and 4 weeks Raharja AB. Faqih A. Setiawan AM improved from: 0,787. 0,753. and 0,746 towards 0,824. 0,768. and 0,757. Spatially. CC for all lead times was ranged from 0,7-0,8 for most areas of Java Island, except the western part of Java (Banten Provinc. and the North-Eastern part of Central Java which has a value up to 0,6. Figure 5 The correlation coefficient between SubX MME weekly rainfall forecasts and observation of . raw and . ConvAE corrected for lead time 1Ae4 weeks Root Mean Square Error (RMSE) RMSE was calculated for raw, and ConvAE corrected SubX MME weekly rainfall forecasts over Java. Indonesia for 2018Ae2019. The results show that averaged RMSE values were slightly decreased for all leads in ConvAE corrected forecasts. Generally, the eastern part of Java has a lower RMSE value compared to other areas. It could be clearly shown in one week lead time, while the eastern part of Java mostly has RMSE values up to 25 in ConvAE corrected forecasts, except for the mountainous areas with RMSE values >50. The center and western part of Java commonly shows RMSE values >30 in one week lead time. Continuous-Rank Probability Skill Score (CRPSS) Figure 7 shows that ConvAE typically improves CRPSS values up to 0. 5 in the one week lead time. The area-averaged of CRPSS improves from 0,294. 0,205. 0,176 and 0,167 towards 0,384. 0,254. 0,211 and 0,218. for week 1Ae4 forecasts, respectively. Their improvement varies between 20-30%, with the greatest improvement occurring in one week lead time. The highest CRPSS score also happened in the one-week lead At the same time, the lowest appeared in the four-week lead time of raw SubX MME forecast. Related to the spatial pattern of RMSE values in Figure 6, the CRPSS of the eastern part of Java also introduced better values compared to the western part of Java. It could be because eastern JavaAos dry season is more steadily dry than the western part, and JavaAos western part experiences greater rainfall variability (Ratri et al. If we take a closer look at Figure 6. Figure 7 and compare them to Figure 1, we can conclude that higher RMSE values and lower CRPSS scores are associated with local topographic effects. Since the higher RMSE values and lower CRPSS values are persistent occurred in the mountainous area. The poor skill at mountainous areas in raw models might be due to coarse spatial resolution of GCM output that do not incorporate local topographic effect (Nover et al. The primary and most persistent circulation error in global NWP and GCM is an insufficient depiction of orographic influence on atmospheric flow (Sandu et al. The ConvAE could not address this problem since the RMSE values are still high and CRPSS values remain low in the mountainous area. It could be affected by the deep network in our ConvAE architecture that usually discards detailed image information, presumably because too much detail is already lost during convolution and pooling (Mao et al. Jurnal Pengelolaan Sumber Daya Alam dan Lingkungan 12. : 587Ae598 Figure 8 shows averaged RMSE, correlation coefficient, and CRPSS for all individual models compared to SubX MME. The upper row is all raw model forecasts, while the bottom row is ConvAE corrected In general, the area-averaged RMSE values of ConvAE corrected forecasts are decreased compared to raw ones, except for GEFS, which increased for 2Ae4 weeks lead times. Depending on models and lead times, the improvement of RMSE values varies up to 39%. Furthermore, the most remarkable RMSE improvements occurred in NESM for one week lead time, which improved from 47,2 to 28,8. further area-averaged correlation coefficient is increased between 0,006-0,201, which one week lead time of NESM also recording the notable increases. It also happened in CRPSS, while the CRPSS value of NESM in one week lead time was improved from -0,437 to 0,327. Another notable enhancement occurred in CCSM4 at one and four weeks lead time and CFSv2 at all lead times, where CRPSS improved from negative to positive after ConvAE bias correction. Figure 6 RMSE of . raw and . ConvAE corrected SubX MME weekly rainfall forecasts for lead time 1-4 Figure 7 Overall CRPSS of . raw and . ConvAE corrected SubX MME weekly rainfall forecasts for lead time 1Ae4 weeks Raharja AB. Faqih A. Setiawan AM Figure 8 Area averaged weekly rainfall forecasts skill (RMSE. Correlation Coefficient and CRPSS) of raw . and ConvAE corrected . SubX models for lead time 1Ae4 weeks CONCLUSIONS This study has implemented convolutional autoencoders (ConvAE), a deep learning technique, to improve S2S rainfall forecasts over Java. Indonesia. The results show that ConvAE has increased the quality of weekly S2S rainfall forecasts over Java. Indonesia, by improving the correlation coefficient and CRPSS and decreasing the RMSE value. Although we have demonstrated that the ConvAE could effectively improve the prediction skills of weekly forecasts in general, the ConvAE could not address poor skills in the mountainous area. It might result from the deep ConvAE network architecture that usually ignores detailed image information because too much detail is already lost during the convolution and pooling process. suggest employing a skip connection, e. U-NET architecture, in the following research to solve this issue. The feature maps transmitted via skip connections include a lot of information detail, enabling deconvolution to recover a better image version. ACKNOWLEDGEMENT We want to acknowledge the climate modeling groups coordinated in SubX Projects (NASA. NOAA/NCEP. University of Miami. NRL, and Environment Canad. for developing and making their model output public. Also, we want to thank Agency for Meteorology Climatology and Geophysics (BMKG) for providing the ground observation dataset used in this study. REFERENCES