Research Paper Journal of Engineering and Technological Sciences Rapid Flood Mapping Using Statistical Sampling Threshold Based on Sentinel-1 Imagery in the Barito Watershed. South Kalimantan Province. Indonesia Muhammad Priyatna1,2*. Muhammad Rokhis Khomarudin1. Sastra Kusuma Wijaya2. Fajar Yulianto1. Gatot Nugroho1. Pingkan Mayestika Afgatiani1,3. Anisa Rarasati1 & \\\\\\\\ Muhammad Arfin Hussein4 Research Center for Remote Sensing Technology. Research Organization for Aeronautics and Space. National Research and Innovation Agency. East Jakarta, 13710. Indonesia Physics Department. Faculty of Mathematics and Natural Sciences. University of Indonesia. Depok, 16424. Indonesia Faculty of Science. Graduate School of Science and Engineering. University of the Ryukyus. Okinawa. Japan Instrumentation Electronics Study Program. Indonesian Nuclear Technology Polytechnic. Yogyakarta. Indonesia *Corresponding author: muha054@brin. Abstract Flood disasters occur frequently in Indonesia and can cause property damage and even death. This research aimed to provide rapid flood mapping based on remote sensing data by using a cloud platform. In this study, the Google Earth Engine cloud platform was used to quickly detect major floods in the Barito watershed in South Kalimantan province. Indonesia. The data used in this study were Sentinel-1 images before and after the flood event, and surface reflectance of Sentinel-2 images available on the Google Earth Engine platform. Flooding is detected using the threshold method. In this study, we determined the threshold using the Otsu method and statistical sampling thresholds (SST). Four SST scenarios were used in this study, combining the mean and standard deviation of the difference backscatter of Sentinel-1 images. The results of this study showed that the second SST scenario could classify floods with the highest accuracy of 73. The inundation area determined by this method was 4,504. 33 km2. The first, third and fourth SST scenarios and the Otsu method could reduce the flood load with an overall accuracy of 48. 37%, 43. 79%, 55. 5% and 68. 63%, respectively. The SST scenario is considered to be a reasonably good method for rapid flood detection using Sentinel-1 satellite imagery. This rapid detection method can be applied to other areas to detect flooding. This information can be quickly produced to help stakeholders determine appropriate flood management strategies. Keywords: flood. Barito watershed. Sentinel-1. statistical sampling. Introduction Flooding can be defined as the process of water overflowing land areas and the tides associated with this overflow . Floods are often considered disasters when people are affected by their impacts . Flood disasters are not uncommon in Indonesia . and occur more frequently in the western regions of the country due to higher rainfall . Flood disasters can cause property damage and disruption of community activities . and often result in fatalities . In addition to the initial damage, floods can also cause numerous post-event problems. Disease outbreaks are an example of a secondary disaster that can be triggered by flooding . Flood disasters usually bring the economic life of the affected communities to a standstill, necessitating the provision of assistance to those affected. Copyright A2023 Published by IRCS - ITB ISSN: 2337-5779 Eng. Technol. Sci. Vol. No. 1, 2023, 98-107 DOI: 10. 5614/j. Rapid Flood Mapping Using Statistical Sampling DOI: 10. 5614/j. Information on the location of flood events is needed to facilitate the distribution of relief supplies and the management of flood disasters. Large floods, such as those in Indonesia's South Kalimantan province, have covered wide areas and damaged some settlement, forest, plantation, and agricultural areas. Therefore, resources must be available to effectively map floodplains. The problems associated with this mapping can be solved using remote sensing technologies, which have the advantage of covering larger areas than conventional methods and always providing up-to-date data, which is part of the role of remote sensing data and information in the disaster management cycle . Remote sensing data can support modeling of flood-prone areas . The frequency ratio is an example of a methodological approach to modeling the degree of flood hazard based on remote sensing data . In addition, remote sensing data have the potential to be used as input for flood Remote sensing technologies have been used to detect areas of water in the form of transient puddles that occur during floods. Images in the green band and NIR band are used to detect water bodies using the normalized difference water index (NDWI) . Since the green band can increase the reflectance of water bodies and the NIR band can decrease the reflectance, these two bands are used. On the west coast of West Lombok, the NDWI is used to identify water bodies. If the image used is cloud-free, optical imaging can be used to identify bodies of water. however, if there is heavy cloud cover, flooding is more likely. As a result, optical imagery loses clarity, and radar imagery may be more effective . The advantage of radar satellite imagery is that it is unaffected by cloud cover . and is therefore suitable for detecting water bodies . Although radar imagery can detect water bodies well, radar satellite imagery, which is acquired only once, cannot determine the difference between floods and pre-existing water bodies such as lakes. Therefore, in this study, we attempted to use multi-temporal Sentinel-1 radar satellite imagery to distinguish receiving waters from flooded areas. Several studies have used a specific threshold to detect flooding from radar imagery . The objective of this study was to determine the ability of certain statistical thresholds and the Otsu method to detect flooded areas in the Barito watershed in South Kalimantan province. Indonesia. The use of a threshold on Sentinel- 1 data enables rapid detection of floodplains that can be used for rapid disaster management. While some research has been conducted using SAR imagery, it used expensive data such as TerraSAR X . The advantage of using Sentinel-1 data is that it is available for free on a cloud platform. Materials and Methods Location The Barito watershed is located in the administrative area of South Kalimantan province (Figure . Geographically, the Barito watershed is located at 114o20' to 115o52' and 1o0' to -3o44'. The Barito catchment is one of the most important watersheds in Borneo. The provinces of South Kalimantan and Central Kalimantan are both located in the headwaters of this watershed. In January 2021, the Barito watershed experienced unusually heavy rains that caused flooding in several locations. The flood event in the Barito watershed, particularly in the province of South Kalimantan, was an exceptional flood event. It was the largest flood in the last 50 years. The flood also affected 5,000 inhabitants . Therefore, this place can be used as learning material for research related to flood disasters. Data Sentinel-1 multi-temporal imagery was employed in this study. Sentinel-1 image mosaic for July 2020 was used for the Aobefore floodAo imaging (Figure 2. , while the January 2021 post-flood imaging was based on the Sentinel1 picture mosaic (Figure 2. The Google Earth Engine (GEE) platform provided the information used. polarization was used to detect flood areas, making it simpler to detect flooding in regions where there is vegetation, according to a study conducted in Pakistan . Because of the effect of multiple reflections generated by water surfaces and vegetation. VV polarization is used in flood monitoring, and the majority of backscattering exhibits stable polarization . Sample data from flood locations obtained from the Indonesian National Disaster Management Agency (BNPB) and the PetaBencana. id portal were used in threshold calculations and accuracy checks (Figure . The sampling points were located across the Barito watershed from the upper to the lower reaches. There were 178 sample Muhammad Priyatna et al. points consisting of 65 non-flood sample points and 113 flood sample points. The limited number of sample points for flood locations, which were only 113 points, made the authors use 25 samples as training data for the statistical sample threshold and 88 other flood points as validation tests. A small number of samples intended to test if there are limited field data conditions as training due to flooding and whether rapid detection of flood inundation using this approach can still produce good results. Figure 1 Research location in the Barito watershed in South Kalimantan. Figure 2 Images used in this study. (Source: GEE). Rapid Flood Mapping Using Statistical Sampling DOI: 10. 5614/j. Figure 3 Samples of flood data. (Sources: BNPB and PetaBencana. Method The fastest approach for binary classification of a picture is the threshold method. Choice of the correct threshold is a key aspect in this procedure since it affects the classification outcomes . Several methods can be applied to determine the threshold. In this study, we will use statistical sample thresholds and the Otsu method to determine the detection threshold for flood inundation in the Barito watershed. South Kalimantan. In general, the flood inundation detection method in the Barito watershed. South Kalimantan Province, consists of . pre-processing, . processing, and . assessment, as shown in Figure 4. Figure 4 Rapid flood mapping process. Muhammad Priyatna et al. Pre-processing: The Sentinel-1 VV imagery was used to map flood inundation and GEE was used to process the data. The pre-processing stage was carried out to prepare the data to be ready to be used as input for determining the threshold and detection of flood inundation. This stage consisted of several steps, including area focusing. Sentinel-1 image speckle noise correction, determination of backscattering differences, detection of permanent water bodies from Sentinel-2 SR images, and preparation of flood samples. Deductions were made on the area of interest, specifically the Barito watershed boundary in South Kalimantan Province. Indonesia, to speed up processing time. To remove the effect of speckle noise, a 3 x 3 median filter was used, chosen because it produces a superior signalto-noise ratio . Detection of permanent water bodies was carried out from the 2020 mosaic Sentinel-2 images using the automatic water extraction index (AWEI). Permanent water bodies are pixels with a value greater 232 . This permanent water body is used as a mask so that the permanent water body can be removed from the Sentinel-1 image. This process is done to minimize errors in flood detection Furthermore, ordinary mathematical processes were used to calculate the difference between the mixed values. The difference in scattering values was calculated by subtracting the post-flood data from the data collected before the flood. Processing: Determination of the threshold for detecting flood inundation in this study was conducted using two methods. The first method was performed by using a statistical sample threshold. The second method used to obtain flood threshold values is the Otsu method. The Otsu approach uses a criterion measure to assess the variance between threshold classes before applying it to the image to create a flood map with wet and dry pixels . The Otsu technique is a method of classification that uses a global threshold to discriminate between groups based on data from a normal distribution . The technique divides the data into two groups based on the grey level . , the first class having low grey level value and the second having high grey level value. Calculation of variance determines the grey level. Flood maps have been made using this technology for a significant time. Eq. shows how to calculate the Otsu threshold . yu 2 = ycEycuyc . cAycuyc Oe ycA)2 ycEyc . cAyc Oe ycA)2 ycA = ycEycuyc ycAycuyc ycEyc ycAyc ycEycuyc ycEyc = 1 ycNEaycCycycyc = yaycyci max . u 2 ) ycuOycNEaycCycycyc Oyc In Eq. , yu is the variance between classes of water bodies and non-water bodies. ycAycuyc and ycAyc are the average pixels of non-water bodies and water bodies. ycEycuyc and ycEyc are the probabilities of pixels belonging to the nonwater-body or the water-body class. ycA is the average of the entire image. The other threshold method used in this research was the statistical sample threshold. The statistical sample threshold is a threshold determination method that is carried out by calculating the average value and standard deviation of the difference between Sentinel-1 backscattering images before and after flooding. The goal of this strategy is to simplify classification by combining threshold- and sample-based guided classification with limited A mixture of statistical values is used to establish the threshold value. The average value and standard deviation were obtained from the sample points where the flood occurred. This study used four SST calculation scenarios, as indicated by Eqs. Threshold ycNEa is calculated using the following simple mathematical ycNEa1 = ycuI Ae yua ycNEa2 = ycuI yua ycNEa3 = ycuI Ae 1. ycNEa4 = ycuI 1. 5 yua where ycuI is the sample mean and yua is the sample standard deviation. Rapid Flood Mapping Using Statistical Sampling DOI: 10. 5614/j. Results and Discussion Flood Mapping Flood Detection The distribution of pixels representing wet and dry areas are visible in the threshold results. Flooded locations are those that are flooded with water. the southern half of the Barito watershed is where flood inundation is most noticeable (Figure . In the south, the downstream portion of the Barito watershed has a rather flat Cities with dense residential populations can be found in this area and as a result, if flooding occurs, its impact will be large. Each threshold has the potential to detect floods, as can be seen from the maps. However, statistical thresholds 2 and 4 detected significantly more flooding than the other thresholds, as demonstrated by the dominant blue areas. It is possible to determine the flood area based on these data. The Otsu detected floods with an area of 1,096. 28 km2 utilizing flood detection. Flooding can also be detected using the proposed sample threshold statistical method. Using Eq. , the flood inundation area detected using statistical threshold 1 was 269. 55 km2. Flood areas of 4,504. 01 km2, 134. 33 km2 and 16,444. 88 km2 were detected using statistical thresholds 2, 3, and 4, respectively. Figure 5 Barito watershed flood detection results using (A) statistical threshold 1, (B) statistical threshold 2, (C) statistical threshold 3, (D) statistical threshold 4, and (E) the Otsu threshold. The Otsu threshold approach can accurately and clearly detect floods and produces acceptable results. locations with a rather flat slope, floods are recorded (Figure . Meanwhile, the flooding observed using the statistical sample threshold technique appears to be exaggerated for statistical thresholds 1 and 2 and is more likely to be underestimated for statistical threshold 3. Even in places with a rather steep slope, both systems detect flooding. With results that are close to the Otsu threshold, statistical threshold 1 can detect flooding. Muhammad Priyatna et al. Figure 6 Flood inundation overlay with slope data. Accuracy Assessment A total of 153 flood location locations were used in the accuracy test. The test point locations were picked at A confusion matrix approach was used to conduct the accuracy testing, providing an overall accuracy index, chosen because it is a typical approach used for describing classification accuracy . According to the accuracy test findings, threshold 2 had the highest accuracy, at 73. 2%, while threshold 3, with a precision of 79%, had the lowest accuracy. The accuracy test results are summarized in Table 1. Table 1 Accuracy test result. Methods Th1 Th2 Th3 Th4 Otsu Omission error Flood Not flood Commission error Flood Not flood Overall Statistical Sample Threshold Performance Floods in open locations are easily detectable. However, in highly inhabited areas floods cannot be identified clearly using either the statistical sample threshold or the Otsu threshold approach. This finding can be attributed to a variety of factors. One of the causes is that the spatial resolution employed is limited. Sentinel-1 images with a resolution of 10 m will be more dominant in receiving backscatter from the roofs of buildings in places that are particularly densely populated. Backscatter is caused by flood conditions that do not submerge the entire roof of the building, but which are captured by sensors as reflections from the roofs. In addition, flooding in residential areas can cause radar waves to experience double-bounce backscattering. Double-bounce backscattering causes the backscattering value to be higher. As a result, the backscatter difference value becomes positive and is not identified as a flood area . Furthermore, because water that has retreated does not indicate tiny variations from the before-flood images, the time interval between the flood event and the final image used can alter the flood classification. The Rapid Flood Mapping Using Statistical Sampling DOI: 10. 5614/j. backscattering of water and non-water objects is significantly distinct, hence, these images can be used to discern between the two . Because the backscatter during/after the flood is smaller than before the flood, the calculation used of the difference in backscatter value of the flood point after and before the event will generate a negative number. This condition applies to relatively open areas such as agricultural land and bare soil. This phenomenon is among the results of . , which shows that the difference between Sentinel-1 backscattering before and after flooding in relatively open areas is negative. This is because bodies of water will cover objects other than bodies of water. In contrast to an item that was previously a body of water, the backscatter value after and before the flood will be nearly identical. This could reveal the difference between a permanent body of water and a flood inundation. The VV polarization backscattering of aquatic bodies ranges between -22. 4 dB and -12. 9 dB. For non-water bodies, the backscattering of VV polarization ranges between -10. 8 dB and -1 dB . As a result, the backscatter difference has a negative value. The statistical calculation of the sample revealed an average backscattering difference of 0. 94 dB with a standard deviation of 1. 5 dB. Flooding is determined to have a backscattering differential value smaller than the threshold. As a result, if the value of ycuI is dropped by E or 1. 5E, the threshold value is smaller, and there are locations with values around ycuI that are not detected as floods, resulting in underestimates for statistical threshold 1 and statistical threshold 3. If the threshold value is multiplied by a number that is too great, non-flood sites will be recognized as flooding. As a result, statistical threshold 4 produces overestimates. With a limited number of samples, the statistical sample threshold method for scenario 2 is sufficient to provide results with reasonably good accuracy, i. , above 70%. Thus, this method has the potential to be applied in rapid flood mapping where sample availability is challenging to obtain. Meanwhile, adding E will cause numerous flood readings acquired between ycuI and ycuI E to be detected as flooding, enhancing accuracy when compared to using ycuI as threshold. The precision of this strategy is better when it comes to locating flood events on flat terrain. However, such calculations will still have certain limitations in places with a fairly steep slope, causing the results to be overstated. In order to improve the methodAos capabilities, more post-classification work is required. In this, the use of DEM data as a filter has the potential to be useful. The use of statistical sample thresholds, particularly scenario 2, has the potential to be further developed, since Liang and D. Liu . found that local thresholds have the potential to be more accurate than global thresholds. Various scenarios that refer to land cover types may be further developed to overcome the problem of doublebounce backscatter flood inundation in urban areas. The method of mapping floods based on historical remote sensing data is feasible. Furthermore, the integration of the results of the analysis of the mapping of flood inundation historically with flood modeling is interesting to be developed. At present, several flood modeling methods have been developed, such as the Hydraulic Model . , the Flood Risk Map Model . , and the Potential Flood Model . These models can be integrated with the flooding inundation mapping based on the satellite image of remote sensing. The results of the analysis are expected to provide an overview related to the phenomenon of flooding to the Indonesian government in developing flood control policies and flood management solutions as the government of Indonesia did in formulating solutions for handling floods in Semarang. Central Java. Indonesia . Conclusions A statistical sampling threshold can be used to detect flood episodes in South Kalimantan Province of Indonesia. The statistical sample threshold showed its potential for rapid flood mapping using Sentinel-2 imagery. The second statistical sample threshold scenario could produce good results with an overall accuracy value of 73. In this study, the Otsu threshold did not provide the best accuracy. In open areas without structures, the statistical threshold 2 data and methods employed could detect flood areas fairly well. Although the accuracy test results showed good enough results for fast mapping, this method needs to be further developed, for example by considering the existing land cover types. Muhammad Priyatna et al. Acknowledgments The authors are grateful to the USGS. ESA. KLHK. BNPB. PetaBencana. id, and GEE for providing free data for this Funding This research was supported by PUTI NKB-617//UN2. RST/HKP. 00/2020 from University of Indonesia. References