VOL 2 . NO 1 e-ISSN : 2549-9904 ISSN : 2549-9610 INTERNATIONAL JOURNAL ON INFORMATICS VISUALIZATION Neural Network for Earthquake Prediction Based on Automatic Clustering in Indonesia Mohammad Nur Shodiq#. Dedy Hidayat Kusuma#. Mirza Ghulam Rifqi#. Ali Ridho Barakbah*. Tri Harsono* # Department of Informatics. State Polytechnic of Banyuwangi. Jember Street KM13 Kabat. Banyuwangi 68461. East Java. Indonesia * Electronic Engineering Polytechnic Institute of Surabaya. Kampus ITS Sukolilo. Surabaya 60111. East Java. Indonesia E-mail: noer. shodiq@poliwangi. id, dedy@poliwangi. id, mirza@poliwangi. id, ridho@pens. id, trison@pens. AbstractAi A model of artiAcial neural networks (ANN. is presented in this paper to predict aftershock during the next five days after an earthquake occurrence in selected cluster of Indonesia with magnitude equal or larger than given threshold. The data were obtained from Indonesian Agency for Meteorological. Climatological and Geophysics (BMKG) and United States Geological SurveyAos (USGS). Six cluster was an optimal number of cluster base-on cluster analysis implementing Valley Tracing and Hill Climbing algorithm, while Hierarchical K-means was applied for datasets clustering. A quality evaluation was then conducted to measure the proposed model performance for two different thresholds. The experimental result shows that the model gave better performance for predicting an aftershock occurrence that equal or larger than 6 RichterAos scale magnitudes. KeywordsAi Artificial neural networks, earthquake prediction, cluster analysis INTRODUCTION data is filtered using magnitude of completeness of 5. Richter magnitude scale . This study proposed an ANN model to predict an earthquake during the next five days after an earthquake occurrence with magnitude equal or larger than given threshold for a selected cluster in all region of Indonesia. Aftershock prediction can be used by the authorities to deploy precautionary policies. Indonesia has a high risk to geological disasters as it lies between three active plates of the world: the Eurasian. IndoAustralian and Pacific plates. Therefore, a system that is able to predict the earthquake will greatly help to minimize the risk of losses that arise. A series of earthquakes are not randomly formed, but follow a spatial pattern with a trigger that results in an earthquake event. In other words, natural disasters . , earthquake. rarely appear on their own, but instead they tend to form a complex network of interacting faults as in . However, an earthquake prediction should state where, when, how big, and how probable the predicted event is . Several studies applied artificial neural networks (ANN) for earthquake prediction as in . Seismic clustering is the first stage to analyze earthquake risk which based on a variety of seismology criteria. using a clustering scheme, it is possible to retrieve spatiotemporal pattern that created by events. Although the automatic identification of the optimal number of clusters on seismic data is a very difficult problem in the process of data clustering . , but the optimal number of clusters can be determined by measuring variance within and variance between each cluster, this is known as cluster analysis . The data used are seismic data of all regions of Indonesia obtained from BMKG and USGS in year 1910 to 2017. This II. RESEARCH METHOD This section describes the system design of earthquake Figure 1 illustrates the overall process. Data Acquisition The earthquake data covers the entire territory of Indonesia, which is the boundary 5. 98 north latitude - 11 south latitude and east longitude 94. 12 Ae 140. 98 east The earthquake dataset in this research is taken from catalogue of Agency Meteorology. Climatology and Geophysics (BMKG) and United States Geological Survey (USGS) from 1910 to 2017. The dataset consists of 82,580 seismic events with magnitude 1 - 10 ML, and depth of 0650 km. implemented to select the optimal number of clusters . analyzes the moving variance of clusters for each stage of cluster construction and observes the pattern to find the global optimum as well as to avoid the local optima. The algorithm to find the optimal number of clusters . is described as follows: Set as each data of A, where A is attribute of ndimensional vector. Set K as the predefined number of clusters Apply clustering algorithm with number of clusters Record Increment j=j 1 Repeat from step 3 while j