1 Journal of Intelligent System and Telecommunications 1. 41-50 Journal of Intelligent System and Telecommunications Journal homepage: https://journal. id/index. php/jistel/index Predicting Regional Sanitation Conditions Using Support Vector Machines and Neural Networks Rosalia Indah Trisanti1*. Agam Nizar Dwi Nur Fahmi2 Department of Electrical Engineering. Faculty of Sains and Engineering. Universitas Bangka Belitung. Bangka. Indonesia Department of Electrical Engineering. Faculty Engineering. Universitas Muhammadiyah Jember. Jember. Indonesia *Correspondence: E-mail: 1trisantirosalia@gmail. com, 2agamnizardwi10@gmail. ARTICLE INFO ABSTRACT Article History: Submitted/Received 10 December 2024 First Revised 20 December 2024 Accepted 20 December 2024 First Available Online 21 December 2024 Publication Date 21 December 2024 Sanitation conditions are very important for an area. This is what makes an area comfortable and free from various diseases, one of which is dengue fever. This is because sanitation conditions are one of the important factors in reducing the number of dengue fever patients. This study aims to predict the level of sanitation conditions in an area using Neural Network and Support Vector Machine (SVM). This study is part of a study to determine dengue fever in the Bangka Belitung area of Indonesia. The parameters used are a learning rate of 0. 1, momentum 0. 1 and learning is carried out for 500 epochs. The results show that using the Neural Network method, an accuracy of 95% was obtained and the SVM method obtained an accuracy of 96%. Based on the results of the system, it can be concluded that the system has been able to run well. The contribution of this study is that it can be a Decision Support System in determining the sanitation level of an area, and related parties can immediately take follow-up action. ____________________ Keyword: Sanitation. Neural Network. SVM, dengue fever. Artificial Intelligence. A 2024 Tim Pengembang Jurnal UNESA Rosalia Indah Trisanti. Predicting Regional Sanitation Conditions Using Support Vector Machines and Neural Networks | 42 INTRODUCTION Dengue fever, caused by mosquito bites where the mosquito's saliva contains the dengue Dengue fever does not only occur in Indonesia, but has become a global community problem . Cases of fever are increasingly worrying and the number is increasing . Indonesia in 2022, according to the Ministry of Health of the Republic of Indonesia, cases of dengue fever reached 131,265 cases, around 40% of sufferers were children aged 0 to 14 years and around 73% of deaths due to dengue fever were children. Cases of dengue fever in Bangka Regency have increased in recent years. Based on information from the Bangka Regency Health Office, cases of dengue fever in the 2020 period were 119 cases, then increased in the 2021 period to 149 cases, then jumped to 290 cases in the 2022 period . To deal with the increase in dengue fever cases, the Bangka Regency Health Office has initiated various actions, short-term and long-term actions. In short-term actions, adult mosquito vector interventions such as fumigation or fogging are carried out, while for long-term actions, mosquito larvae control is carried out in the form of the One House One Mosquito Net Movement (G1R1J) for school children . Although various actions have been taken, dengue fever cases are still Currently, the classification of dengue fever cases in the Regency is carried out manually, namely by examining historical data from dengue fever cases that have occurred. A very important thing is the sanitation conditions of an area. Based on this, a decision support system is needed to classify the sanitation conditions of an area quickly and efficiently. In this case, it can utilize technological developments by implementing a more sophisticated classification This is also in accordance with the national strategy program for dengue control from 2021 to 2026 . , which is the development of research, studies, and innovations which are the basis for policies that can be used as a guideline to introduce more sophisticated and up-to-date forecasting methods. The research that has been done is a study on the prediction of dengue fever. The KNN method has good performance in predicting the period of hospitalization days for dengue fever patients, with an accuracy value of 65. 67% . Clustering of dengue fever patients devided into 2 clusters. The Medoids method in clustering is better than the K-Means method, because the K-Medoids method can overcome the problem of outliers in the data used . The K-Means method is effective for predicting dengue fever, 60% of this model is effective in predicting dengue fever cases with positive results . The SVM method has better performance in predicting dengue fever cases, where the accuracy value using the SVM method is higher than the Decision Tree method, which is 99% . CBR has good performance with an accuracy value of 80%, this shows that CBR can identify vital signs and blood profiles for the treatment of dengue fever at the Hospital in Negombo Sri Lanka . SVM has better performance in predicting dengue fever compared to the KNN method, with an accuracy value of 100% . The KNN model is able to predict dengue fever cases, where the MAE and RMSE values obtained in the KNN model with K = 3 are 16,637 and 29,145 . SVM has good performance in predicting dengue fever outbreaks in Selangor. Malaysia, with an accuracy value of 70% . The ANN model has better performance than Decision Tree in Predicting Dengue Fever Outbreaks Based on Meteorological Data, with an accuracy value of 68. 85% . The SVM method has good performance in predicting dengue fever, where the cross validation. K-Fold validation. Precision and F1-Score values are higher than other methods, namely 98. 5%, 98. 2% and 98% respectively . The above research discusses more about predicting dengue fever cases in various places, but there is still very limited research on the sanitation conditions of an area. Research discussing sanitation using K-Means shows a system accuracy of 0. 8645, average sensibility DOI: http://dx. org/10. x/jistel. vXiX e- ISSN x-x 43 | Journal of Intelligent System and Telecommunications. Volume 1 Issue 1. Dec 2024 pp 41-50 8775, average specificity 0. 9231, and average precision 0. The difference with this research is that the SVM and Artificial Intelligence methods are used, where previous research shows that using this method produces high accuracy. The purpose of this study is to predict the potential sanitation conditions of an area using Support Vector Machines and Artificial Neural Networks. The areas that are the objects of research are 8 Bangka Regencies. Bangka Belitung Province. Indonesia. The contribution of this research is to provide considerations or Decision Support Systems to related parties to immediately take action before cases occur. The hope is that if the sanitation of an area is good, it can at least reduce the potential for the spread of dengue fever. The originality of this research is that this measurement or prediction is usually done manually, but by using Artificial Intelligence, the prediction becomes more accurate. METHODS Research stage The research stage begins with first observing the area that is the object of research and conducting interviews with related parties. Furthermore, data collection is carried out, learning processes are carried out, testing is carried out, and results are evaluated. The learning process is carried out using Support Vector Machine and Neural Network. The input factors of the system are septic tank management, the habit of washing hands with soap, household food and drinking water management, household waste disposal management, and household liquid waste management. While the output is the level of regional sanitation conditions. The level of regional sanitation consists of levels 1 . , 2 . , and 3 . ery goo. Material The data used was obtained from the Bureau of Statistics regarding the number of culverts and rivers, population, and altitude of the are. , for data on sanitation conditions and data on the number of dengue fever patients obtained from the Bangka Regency Health Office. Bangka Belitung Province. This research was conducted from 2018 to 2022, where the data was divided into 2, namely learning data and testing data. 3 Support Vector Machine The SVM algorithm works by mapping data to a feature space so that data points can be categorized, even when the data cannot be separated linearly. If a separator between categories is successfully found, the data can be transformed in such a way that the separator can be described as a hyperplane. Then, the characteristics of the new data can be used to predict which group the new record should be in. SVM can categorize linear and non-linear data. this study, it is divided into 3 clusters, namely the sanitation level of this area consisting of level 1 . , 2 . , and 3 . ery goo. 3 Neural Network The algorithm of the Neural Network is as follows: Enter the vector X into the input layer. calculate the output of each neuron in the hidden layer and output layer. calculate the activation function in the output layer. update the weights so that the output of the ANN is consistent with the class label in the training data. 4 System evaluation DOI: http://dx. org/10. x/jistel. vXiX e- ISSN x-x Rosalia Indah Trisanti. Predicting Regional Sanitation Conditions Using Support Vector Machines and Neural Networks | 44 Evaluation process is important because it measures the performance of the proposed model and ensures how well the model is used in producing the right classification. Model performance usually uses measures such as accuracy, precision, recall and F1-Score. Precision describes the amount of data with positive categories that are correctly classified. Recall shows what percentage of positive category data is correctly classified by the system. F1-Score shows the balance between recall and precision. In this study, the matrix used to evaluate model performance is the Confusion Matrix which can be used to calculate the accuracy, precision, recall and F1-Score values . yaycaycaycycycaycayc = ycNycE ycNycA ycEycyceycaycnycycnycuycu = ycNycE ycNycE yaycE ycIyceycaycaycoyco = ycNycE ycNycE yaycA ya1 ycIycaycuycyce = 2ycu ycEycyceycaycnycycnycuycu ycu ycIyceycaycaycoyco ycEycyceycaycnycycnycuycu ycIyceycaycaycoyco Whereas: : True Possitive TN : True Negative : False Possitive FN : False Negative RESULTS AND DISCUSSION The research has been conducted, the first stage used Neural Network and the second stage used SVM. In the first study using Neural Network, and the Confusion matrix was obtained in Table 1. Table 1 explains that level 1 can be classified correctly 193 data, level 2 there are 189 data that are classified correctly, and level 3 there are 189. Table 1. Confusion Matrix using Neural Network Level 1 Level 2 Level 3 Level 1 Level 2 Level 3 DOI: http://dx. org/10. x/jistel. vXiX e- ISSN x-x 45 | Journal of Intelligent System and Telecommunications. Volume 1 Issue 1. Dec 2024 pp 41-50 The system performance is described in Table 2, where an accuracy of 0. 95 is obtained, with a precision value of 0. 95, a recall value of 0. 94, and an F1 score of 1. This fact shows that the system has been able to run well. Table 2. System performance using Neural Network Performance Value Accuracy Precision M1 Precision M2 Precision M3 Precision Recall M1 Recall M2 Recall M3 Recall F1 score Receiver operating characteristic (ROC) on the results of the testing process using Neural Network is depicted as in Figure 1. Based on this graph, calculations are carried out to calculate the Area Under The Curve (AUC). The AUC calculation obtained a value of 0. 95, based on the AUC criteria if the value is between 0. 9 and 1 then it is in the excellent criteria. So we can conclude that the system is in the excellent criteria. Figure 1. ROC using Neural Network The results of the Confusion Matrix when applying Support Vector Machine are explained in Table 3. When this method is applied, it can be seen that level 1 can correctly classify 194 data, and this is 1 data higher when compared to using Neural Network. Likewise at level 2, the data that can be correctly classified is 190 data. While at the third level that can be correctly classified is 192. This shows . n this cas. that SVM produces higher accuracy than using Neural Network. Table 3. Confusion Matrix menggunakan Support Vector Machine (SVM) Level 1 Level 1 Level 2 Level 3 Level 2 Level 3 DOI: http://dx. org/10. x/jistel. vXiX e- ISSN x-x Rosalia Indah Trisanti. Predicting Regional Sanitation Conditions Using Support Vector Machines and Neural Networks | 46 The system performance is described in Table 4, where the accuracy is 0. 96, with a precision value of 0. 96, a recall value of 0. 95, and an F1 score of 1. This fact shows that the system using SVM has also been able to run well. Based on Table 4, it also shows that the system performance . recision and recall value. tends to be higher than using Neural Network. Both systems get an F1 score of 1, because the precision and recall values have almost the same Table 4. System performance using Support Vector Machine (SVM) Performance Value Accuracy Precision M1 Precision M2 Precision M3 Precision Recall M1 Recall M2 Recall M3 Recall F1 score Receiver operating characteristic (ROC) on the results of the testing process using SVM is depicted as in Figure 2. Based on this graph, calculations are carried out to calculate the Area Under The Curve (AUC). The AUC calculation obtained a value of 0. 96, so we can conclude that the system is in the excellent criteria. Figure 2. ROC using SVM Efforts to prevent disease by removing or controlling various environmental components that are responsible for the chain of disease transmission are also called environmental sanitation . An environment with good sanitation can prevent the spread of disease, while an environment with poor sanitation can contribute to providing a habitat for the breeding of Ae. Aegypti mosquitoes. The population of Ae. Aegypti mosquitoes is often associated with poor sanitation, as well as inadequate waste management (WHO, 2. Thus, to improve environmental sanitation, the STBM program can be carried out, which is a government program to improve a clean environment. Community-Based Total Sanitation or STBM is a policy made by the government to encourage behavioral changes towards a clean and healthy DOI: http://dx. org/10. x/jistel. vXiX e- ISSN x-x 47 | Journal of Intelligent System and Telecommunications. Volume 1 Issue 1. Dec 2024 pp 41-50 life by involving active participation from individuals and the community . Some inputs from sanitation conditions are septic tank management, the habit of washing hands with soap, household food and drinking water management, household waste disposal management, and household liquid waste management. Sungailiat and Pemali areas are included in cluster 3, which is a high level of sanitation. This monitoring is very important. This is evident from the Belinyu. Riau Silip, and Mendo Barat areas from January to September being included in cluster 1 . aving a low level of sanitatio. , but its results followed up by anticipation action, from October to December they changed to being included in cluster 3, which is having a high level of sanitation. Another fact is that the Merawang and Puding Besar areas from January to September were included in cluster 2 . aving a moderate level of sanitatio. , and from October to December they were included in cluster 3, which was having a high level of sanitation. This also happened in the Bakam area from January to July it was included in cluster 2, which was having a moderate level of sanitation and from August to December it was included in cluster 3, which was having a high level of sanitation. Then the results of the sanitation level clustering in Bangka Regency in 2022 showed that all areas were included in cluster 3, which was having a high level of Research on SVM has been widely used by previous researchers . where it was carried out in the prediction of total soluble solids of industrial tomatoes. Other applications are used to predict heterogeneous machine parameters in industry 4. Based on the results above, good results were obtained. Meanwhile, the implementation of Neural Network in sanitation is to predict water pollution . , the results of this study can predict water pollution so that it can be anticipated immediately. In addition, it can model and simulate the environment . Research on sanitation using machine learning has been studied by previous researchers, the indicators used are such as washing hands before meals and drinking water from natural water sources, and sanitation facilities. The results of this study showed an accuracy of 70% . The overall system, in this case, shows that using SVM produces higher accuracy than Neural Network. But both still show good performance, when viewed from the performance of accuracy, precision, recall, and F1 score. When viewed from the AUC value, both methods have obtained excellent results, because the AUC value of the system . sing both Neural Network and SVM) is still in the range between 0. 9 and 1. CONCLUSION In the performance evaluation, it was found that the proposed model in classifying the level of sanitation has good performance. The results of the performance evaluation of the proposed model . sing Neural Networ. have values of accuracy, precision, recall and F1-Score obtained respectively of 0. 95, 0. 954, 0. 94 and 1. While the application of SVM the results of the calculation of the values of accuracy, precision, recall and F1-Score are respectively 0. 96, 0. 95 and 1. Based on this, the proposed model can be relied on in the level of regional sanitation based on septic tank management, the habit of washing hands with soap, household food and drinking water management, household waste disposal management, and household liquid waste management. Future research is proposed to explore the causes/determinants of sanitation in an area, and not only dengue fever. Then re-learning is carried out to obtain a more robust system. Other methods can also be used, with the hope of obtaining a higher accuracy value. ACKNOWLEDGMENT DOI: http://dx. org/10. x/jistel. vXiX e- ISSN x-x Rosalia Indah Trisanti. Predicting Regional Sanitation Conditions Using Support Vector Machines and Neural Networks | 48 We would like to thank the Statistics Bureau and the Health Service of Bangka Regency. Bangka Belitung Province. Indonesia, for providing data for this study. AUTHORSAo NOTE The authors declare that there is no conflict of interest regarding the publication of this The authors confirmed that the paper was free of plagiarism. AUTHORSAo CONSTRIBUTION/ROLE Rosalia Indah Trisanti1: Conceptualization. Methodology. Writing Original Draft. Investigation. Agam Nizar Dwi Nur Fahmi: Formal Analysis. Writing Original Draft. Supervision. REFERENCES