J o u r n a l o f I n f o r m a t i o n S y s t e m a n d T e c h n o l o g y R e s e a r c h Volume 1. Issue 2. May 2022 Journal of Information System and Technology Research journal homepage: https://journal. id/index. php/jistr/ Data Mining Grouping Of Drug Users By Age Using Clustering Method (Case Study: BNN Binjai Cit. Rizki Putri Ananda*. Budi Serasi Ginting*. Tio Ria Pasaribu* Progam Studi Sistem Informasi STMIK KAPUTAMA Binjai Jl. Veteran No. 4A-9A. Binjai 20714. North Sumatra ARTICLE INFO ABSTRACT Article history: Drug trafficking and abuse is a very complex problem, which requires efforts to overcome it. Given that there are still many obstacles in the process of grouping drug users at the Binjai City BNN Office, for this reason the author tries to create a system to support a computerized grouping process that can help automatically classify drug users based on age, so there is an opportunity to design a grouping data mining system in it. Data mining is part of a computer-based information system that employs one or more computer learning techniques to analyze and extract knowledge automatically that is used to support grouping within an organization or a company. Clustering is a method that is applied to create a grouping data mining system to make it easier for staff to classify drug users based on age. Based on the analysis that has been done on grouping drug user data using the clustering, it is necessary to do the cluster several times to get the same results according to the first In this process, the process is carried out 10 times to obtain cluster. In cluster 1 which is 3 9 4, cluster 2 is 3 1 4, cluster 3 is 3 5 4 with the number of members of cluster 1 as much as 322 data, cluster 2 as much as 81 data and cluster 3 as much as 97 data. Received Apr 1, 2022 Revised May 4, 2022 Accepted Apr 28, 2022 Available online May 31, 2022 Keywords: Data Mining Clustering Drag User Age A 2022 The Author. Published by AIRA. This is an open access article under the CC BY-SA license . ttp://creativecommons. org/licenses/by-sa/4. 0/). Corresponding Author: Rizki Putri Ananda Progam Studi Sistem Informasi STMIK KAPUTAMA Binjai Jl. Veteran No. 4A-9A. Binjai 20714. North Sumatra Email: rizkiputriananda16@gmail. INTRODUCTION National Narcotics Agency (BNN) is an institution Non-Ministry Government (LPNK) Indonesia on duty to coordinate agencies government in the field of prevention, eradication of abuse and narcotics trafficking (P4GN). BNN is led by a head directly responsible to the President. Along with the times, the demands of life encourage a variety of crimes committed by One of the criminal acts that often occur today is the abuse of narcotics which is quite common among the public, even the current circulation of narcotics does not look at who is the user, regardless of the position, education and background of the person, ranging from ordinary people to officials. countries, adults and even students are involved in drug abuse. Therefore the problem that now it has reached an alarming situation is the increasing number of narcotics abuse, so that it becomes an urgent state problem. The motivations and causes why people take these drugs can vary, among others, as an act of rebellion due to rejection by the environment such as feelings of inferiority, a broken family background, heartbreak, as an action to reduce stress and depression, just trying to get a comfortable and pleasant feeling or as an action to escape from the realities of . Ae. Many factors cause rampant multiple drug abuse this year, which increased rapidly, especially in North Sumatra, namely Binjai City. From data abuse and circulation narcotics in the Binjai area show a tendency increase from year to year. Of course, the question is how the Agency's efforts National Narcotics (BNN) in solve this problem, because with a generation whose morals have been broken it is impossible to form good nation. https://doi. org/xx. x/x. A 2022 The Author. Published by AIRA. This is an open access article under the CC BY-SA license . ttp://creativecommons. org/licenses/by-sa/4. 0/). P ISSN 2828-3864. E ISSN: 2828-2973 JISTR. Volume 1. Issue 2. May 2022 Law Number 22 Years 1997 concerning Narcotics regulates efforts to eradicate crime narcotics crime through threats fine, imprisonment, criminal life imprisonment, and the death penalty. In In addition. Law No 22 of 1997 also regulates regarding the use of narcotics for medical purposes and health and regulates medical and social rehabilitation. The results of this stud y Hopefully the National Narcotics Agency can find out the region and type most drugs used so that it can take policies to address drug abuse, in the form of anti-drug counseling to the community, and also create a clean village program drugs . in the fight drugs through prevention, eradication, abuse and narcotics trafficking. Based on the research of Sri Rahayu. Dodon T. Nugrahadi, and Fatma Indriani. Lambung Mangkurat University (Science with the title "Clustering Determination of Regional Crime Potential in Banjar Baru City Using the K-Means Method" concluded that "In ClusteringThe grouping of potential regional crimes in the city of Banjarbaru is processed based on alignment. This is done so that the results of the clustering/ grouping of potential regional crimes in the city of Banjarbaru become more specific. Ay Based on Roy Setiawan's research at the Jakarta LP3I Polytechnic Lentera ICT with the title "Implementation of Data Mining Using the KMeans Clustering to Determine New Student Promotion Strategies" concluded that the profile generated from data mining with the K-Means algorithm Clustering can provide knowledge to determine promotion strategies at the LP3I Jakarta Polytechnic. The knowledge obtained from the results of clustering is that prospective students are dominated by low and middle economic . Ae. RESEARCH METHOD The research method is carried out to find something in a systematic way using scientific methods and applicable sources. In the process of this research, it is shown to provide more meaningful results for the National Narcotics Agency (BNN) of Binjai City in classifying drug user data by age using the clustering so as to provide better results. Based on the research method used, the flow of activities is made as follows: Formulation of the problem Theoritical review Data collection Data analysis Testing and Implementation Evaluation Figure 1 Research Workflow Based on the picture above, it can be seen that there are several stages in completing the research, namely: Problem Identification This stage is the initial stage in research, namely by determining the background of the problem, objectives and benefits so as not to get out of the problem and discussion or preparation of the thesis made. Theoretical Study This stage is looking for information, sources related to the problems faced, both from literature studies, journals and the internet as a support and basic foundation for thesis writing. Data collection This stage is the collection of data needed in making a thesis such as interviews, observations which can then be processed to the next stage. Data analysis P ISSN 2828-3864. E ISSN: 2828-2973 JISTR. Volume 1. Issue 2. May 2022 This stage is the stage of managing and analyzing the data that has been obtained so that the data can be grouped according to predetermined variables. Testing and implementation This stage is the stage that performs the validation and implementation of the data that has been previously analyzed and the preparation of the program. Evaluation This stage is the stage of taking conclusions and suggestions that can be made in the preparation of the thesis. With the conclusion, the results of the entire thesis will be known and it is hoped that with suggestions there will be improvements and benefits for . , . , . RESULTS AND DISCUSSION The following will explain the results and discussion that has been obtained : The following displays in tables 1 and 2 are the results of the iteration calculation Then form a cluster into 3 groups (K=. and determine the center point of the centroid. The clustering calculation process is as follows. K=3 Centroids C1= . ,9,. taken from R . C2= . ,8,. taken from S . C3= . ,9,. taken from T . Iterasi 1 : A . ,9,. C1=. ,9,. = Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( B . ,8,. C1=. ,9,. =Oo( C2= . ,8,. = Oo( C3=. ,9,. =Oo( C . ,7,. C1= . ,9,. = Oo( C2= . ,8,. = Oo( C3= . ,9,. = Oo( D . ,9,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo E . ,9,. C1= . ,9,. = Oo( C2= . ,8,. = Oo( C3=. ,9,. =Oo( F . ,9,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3= . ,9,. = Oo( G . ,6,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( H . ,10,. 1,41 2,45 2,24 2,45 2,24 2,24 4,12 1,41 4,24 4,12 1,41 2,83 1,41 4,36 4,58 5,10 P ISSN 2828-3864. E ISSN: 2828-2973 JISTR. Volume 1. Issue 2. May 2022 C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3= . ,9,. = Oo( I . ,5,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( J . ,6,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( K . ,9,. C1= . ,9,. = Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( L . ,8,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo M . ,9,. C1=. ,9,. =Oo( C2=. ,8,. =Oo( C3= . ,9,. = Oo( N . ,10,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( O . ,1,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( P . ,1,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( Q . ,1,. C1=. ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( R . ,9,. C1= . ,9,. =Oo( C2=. ,8,. = Oo( C3=. ,9,. =Oo( S . ,8,. C1=. ,9,. =Oo( C2= . ,8,. = Oo( C3= . ,9,. = Oo( T . ,9,. C1=. ,9,. =Oo( 3,74 4,24 3,74 4,58 4,69 2,45 3,32 4,58 2,45 1,41 1,73 2,24 4,12 3,32 2,45 8,49 8,60 9,43 8,49 8,60 9,43 9,43 7,14 8,12 3,74 3,61 3,74 3,61 P ISSN 2828-3864. E ISSN: 2828-2973 JISTR. Volume 1. Issue 2. May 2022 C2= . ,8,. = Oo( C3= . ,9,. = Oo( Table 1. Iteration I Calculation Results Alternatif 1,41 2,45 3,00 1,41 5,00 2,83 4,36 3,74 4,24 4,69 0,00 2,45 2,24 3,32 8,49 8,49 9,43 0,00 3,74 3,61 2,45 2,00 2,24 4,24 0,00 1,41 4,58 2,00 3,74 2,45 3,74 1,41 4,12 3,00 8,60 8,60 7,14 3,74 0,00 1,00 2,24 2,24 2,83 4,12 1,41 1,00 5,10 1,00 4,58 3,32 3,61 1,73 4,00 2,45 9,43 9,43 8,12 3,61 1,00 0,00 Next is to calculate the ratio between Between Cluster Variation (BCV), because the centroid C1 = . ,9,. C2 = . ,8,. C3=. ,9,. then the BCV calculation is as follows. ) =oc( BCV )=Oo( )=Oo( )=Oo( = d( ) d( ) d( = 3,74 3,61 1 = 8,35 3,74 3,61 So that the value of the BCV ratio is 8. Then do the calculations for iteration II, before doing the calculations for iteration II, it is necessary to make the centroid center of the three clusters first. Iteration 2: K = 3 Centroid 1 Group 1 C1=( ) = 3,83 ) = 6,63 ) = 5,75 = ( 3,83 6,63 5,. Centroid 2 Group 2 C2 =( )= 2,88 )= 6,50 ) = 2,75 P ISSN 2828-3864. E ISSN: 2828-2973 JISTR. Volume 1. Issue 2. May 2022 = . ,88 6,50 Centroid 3 Group 3 C3= ( ) = 2,75 ) = 9,50 ) = 2,75 = . ,75 9,50 2,. So K=3 Centroid C1 = ( 3,83 6,63 5,. C2 = . ,88 6,50 2,. C3 = . ,75 9,50 2,. The results of the calculations, the results of the iteration 2 calculations are obtained, as shown in the table below. Table 2. Iteration II Calculation Results Alternatif 2,84 3,98 3,22 2,31 1,96 1,97 1,90 1,35 2,81 2,42 4,25 3,52 5,55 0,00 1,97 3,69 2,52 0,61 1,76 3,40 4,83 5,08 3,58 0,94 2,42 2,25 4,83 3,25 1,04 3,59 2,84 3,98 3,22 3,10 2,47 4,13 5,80 5,80 7,54 2,84 4,04 4,48 1,89 4,10 3,82 6,95 6,95 5,84 3,98 1,68 2,61 1,97 3,30 1,54 9,51 9,51 8,71 3,22 1,70 0,94 Grup After calculating using the existing cluster formula, iteration 2 occurs the same as iteration 1 and no data moves groups again so that the calculation can be stopped. So that a cluster graph of drug use data can be made based on age. 1 Graph Clustering Create a cluster graph based on the calculations that have been done. The graphs obtained are as follows. , . P ISSN 2828-3864. E ISSN: 2828-2973 JISTR. Volume 1. Issue 2. May 2022 Z : Jenis Narkoba Y : Pekerjaan X : Usia 9 10 11 12 Figure 2. Graph of Drug Users Grouping Cluster From 20 data obtained 3 groups. Cluster 1 has 6 data. Cluster 2 has 7 data, and Cluster 3 there are 7 data. Cluster 1 There are 6 Data 3. It can be seen that in cluster 1 there are users aged 36-45 years with civil servant jobs and the types of drugs used are methamphetamine and marijuana. Cluster 2 There are 7 data 2. It can be seen that in cluster 1 there are users aged 26-35 years with student jobs and the type of drug used is Inex. Cluster 3 There are 7 data 2. It can be seen that in cluster 1 there are users aged 26-35 years with self-employed jobs and the type of drug used is Inex. 2 Discussion In designing the system, the author builds a system that has been done manually into a computerized system with the title dat a mining grouping drug users based on age using the clustering at the Binjai City National Narcotics Agency. The program created using Matlab GUI software and the database used is Microsoft Excel. This system will process the inputted data which then produces new information which is expected to be useful information, especially about drug users. The test will be carried out by processing the system input data, the data grouping process to be carried out is as follows: Clustering Data Stages of grouping Drug User data can be done on the "PROCESS" menu. The following is the process of grouping JHT data with a system that has been designed with the application of the Clustering using the MATLAB R2014a programming application: Button Import Data The import is used to import data files in Microsoft excel which will be processed using the clustering. With the button import data, the data will appear in the table and will provide information on the amount of data and the inputted The following is a page display after the data has been successfully input into the system: Figure 3. Display After Data Input on the System Button Cluster Process After the data import process has been P ISSN 2828-3864. E ISSN: 2828-2973 JISTR. Volume 1. Issue 2. May 2022 carried out, the next step is to process the cluster to be processed, by selecting the number of cluster, for example 2 or 3 clusters. Then click button process cluster then the data will be processed, wait until the process is complete then the drug use grouping information data will appear. The following is the display after the data grouping process on the system: Figure 4. Display after the Data Grouping Test Process From the results above, each group can be added up as follows: Group 1 A total of 322 data Group 2 A total of 81 data Group 3 A total of 97 data Total data = 500 data Results of the Data Grouping Process The results of the grouping of data that provide information in the form of coordinates for data grouping and the cluster , the display of the results for grouping the data is as follows: Figure 5. Graph of Central Data Grouping Process: C1 : 3,3043 9,0373 3,8727 . C2 : 2,5062 1,284 3,9506 . C3 : 2,6598 5,4536 3,866 . Information : C1 = 3 9 4 Based on the calculations above, it can be seen that cluster 1 is centered on centroid with an age group (X) of 26-35 years, with a job (Y) being self-employed, and the type of drug (Z) being shabu. C2 = 3 1 4 Based on the calculations above, it can be seen that cluster 2 is centered on centroid with an age group (X) of 26-35 years, with a job (Y) that is not working, and the type of drug (Z) is methamphetamine. C3 = 3 5 4 Based on the calculations above, it can be seen that clusterr 3 is centered on centroid with an age group (X) of 26-35 years, with a job (Y) being private employees, with a type of drug (Z) namely crystal meth. Based on the analysis that has been done on the grouping of drug user data method clustering, it is necessary to do the cluster several times to get the same results according to the first process. In this process, the process is carried out 10 times so that the results 1 are of cluster 3 9 4, cluster 2 is 3 1 4 and cluster 3 is 3 5 4 with the number of members of cluster 1 as much as 322 data, cluster 2 as much as 81 data and cluster 3 with 97 P ISSN 2828-3864. E ISSN: 2828-2973 JISTR. Volume 1. Issue 2. May 2022 CONCLUSION From the results of the analysis based on the clustering of data on grouping drug users based on age, a conclusion can be drawn, from 500 data there are 3 clusters which are grouped. In cluster 1 which is 3 9 4, cluster 2 is 3 1 4, cluster 3 is 3 5 4 with the number of cluster members as much as 322 data, cluster 2 is 81 data and cluster 3 is 97 data. Based on the cluster process that has been carried out, the data grouping of drug users is based on the type of age at C1 = 3, 9, 4 with the age of 2635 years old with self-employed work and with the type of narcotic, namely shabu. C2 = 3, 1, 4 with the age of 26-35 years, with a job that has not worked and with the type of drug, namely methamphetamine. C3 = 3. 5, 4, namely with the age of 2635 years with a private employee and with the type of narcotic, namely shabu. With data on drug users, it is known that the age of drug users is from the age of 26-35 years with self-employed jobs and types of shabu drugs. REFERENCES