Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. Sentiment Classification of Climate Change and Tourism Content Using Support Vector Machine Yerik Afrianto Singgalen Faculty of Business Administration and Communication. Tourism Study Program. Atma Jaya Catholic University of Indonesia. Jakarta. Indonesia Email: yerik. afrianto@atmajaya. Submitted: 04/02/2024. Accepted: 20/02/2024. Published: 21/02/2024 AbstractOeThis research aims to classify public sentiment regarding the issue of climate change and tourism. The research problem addressed in this study pertains to the classification of public sentiment concerning climate change within the tourism sector. Specifically, the study aims to explore and classify the public's sentiments regarding the impact of climate change on tourism activities. The methodology employed is CRISP-DM, which encompasses stages of business understanding, data understanding, modeling, evaluation, and deployment. Specifically, the SVM and SMOTE algorithms are utilized in the modeling stage to achieve optimal results. By leveraging this systematic approach and advanced algorithms, the study seeks to comprehensively analyze public sentiment towards climate change within the context of tourism, thus contributing valuable insights to academia and industry practitioners. Applying CRISP-DM methodology coupled with SVM and SMOTE algorithms enhances the rigor and effectiveness of sentiment analysis in addressing the complexities of climate change discourse in the tourism sector. The findings of this research demonstrate that the SVM and SMOTE algorithms yield promising results in sentiment classification, with an accuracy of 86. 15% /- 1. 68% . icro average: 86. 15%), precision of 17% /- 2. 16% . icro average: 85. 11%) . ositive class: Positiv. , recall of 87. 64% /- 3. 39% . icro average: 87. ositive class: Positiv. , f_measure of 86. 34% /- 1. 79% . icro average: 86. 35%) . ositive class: Positiv. , and AUC of 923 /- 0. icro average: 0. ositive class: Positiv. These metrics indicate the effectiveness and reliability of the SVM and SMOTE algorithms in accurately classifying sentiment toward climate change in the context of tourism. The high accuracy, precision, recall, f_measure, and AUC scores suggest that the models produced by these algorithms are robust and capable of capturing nuanced sentiment patterns, thereby contributing to the advancement of sentiment analysis techniques in climate change research within the tourism domain. Keywords: Sentiment. Classification. Climate Change. Tourism. Support Vector Machine INTRODUCTION The impact of climate change on tourism has emerged as a pivotal subject warranting comprehensive scrutiny. The overarching issue lies in its profound alterations on natural landscapes and ecosystems, directly affecting tourist destinations worldwide . Furthermore, the escalating frequency and intensity of extreme weather events exacerbate these changes, posing significant challenges to the sustainability and resilience of tourism activities . Consequently, there is an urgent need for thorough investigation and analysis to understand the multifaceted implications of climate change on tourism . Such an endeavor is imperative for devising effective mitigation and adaptation strategies to safeguard the future viability of the tourism industry and the socio-economic well-being of communities reliant on it . The issue of climate change has pervaded various digital media platforms, offering widespread accessibility to users of digital applications. This proliferation serves as a digital campaign to heighten awareness regarding climate change and its repercussions on human life . As information dissemination becomes increasingly digitized, it facilitates the dissemination of knowledge and empowers individuals to engage with climate-related issues more actively . Consequently, this digital campaign is crucial in fostering a collective understanding of the urgency to address climate change . It underscores the importance of concerted efforts toward mitigating its adverse effects. The issue of climate change exerts a profound influence on coastal and maritime tourist destinations, significantly impacting the sustainability of local tourism. The primary concern lies in the escalating sea levels, changing weather patterns, and the increasing frequency of extreme events, collectively threatening coastal areas' ecological integrity and aesthetic appeal . These impacts, in turn, jeopardize maritime tourism's overall allure and economic viability . Consequently, it is imperative to recognize the intricate interplay between climate change and coastal tourism, emphasizing the urgency of implementing sustainable practices to mitigate environmental degradation and ensure the long-term resilience of local tourism economies . Given these challenges, a comprehensive and strategic approach is required to safeguard these valuable destinations, promoting a harmonious coexistence between tourism development and environmental conservation. The inception of this research emanates from the imperative to address the multifaceted challenges associated with sentiment classification in the context of climate change and its implications for the tourism The growing significance of sustainable tourism and the need to comprehend public perceptions on climate change impact have underscored the necessity for a nuanced sentiment analysis approach. The limited extant research on sentiment classification within the tourism domain prompted this investigation to fill the existing gap in scholarly discourse. The complexities of sentiment patterns related to climate change necessitated a comprehensive examination, prompting the adoption of advanced methodologies such as CRISP-DM, along Copyright A 2024 the auhor. Page 357 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. with the utilization of SVM and SMOTE algorithms for optimal classification outcomes. In conclusion, the research is propelled by the exigency to contribute substantively to the understanding of public sentiment regarding climate change within the pivotal domain of tourism. The urgency of this research underscores the significance of digital climate change campaigns and their impact on tourism, particularly in shaping public responses to the dangers of climate change and the comfort of tourist experiences. Digital campaigns are vital tools for disseminating information, raising awareness, and mobilizing public opinion on pressing environmental issues such as climate change . By leveraging digital platforms, these campaigns can effectively reach diverse audiences, fostering a widespread understanding of the threats posed by climate change to tourist destinations . Moreover, they influence tourists' perceptions and behaviors, encourage environmentally responsible practices, and promote sustainable tourism . Thus, conducting research in this area is essential for elucidating the efficacy of digital campaigns in shaping public attitudes towards climate change and fostering sustainable tourism practices, ultimately contributing to the preservation of natural resources and the long-term viability of tourism industries worldwide. The practical implications of this research emphasize the importance of collective public action in response to the issue of climate change through digital campaigns aimed at enhancing tourists' and local communities' understanding of anticipatory measures against climate change-induced disasters in tourism. harnessing digital platforms for advocacy and information dissemination, these campaigns have the potential to mobilize widespread support and engagement, fostering a shared sense of responsibility toward mitigating the adverse impacts of climate change on tourist destinations . Moreover, they facilitate the communication of proactive measures and adaptation strategies, empowering tourists and residents to make informed decisions and take necessary actions to safeguard the resilience and sustainability of tourism in the face of climate-related challenges . In essence, this research underscores the pivotal role of digital campaigns in promoting collective action and building resilience within the tourism sector, thereby contributing to the long-term preservation of natural and cultural heritage for future generations. The theoretical implications of this study underscore the significance of collective awareness regarding the impacts of climate change on livelihoods and ecologies in tourist destinations. The research contributes to the broader understanding of the interconnectedness between environmental sustainability, socio-economic wellbeing, and tourism development by delving into this aspect . The study illuminates the intricate dynamics at play through theoretical exploration and empirical analysis, shedding light on how climate change-induced disruptions can reverberate throughout local communities and ecosystems reliant on tourism activities . This deeper understanding enriches theoretical frameworks in tourism studies and informs policy-making and practical interventions to foster resilience and adaptive capacity within destination communities . conclusion, the theoretical implications of this research underscore the imperative of addressing climate change as an integral component of sustainable tourism development, thereby safeguarding the livelihoods and ecologies that underpin the viability of tourist destinations. The limitation of this research lies in the methodology employed, precisely the employment of sentiment classification utilizing the Cross-Industry Standard Process for Data Mining (CRISP-DM) approach and the Support Vector Machine (SVM) algorithm with the Synthetic Minority Over-Sampling Technique (SMOTE) operator . Ae. While these methods offer valuable insights into understanding public sentiment towards climate change campaigns in tourism, they also pose certain constraints. Although widely used in data mining, the CRISP-DM approach may overlook nuances and contextual complexities inherent in sentiment analysis. Additionally, while SVM with SMOTE effectively handles imbalanced datasets, its reliance on predetermined parameters and assumptions may limit the generalizability of findings . Therefore, future research endeavors should explore alternative methodologies or integrate complementary approaches to address these limitations Further research recommendations should focus on comparing algorithms for sentiment classification of climate change content and its impacts on the tourism industry. This comparison would involve assessing the efficacy and accuracy of different algorithms, such as Support Vector Machine (SVM). Naive Bayes. Decision Trees, and Neural Networks, in analyzing sentiments expressed in digital content related to climate change and By conducting comparative analyses, researchers can identify the most suitable algorithm or combination of algorithms for accurately classifying sentiment about climate change and tourism in diverse Such investigations would enhance methodological rigor and contribute to developing robust analytical frameworks for understanding public perceptions and responses to climate change within the tourism context, thereby informing more effective strategies for sustainable tourism development. RESEARCH METHODOLOGY 1 Research Gap and Trends Mapping: Climate Change and Tourism In the process of identifying research gaps, this study utilized VOSviewer as a tool to map journals focusing on the topics of climate change and tourism. VOSviewer, a powerful bibliometric software, enables researchers to visually represent and analyze the relationships between scholarly publications based on citation patterns. Copyright A 2024 the auhor. Page 358 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. keywords, and co-authorship networks. By employing VOSviewer, the study effectively identified vital thematic areas, emerging trends, and potential gaps within the existing literature on climate change and tourism. This methodological approach provides valuable insights into the current state of research and facilitates identifying areas that warrant further investigation and exploration. Thus, leveraging VOSviewer in research endeavors contributes to a more systematic and comprehensive understanding of the research landscape, ultimately enhancing the quality and relevance of scholarly contributions in the field. Figure 1. Network. Overlay, and Density Visualization Based on the results of the research gap identification, it is evident that studies focusing on sentiment analysis of climate change and tourism content are relatively scarce, underscoring the importance of conducting comprehensive investigations utilizing the CRISP-DM methodology alongside the SVM and SMOTE The limited attention to this specific intersection between climate change and tourism highlights a significant gap in the existing literature, necessitating a more thorough examination to elucidate public perceptions and responses in this domain. Researchers can gain deeper insights into sentiment patterns by employing robust methodologies such as CRISP-DM and leveraging advanced algorithms like SVM and SMOTE, thus enriching our understanding of the complex interactions between climate change discourse and tourism practices. In conclusion, addressing this research gap holds considerable potential for advancing knowledge and informing evidence-based strategies to foster sustainable tourism development in the face of climate change challenges. 2 Cross-Industry Standard Process for Data Mining (CRISP-DM) The method employed in this research is the CRISP-DM, which comprises business understanding, data understanding, modeling, evaluation, and deployment stages. CRISP-DM offers a systematic framework for conducting data mining projects, guiding researchers through essential steps from understanding the business objectives to deploying the final model. By adhering to the CRISP-DM methodology, the research ensures a structured and comprehensive approach to analyzing sentiment toward climate change and tourism. This methodological rigor enhances the reliability and validity of the study's findings, contributing to a robust understanding of public sentiment in this critical area of inquiry. Figure 1. Implementaiton of CRISP-DM and SVM Algorithm Copyright A 2024 the auhor. Page 359 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. In the business understanding stage, the research utilizes videos from YouTube, focusing on climate change, as the primary data. In the data understanding stage, user reviews from the YouTube platform are employed as textual data to be classified and analyzed based on positive and negative sentiments. Subsequently, the SVM algorithm and SMOTE operator are employed in the modeling stage to generate a confusion matrix. This structured approach ensures a comprehensive understanding of public sentiment towards climate change conveyed through digital media, facilitating rigorous analysis and interpretation of the research findings. In the evaluation stage, the confusion matrix values concerning accuracy, precision, recall. Area Under Curve (AUC), and f-measure are assessed to measure the best performance of the SVM algorithm. Meanwhile. SMOTE addresses the issue of data imbalance that impacts the algorithm's performance. This comprehensive evaluation ensures the robustness and reliability of the sentiment analysis outcomes, providing valuable insights into public perceptions of climate change within the tourism context. In the data understanding phase, textual data was sourced from public comments on video content published by National Geographic, identified by the video ID G4H1N_yXBiA, comprising a total of 3495 The selection of this specific dataset is grounded in the richness and diversity of insights that public comments on National Geographic videos offer, reflecting a broad spectrum of perspectives on climate change within the context of the content disseminated by a renowned scientific and educational platform. Analyzing this dataset provides a unique opportunity to glean valuable insights into public sentiment, contributing to a more comprehensive understanding of the nuanced discourse surrounding climate change as presented by National Geographic. In conclusion, the chosen dataset aligns with the research objective, allowing for an in-depth exploration of sentiment patterns within a contextually relevant and substantial pool of public commentary on climate change. During the modeling phase, a dataset consisting of 3495 textual comments was extracted using the Extract Sentiment function in the RapidMiner application. Subsequently, the data underwent classification based on positive and negative sentiment scores. Following this, performance testing of the SVM algorithm with and without SMOTE was conducted to measure accuracy, precision, recall, f-measure, and AUC. This meticulous approach in the modeling phase reflects a rigorous and systematic effort to analyze sentiment patterns within the dataset, utilizing advanced algorithms to ascertain the effectiveness of SVM in sentiment classification, particularly in the presence of imbalanced data. The comparison of SVM performance with and without SMOTE provides valuable insights into the impact of addressing class imbalance on the algorithm's effectiveness. conclusion, the modeling phase serves as a pivotal stage in evaluating the robustness of sentiment analysis techniques in discerning nuanced sentiment patterns related to climate change discourse within the tourism In the evaluation phase, the performance metrics, namely accuracy, precision, recall, f-measure, and AUC, of the SVM algorithm were systematically analyzed. The primary focus was on discerning the algorithm's effectiveness in accurately classifying sentiment within the dataset derived from public comments on National Geographic videos pertaining to climate change. The meticulous examination of these metrics provides a comprehensive understanding of the algorithm's proficiency in capturing both positive and negative sentiments. The AUC, in particular, serves as an essential indicator of the algorithm's discriminatory ability. This evaluative process contributes critical insights into the reliability and efficacy of the employed SVM algorithm, offering a robust foundation for interpreting the sentiment analysis results and advancing the understanding of public sentiment towards climate change within the tourism context. In the deployment phase, a strategic analysis was undertaken to respond to the outcomes of the performance evaluation. The primary objective was to translate the insights derived from the sentiment analysis of public comments on National Geographic videos into actionable strategies. This involved delineating effective approaches for addressing the sentiments expressed by the audience regarding climate change within the tourism The strategic deployment phase is pivotal in ensuring that the research findings are not only academically robust but also have practical implications. By aligning the sentiment analysis results with strategic responses, the deployment phase aims to contribute to informed decision-making processes and facilitate proactive measures within the tourism industry to address public concerns related to climate change. conclusion, this phase serves as a bridge between research outcomes and practical applications, emphasizing the significance of translating academic insights into tangible strategies for real-world impact. 3 Support Vector Machine (SVM) The SVM algorithm possesses distinct advantages and is highly relevant for sentiment classification based on Its ability to effectively handle high-dimensional data and nonlinear relationships makes it a preferred choice for sentiment analysis tasks. Additionally. SVM's robustness in handling binary and multiclass classification problems further enhances its applicability in diverse scenarios. The algorithm's versatility and proven performance in various research domains underscores its relevance and utility in sentiment analysis In conclusion, leveraging SVM algorithms offers a powerful approach to accurately classify sentiment based on dataset characteristics, contributing to advancements in sentiment analysis methodologies. Meanwhile, the regression function of the SVM method is as follows. Copyright A 2024 the auhor. Page 360 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. ( ) Where ( ) is the decision function. W is the weight vector perpendicular to the hyperplane. X is the input feature vector and B is the bias term. In the case of binary classification, the class label of a data point can be determined by the sign of ( ) ( ) ( ) . In non-linearly separable cases. SVM utilizes a kernel function K( ) to map the input feature vectors into a higher-dimensional space where the data becomes linearly separable. The decision function then becomes: ( ) Where are the Lagrange multipliers obtained during training. The strengths and weaknesses of the SVM algorithm when employing the SMOTE operator provide valuable insights into its effectiveness in handling imbalanced datasets. One of SVM's notable advantages is its ability to effectively handle nonlinear classification tasks, making it suitable for sentiment analysis applications. In addition. SVM's margin maximization principle helps improve generalization performance, enhancing its robustness in handling complex However, when combined with the SMOTE operator. SVM may face computational complexity and scalability challenges, particularly with large-scale datasets. Furthermore, the effectiveness of SMOTE in addressing class imbalance relies heavily on the choice of parameters, which may affect the algorithm's performance and require careful optimization. In conclusion, while SVM with SMOTE offers benefits in addressing data imbalance, carefully considering its limitations and parameter tuning is essential to maximize its efficacy in sentiment analysis tasks. RESULT AND DISCUSSION Climate change is an issue of paramount importance that elicits both proponents and detractors . Nevertheless, the impact of climate change on the sustainability of the tourism industry cannot be overstated . This sector relies heavily on environmental stability and favorable weather conditions to thrive . Consequently, any alteration in climate patterns significantly threatens its operation and profitability . such, addressing climate change is not merely an environmental concern but also a critical economic imperative for the tourism industry and the broader global economy . The digital campaign on climate change has garnered a myriad of both support and opposition. In recent years, digital platforms have become instrumental in raising awareness about environmental issues, including climate change, reaching diverse audiences globally . However, despite its potential to mobilize widespread action, digital campaigns often encounter criticism regarding their effectiveness in driving tangible change . Critics argue that the virtual nature of these campaigns may dilute the sense of urgency or lead to "clicktivism" superficial engagement without meaningful impact . Nevertheless, proponents emphasize the power of digital platforms to amplify voices, connect activists, and pressure policymakers, thereby catalyzing collective action on climate change . In conclusion, while the digital campaign landscape offers vast opportunities for climate advocacy, its effectiveness ultimately hinges on strategic implementation and concerted efforts to translate online activism into real-world solutions. Figure 2. Number of Post Overtime Based on public responses to content related to climate change, it is evident that public perceptions and reactions are highly diverse. The dissemination of information and discourse surrounding climate change often elicits a spectrum of reactions ranging from concern and urgency to skepticism and denial. While some segments Copyright A 2024 the auhor. Page 361 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. of the population strongly support climate action, others express skepticism toward the severity or even the existence of climate change. This variation in public perception underscores the complexity of addressing climate change and highlights the importance of nuanced communication strategies tailored to different In conclusion, understanding and navigating the diverse responses to climate change content is crucial for effective communication and fostering widespread engagement in climate mitigation and adaptation efforts. Figure 2. Most Frequently Used Words and Top Ten Posters Based on the results of keyword identification from review data, the Netlytics website indicates that frequently occurring words include "climate," "change," "earth," "global," "warning," "people," "school," "CO2," "years," and "video. " This comprehensive list reflects the recurring themes and topics prevalent in discussions surrounding climate change and its impacts. The prominence of "climate" and "change" underscores the central focus on environmental transformations. At the same time, words like "global" and "warning" suggest recognizing the widespread and urgent nature of climate-related challenges. Additionally, "school" and "video" hint at educational and multimedia strategies for communicating climate change issues. In conclusion, identifying these keywords provides valuable insights into the climate change discourse and can inform effective communication and engagement strategies with diverse audiences. Furthermore, the sentiment extract results on the Rapidminer application showed positive and negative sentiments regarding climate change video content posted by the National Geographic Channel. Table 1. Extract Sentiment in Rapidminer Review chemtrails are artificially creating that and your criminal frauds of the plastics frauds!!!!! Plant Marijuana. STOP THESE FRAUDS AND THEIR LIES! I think the best solution is to give the people of the world something good to invest in that is secure and pays a great dividend. The developed nations of the world could provide backing and guarantees for an "Earth Bond" that pays 8% annual inflationadjusted dividends with no limit to the guarantee and no ownership limit like was in place with war The bond funds would only be used for 2 purposes: . projects to improve the health of our . provide employment, benefits and fair compensation to the world's poor Essentially you would employ the world's poor to improve the health of the planet and give plenty of incentive for everyone to invest in these endeavors through a guaranteed annual 8% inflation-adjusted dividend. Everybody wins and nobody feels like they were "taken from" like through taxation that never Score criminal (-0. frauds (-0. frauds (-0. propaganda (-0. frauds (-0. stop (-0. frauds (-0. lies (-0. no (-0. no (-0. poor (-0. poor (-0. Total Score -5,53846153846154 4,58974358974359 Copyright A 2024 the auhor. Page 362 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. Based on sentiment analysis, it is discernible that the most frequently occurring keywords in reviews of climate change content published by the National Geographic Channel complement those identified by the website Netlytic. The amalgamation of data from both sources provides a comprehensive understanding of the prevalent themes and sentiments surrounding climate change discourse. The keywords "climate" . , "change" . , "earth" . , "global" . , and "warming" . featured prominently in both datasets, highlighting the overarching concern for environmental issues and the urgency to address climate change. Additionally, terms such as "people" . , "years" . , "school" . , "video" . , and "planet" . indicate a focus on educating and engaging diverse audiences, mainly through multimedia The convergence of popular keywords underscores the interconnectedness of public sentiment and media representation in shaping perceptions and responses to climate change. In conclusion, leveraging insights from National Geographic Channel reviews and Netlytic data facilitates a holistic approach to understanding and addressing climate change's multifaceted challenges, as shown in the figure below. Figure 3. WordlCloud In modeling using RapidMiner, connecting the SMOTE operator with the SVM algorithm allows for comparing performance outcomes. The utilization of the Synthetic Minority Over-sampling Technique (SMOTE) within the RapidMiner framework addresses the challenge of imbalanced datasets by generating synthetic samples of the minority class. By incorporating SMOTE with the Support Vector Machine (SVM) algorithm, which is renowned for its effectiveness in classification tasks, the modeling process aims to enhance predictive accuracy while mitigating the impact of class imbalance. This integration facilitates a robust model performance evaluation, enabling researchers and practitioners to make informed decisions regarding the most suitable approach for addressing imbalanced datasets in predictive modeling tasks. Figure 4. Modeling Process in Rapidminer Copyright A 2024 the auhor. Page 363 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. In the modeling process, the division between training and testing data follows a standard ratio of 70:30, allocating most of the dataset for training the model and a smaller portion for evaluating its performance. Additionally, to ensure data cleanliness and reduce redundancy, the data cleansing process involves removing duplicates, thereby enhancing the quality and reliability of the dataset. Furthermore, incorporating stopwords in the English language aids in refining the text data by filtering out commonly occurring words with minimal semantic value, thereby focusing the model's attention on more informative features. This meticulous approach to data preprocessing contributes to the robustness and effectiveness of the modeling process by enhancing the quality of input data and facilitating more accurate model predictions. SVM Using SMOTE SVM Without using SMOTE Figure 5. Area Under Curve (AUC) of the SVM Algorithm Based on the values obtained from the confusion matrix, it is evident that SVM with SMOTE yields differing performance outcomes. Specifically. SVM with SMOTE demonstrates distinct performance metrics, indicating its effectiveness in handling imbalanced datasets. The Area Under the Curve (AUC) values, including optimistic, micro-average, and pessimistic scores, consistently demonstrate predictive solid performance, with micro-average AUC values ranging from 0. 923 to 0. These results underscore the utility of integrating SMOTE with SVM for classification tasks, mainly when dealing with imbalanced datasets, as it enhances the model's ability to classify instances from minority classes accurately. In conclusion, the observed performance metrics validate the efficacy of SVM with SMOTE in improving classification performance, thus emphasizing its relevance in data-driven decision-making processes. Table 2. SVM Performance in Sentiment Classification SVM Using SMOTE PerformanceVector: accuracy: 86. 15% /- 1. 68% . icro average: ConfusionMatrix: True: Negative Positive Negative: 1082 Positive: 196 AUC . : 0. 924 /- 0. icro average: ositive class: Positiv. AUC: 0. 923 /- 0. icro average: 0. ositive class: Positiv. AUC . : 0. 923 /- 0. icro average: ositive class: Positiv. precision: 85. 17% /- 2. 16% . icro average: 11%) . ositive class: Positiv. ConfusionMatrix: True: Negative Positive Negative: 1082 Positive: 196 recall: 87. 64% /- 3. 39% . icro average: 87. ositive class: Positiv. ConfusionMatrix: True: Negative Positive Negative: 1082 Positive: 196 SVM Without using SMOTE PerformanceVector: accuracy: 74. 23% /- 0. 55% . icro average: ConfusionMatrix: True: Negative Positive Negative: 10 Positive: 443 AUC . : 0. 764 /- 0. icro average: ositive class: Positiv. AUC: 0. 760 /- 0. icro average: 0. ositive class: Positiv. AUC . : 0. 757 /- 0. icro average: ositive class: Positiv. precision: 74. 22% /- 0. 50% . icro average: 21%) . ositive class: Positiv. ConfusionMatrix: True: Negative Positive Negative: 10 Positive: 443 recall: 99. 77% /- 0. 38% . icro average: 99. ositive class: Positiv. ConfusionMatrix: True: Negative Positive Negative: 10 Positive: 443 Copyright A 2024 the auhor. Page 364 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. SVM Using SMOTE f_measure: 86. 34% /- 1. 79% . icro average: 35%) . ositive class: Positiv. ConfusionMatrix: True: Negative Positive Negative: 1082 Positive: 196 SVM Without using SMOTE f_measure: 85. 11% /- 0. 30% . icro average: 11%) . ositive class: Positiv. ConfusionMatrix: True: Negative Positive Negative: 10 Positive: 443 Based on the values obtained from the confusion matrix, it is evident that SVM without SMOTE exhibits specific performance metrics indicative of its classification accuracy and effectiveness. The accuracy rate, 23% with a micro average of 74. 23%, suggests a moderate level of overall correct classifications. However, further examination of precision, recall, and F-measure metrics reveals a nuanced understanding of the model's performance. While precision demonstrates the proportion of correctly classified positive instances among all instances predicted as positive, recall represents the proportion of correctly classified positive instances among all actual positive instances. The F-measure, which harmonizes precision and recall, provides a balanced assessment of the model's performance. In this instance, the precision . 22%), recall . 77%), and F-measure . 11%) values reflect reasonably high levels of correct classifications for the positive class, indicating the model's proficiency in identifying positive instances. Thus, despite not employing SMOTE to address the class imbalance. SVM demonstrates a commendable ability to classify positive instances accurately. Based on the values derived from the confusion matrix, it is evident that SVM utilizing SMOTE demonstrates notable performance metrics, particularly classification accuracy. The model exhibits a high overall correct classification with an accuracy rate averaging 86. 15% and a micro average of 86. Further precision, recall, and F-measure metrics analysis reveal additional model performance insights. Precision, representing the proportion of correctly classified positive instances among all instances predicted as positive, yields an average of 85. Additionally, recall, indicating the proportion of correctly classified positive instances among all actual positive instances, averages 87. The F-measure, synthesizing precision and recall, provides a balanced assessment of the model's performance, averaging 86. These metrics collectively underscore the effectiveness of SVM with SMOTE in accurately classifying positive instances, thereby affirming its utility in addressing class imbalance and enhancing predictive performance. Comparing the results of SVM with and without SMOTE reveals distinct performance disparities. When utilizing SMOTE. SVM demonstrates superior classification accuracy, averaging 86. 15% with a micro average 15%, compared to 74. 23% accuracy without SMOTE. Precision, recall, and F-measure metrics also exhibit significant improvements with SMOTE, averaging 85. 17%, 87. 64%, and 86. 34%, respectively, compared to 22%, 99. 77%, and 85. 11% without SMOTE. Additionally, the AUC values for SVM with SMOTE remain consistently high, with optimistic, micro average, and pessimistic scores ranging from 0. 923 to 0. 924, compared to slightly lower AUC values of 0. 757 to 0. 764 without SMOTE. These findings underscore the efficacy of SMOTE in addressing class imbalance and enhancing the predictive performance of SVM, particularly in accurately classifying positive instances. The analysis of the results reveals a substantial disparity in the performance of SVM with and without SMOTE. SVM with SMOTE demonstrates a notable increase in classification accuracy, achieving an average of 15% with a micro average of 86. 15%, compared to 74. 23% accuracy without SMOTE. Additionally, precision improves from 74. 22% without SMOTE to 85. 17% with SMOTE, recall increases from 99. 77% to 64%, and the F-measure rises from 85. 11% to 86. Furthermore, the Area Under the Curve (AUC) values consistently remain high for SVM with SMOTE, ranging from 0. 923 to 0. 924, compared to slightly lower AUC values of 0. 757 to 0. 764 without SMOTE. These findings underscore the efficacy of SMOTE in enhancing SVM's performance, particularly in handling imbalanced datasets, and emphasize the significance of employing appropriate techniques for optimal predictive modeling outcomes. CONCLUSION The research findings present a comparative analysis of the SVM algorithm's performance with and without SMOTE. The results indicate that SVM without SMOTE achieved an accuracy of 74. 23% /- 0. 55%, precision 22% /- 0. 50%, recall of 99. 77% /- 0. 38%, f_measure of 85. 11% /- 0. 30%, and AUC . of 764 /- 0. Conversely. SVM with SMOTE obtained an accuracy of 86. 15% /- 1. 68%, precision of 17% /- 2. 16%, recall of 87. 64% /- 3. 39%, f_measure of 86. 34% /- 1. 79%, and AUC of 0. 923 /- 0. These results highlight the notable performance improvements when employing SMOTE, demonstrating enhanced accuracy, precision, recall, f_measure, and AUC. Using SMOTE contributes to mitigating the class imbalance issue and consequently improves the SVM algorithm's effectiveness in sentiment classification tasks related to climate change and tourism. Thus, integrating SMOTE into the SVM algorithm is promising for achieving more robust sentiment analysis outcomes in such contexts. The practical implications of this research are multifaceted and extend beyond academia into real-world applications. The findings highlight the importance of addressing class imbalance in predictive modeling tasks, particularly in domains where minority classes are Copyright A 2024 the auhor. Page 365 This Journal is licensed under a Creative Commons Attribution 4. 0 International License Journal of Computer System and Informatics (JoSYC) ISSN 2714-8912 . edia onlin. ISSN 2714-7150 . edia ceta. Volume 5. No. February 2024. Page 357-367 https://ejurnal. seminar-id. com/index. php/josyc DOI 10. 47065/josyc. By demonstrating the efficacy of the SMOTE technique in enhancing the performance of Support Vector Machine (SVM) models, this research provides practitioners with a valuable tool for improving the accuracy and reliability of classification systems. Moreover, the observed improvements in classification metrics such as accuracy, precision, recall, and F-measure underscore the potential for SMOTE-enhanced SVM models to be deployed in various practical settings, including healthcare, finance, and cybersecurity, where accurate classification of minority instances is crucial for decision-making processes. In conclusion, this research contributes to advancing the field of machine learning and offers actionable insights that can positively impact decision-making and problem-solving in real-world scenarios. REFERENCES