SciRepID - Scientific Publication Search

Publication Search

41,520 articles from 397 journals · 1,447 citations tracked

Showing 1-20 of 82

Analytics

Yuma Akbar; Frencis Matheos Sarimolle; Dwi Swasono Rachmad; Muhammad Derry Oktaviandi

International Journal of Applied Mathematics and Computing 2026 Asosiasi Riset Ilmu Matematika dan Sains Indonesia

This study aims to analyze public sentiment toward the hashtag #KaburAjaDulu, which has circulated widely on the social media platform X (formerly Twitter). The hashtag reflects the growing anxiety among the public, especially younger generations, regarding socio-political issues in Indonesia. The data were collected using web scraping techniques, focusing on user-generated tweets that contain the hashtag. A comprehensive text preprocessing phase was conducted to clean the raw data by removing irrelevant elements such as URLs, emojis, numbers, and punctuation. The research applies a hybrid classification approach using a combination of Support Vector Machine (SVM) and Random Forest algorithms to categorize sentiment into three classes: positive, negative, and neutral. The performance of the model was evaluated using metrics such as accuracy, precision, recall, and F1-score to determine the effectiveness of the classification. The study aims to demonstrate that combining algorithms can improve classification performance compared to using a single algorithm. This research contributes to the field of sentiment analysis and provides valuable insights for researchers, policymakers, and social observers in understanding public opinion trends in digital media.

Rasiban Rasiban; Dadang Iskandar Mulyana; Muhammad Joko Umbaran Kharis Bahrudin; Nicola Marthy

International Journal of Information Engineering and Science 2026 Asosiasi Riset Teknik Elektro dan Infomatika Indonesia

The development of social media, especially TWITTER, has become one of the main means for people to express opinions and criticism on various issues, including the performance of law in Indonesia. This study aims to analyze public sentiment towards the performance of law based on TWITTER user comments using the Naïve Bayes algorithm. The research data consists of 1004 comments collected from several videos related to legal topics. The analysis process includes the stages of data crawling, pre- processing (text cleaning, normalization, and tokenization), labeling sentiment into positive, negative, and neutral, and testing the Naïve Bayes model. The results show that the Naïve Bayes algorithm is able to classify sentiment with an accuracy level of 93.73%. The distribution of sentiment from 1004 comments shows that the majority of public opinion is (negative/positive/neutral), which indicates that public perception of the performance of law is still (critical/positive). These findings are expected to be input for related parties to understand public opinion and improve the quality of legal performance in

Sutisna Sutisna; Tri Wahyudi; Dwi Swasono Rachmad; Fachrur Rozi

International Journal of Information Engineering and Science 2026 Asosiasi Riset Teknik Elektro dan Infomatika Indonesia

Social media X (Twitter) has become the main platform for the Indonesian public to express opinions, including on the trend of 'kabur aja dulu' (let's just run away for a bit). This research aims to classify the sentiments of the public using the Naïve Bayes and Support Vector Machine (SVM) methods, and to compare the accuracy of both in sentiment analysis. Data was collected via the Twitter API with the hashtag #kaburajadulu, resulting in 2,067 tweets, which, after the cleansing process and manual labeling, left 385 data points. The analysis process followed the CRISP-DM stages, which include business understanding, data understanding, data preparation, modeling, evaluation, and deployment. Model evaluation was conducted using a confusion matrix with accuracy, precision, and recall metrics. The classification results show that 82% of tweets have a positive sentiment and 18% negative. The Naïve Bayes algorithm achieved an accuracy of 86.49%, slightly lower than SVM, which reached 88.05%. In conclusion, Support Vector Machine is more effective in sentiment classification on public opinion data. This research contributes to the digital mapping of public opinion and recommends the development of automatic labeling methods as well as the exploration of advanced algorithms in the future.

Veri Arinal; Satria Wira Yudha; Muhammad Joko Umbaran Kharis Bahrudin; Dessyanti Ryantina

International Journal of Information Engineering and Science 2026 Asosiasi Riset Teknik Elektro dan Infomatika Indonesia

QRIS (Quick Response Code Indonesian Standard) has become a widely used national digital payment standard. User satisfaction with this service needs to be monitored continuously to ensure its sustainability. This study aims to predict the level of QRIS user satisfaction based on their experiences and perceptions expressed organically on the Twitter social media platform. The method used is sentiment analysis with the Naive Bayes classification algorithm implemented using RapidMiner software. The research data was obtained from Twitter user comments collected through web scraping techniques. The text data then went through a preprocessing stage that included cleansing, stopword filtering, stemming, and tokenizing to be prepared as features ready to be processed by the model. The data was divided into training (80%) and testing (20%) subsets for model training and validation. The results showed that the Naive Bayes model was able to predict user satisfaction sentiment with an accuracy of 80.99%. These findings indicate that the model is highly accurate in identifying satisfied comments and sufficiently sensitive in detecting dissatisfaction. This study concludes that sentiment analysis of Twitter UGC data using Naive Bayes is an effective and efficient approach for predicting QRIS user satisfaction in real time. The practical implication of this study is to provide an automatic feedback system for service providers to monitor public sentiment and take targeted corrective actions.

Mesra Betty Yel; Sopan Adrianto; Rasiban Rasiban; Eva Widiyanti

International Journal of Information Engineering and Science 2026 Asosiasi Riset Teknik Elektro dan Infomatika Indonesia

The growth of information technology has driven changes in consumer behavior, one of which is through e-commerce platforms such as Shopee. This phenomenon has generated a large number of customer reviews, including those for local cosmetic products such as Wardah. These reviews serve as an important source of information for understanding customer perceptions and satisfaction levels. However, manual analysis of large and linguistically diverse datasets is inefficient and potentially subjective. This study aims to implement the multi-category Naive Bayes algorithm to classify the sentiment of Wardah product reviews on Shopee into three categories: positive, negative, and neutral. The data were collected using a web scraping technique and processed through a series of preprocessing stages including case folding, tokenization, stopword removal, stemming, and text cleaning. Subsequently, term weighting was performed using the TF-IDF method prior to classification. Model performance was evaluated using a confusion matrix as well as accuracy, precision, and recall metrics. The results indicate that the multi-category Naive Bayes algorithm achieved an accuracy of 86.00%, a precision of 86.63%, and a recall of 98.24%. This approach can assist business practitioners in objectively understanding customer opinions and support decision-making in business strategy and product development.

Untung Surapati; Veri Arinal; Tri Wahyudi; Ahmad Fauzan

International Journal of Applied Mathematics and Computing 2026 Asosiasi Riset Ilmu Matematika dan Sains Indonesia

The rise of social media has created a digital public sphere that enables users to express their opinions on social and political issues openly and in real-time. One of the most discussed topics on social media platform X is the trending hashtag #IndonesiaGelap, which reflects public concern and criticism regarding various governmental and societal conditions. This study aims to conduct sentiment analysis on tweets containing the hashtag to determine the overall sentiment trend among users. The method employed in this research is the Naive Bayes classification algorithm, known for its simplicity and effectiveness in text classification. To enhance the model’s performance, Particle Swarm Optimization (PSO) is applied to optimize feature selection and parameter tuning. The dataset consists of public tweets collected via the Twitter API, followed by preprocessing, feature extraction using TF-IDF, and sentiment classification into three categories: positive, negative, and neutral. The results indicate that the integration of PSO significantly improves the classification accuracy of the Naive Bayes model compared to the baseline. The majority of tweets related to #IndonesiaGelap exhibit a negative sentiment, indicating widespread public dissatisfaction and criticism. This research is expected to contribute to a better understanding of public perception and serve as valuable input for stakeholders in addressing social issues in the digital age.

Aura Rahayu Aksa Radiana; Fathoni Mahardika; Dani Indra Junaedi

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika 2026 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

This study aims to develop a sentiment classification method for YouTube user comments related to the game Love and Deepspace using the Naïve Bayes algorithm, focusing on improving the text data processing and understanding user perceptions. Comment data were collected through scraping from YouTube videos, followed by preprocessing including text cleaning, normalization, stopword removal, stemming, and translation into English. Initial labeling was conducted using TextBlob, then the data were randomly sampled for training the Naïve Bayes model. Evaluation involved comparing sentiment distributions and visualization using Word Cloud and bar charts. The Naïve Bayes model achieved an accuracy of 77.36% in sentiment classification. The sentiment distribution shows differences between TextBlob (positive: 1,011, neutral: 1,312, negative: 575) and Naïve Bayes (positive: 901, neutral: 1,627, negative: 370), with Naïve Bayes being more conservative. The Word Cloud visualization identifies dominant words such as "bang," "game," and "main," while the bar chart shows the largest proportion of neutral sentiment. Naïve Bayes is effective for sentiment classification on informal comment data, with significant differences from rule-based methods like TextBlob. This research contributes to the development of text data processing techniques and user perception analysis, as well as opening up optimization opportunities with other algorithms like SVM for better accuracy.

Diajeng Febriana; Suci Suci; Darmawati Darmawati

Jurnal Penelitian Komunikasi dan Sosialisasi 2026 Asosiasi Peneliti dan Pengajar Ilmu Sosial Indonesia

This research critically investigates the circulation of disinformation concerning the instability of fuel prices on the digital platform X and its subsequent implications for the polarization of modern society. In an era where unverified economic news frequently dictates public reaction, fake news often acts as a potent catalyst for mass anxiety. By implementing a quantitative framework driven by lexicon-based computational sentiment analysis, this study effectively processed a dataset of 500 public opinion samples extracted via Google Colab spanning from April 2024 to April 2026. To ensure computational accuracy and eliminate textual noise, the data underwent a rigorous preprocessing phase encompassing case folding, alongside the systematic removal of URLs, account mentions, numbers, hashtags, and punctuation marks. The statistical outcomes revealed a highly disproportionate emotional landscape, overwhelmingly dominated by 451 negative reviews. In stark contrast, neutral observations and positive affirmations were nearly absent, recording only 40 and 9 instances, respectively. The data compellingly illustrates that the relentless influx of pessimistic narratives regarding economic instability directly induces financial panic, undermines rational discourse, and severely fragments cyberspace into deeply polarized factions.

Ayu Astuti Siregar; Al-Khowarizmi

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika 2026 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Social media has evolved into a significant platform where consumers freely express their opinions, experiences, and levels of satisfaction regarding various products, including those offered by Micro, Small, and Medium Enterprises (MSMEs). The comments and reviews shared by customers on these platforms contain diverse sentiments that can serve as valuable indicators of how consumers perceive product quality. Understanding these sentiments is crucial for MSME owners, as it allows them to evaluate their products and adapt to market expectations more effectively. This study aims to analyze customer sentiment toward MSME products on social media by utilizing the Naïve Bayes algorithm, a widely used classification method in text mining. The data used in this research consist of customer comments collected from various social media platforms. The research process involves several stages, including data collection, manual labeling of sentiments, text preprocessing (such as tokenization, case folding, and stopword removal), and splitting the dataset into training and testing subsets. Subsequently, the classification process is carried out using the Naïve Bayes algorithm to categorize sentiments into positive, negative, and neutral classes. The results of this study demonstrate that the Naïve Bayes method is effective in classifying customer sentiments with a satisfactory level of accuracy. These findings provide a comprehensive overview of consumer perceptions regarding the quality of MSME products. Furthermore, this research is expected to assist MSME business owners in understanding customer feedback more systematically and using it as a basis for improving product quality and enhancing customer satisfaction in a competitive digital marketplace.

Eko Susanto; Sharipuddin Sharipuddin; Benni Purnama

Prosiding Seminar Nasional Ilmu Teknik 2026 Asosiasi Riset Ilmu Teknik Indonesia

The rapid growth of e-commerce in Indonesia, particularly the Shopee platform, has generated a large volume of user reviews on the Google Play Store, which can be analyzed to understand consumer sentiment. This study aims to compare the performance of the Support Vector Machine (SVM) and Random Forest (RF) algorithms in binary sentiment classification (positive and negative) on Shopee reviews, as well as to statistically test the significance of their differences using One-Way ANOVA. A total of 400,498 reviews were collected via web scraping, preprocessed through text normalization, tokenization, and Indonesian language stemming, and then feature-extracted using TF-IDF and Count Vectorizer. Evaluation results show that SVM achieved an accuracy of 91.77%, precision of 91.49%, recall of 91.77%, and F1-Score of 91.56%, while RF achieved an accuracy of 90.07%, precision of 91.68%, recall of 90.07%, and F1-Score of 90.55%. ANOVA confirmed that the performance difference between the two algorithms is statistically significant (p-value = 0.0007) with a large effect size (η² = 0.1815). Therefore, SVM is recommended as a more optimal and consistent algorithm for automated sentiment analysis of Indonesian e-commerce reviews, while also providing a replicable methodological framework for similar future research.

Dihin Muriyatmoko; Aziz Musthafa; Yusuf Al Banna

Prosiding Seminar Nasional Ilmu Teknik 2026 Asosiasi Riset Ilmu Teknik Indonesia

Sentiment analysis on social media is widely used to represent public perceptions of sports performance, particularly in international competitions. This study aims to analyze the sentiment of YouTube user comments regarding the performance of the Indonesian National Football Team during the FIFA World Cup 2026 Asian Qualifiers. The data were collected from user comments on videos related to the matches and analyzed using a machine learning–based sentiment analysis approach. Sentiment classification was performed using the Naive Bayes algorithm. The results indicate that the proposed approach is able to effectively identify public sentiment toward the national team’s performance during the qualification matches. The findings of this study are expected to provide insights into public perceptions and contribute to sentiment analysis research in the field of sports.

Marjelin Putri Ndaparoka; Stefanus D.I. Mau; Sihang Gregorius Bali Mema

Modem : Jurnal Informatika dan Sains Teknologi 2026 Asosiasi Profesi Telekomunikasi Dan Informatika Indonesia

Savings and Loan Cooperatives (KSP) play a vital role in expanding community access to capital, especially within the informal sector. Nevertheless, non-performing loans remain a persistent challenge that can threaten liquidity and long-term institutional sustainability. KSP CU Mera Ndi Ate faces similar issues, which are assumed to stem not only from administrative weaknesses but also from members’ perceptions and behavioral factors. This research aims to examine the potential causes of non-performing loans through text-based sentiment analysis using an unsupervised learning approach. A quantitative method with a data mining framework was applied. Data were gathered through interviews, observations, documentation, and 200 customer opinion texts processed using the Orange Data Mining application. The analytical stages included preprocessing, corpus development, feature extraction, sentiment clustering, and visualization. Because the dataset lacked predefined labels, unsupervised learning was used to identify naturally emerging sentiment patterns. Findings reveal a predominance of critical sentiments related to credit assessment procedures and service quality. The highest sentiment score (75) concerned insufficient creditworthiness evaluation, followed by concerns about service efficiency (66.6667). These insights suggest that improving assessment accuracy and service quality may help reduce non-performing loans.

Muhimatul Ifadah; Muhimatul Ifadah; Bambang Irawan

Jurnal Elektronika dan Komputer 2026 STEKOM PRESS

User reviews on the Shopee e-commerce platform represent an important source of information for understanding consumer perceptions of products and services. Sentiment analysis is commonly applied to classify user opinions into positive, neutral, and negative sentiment categories based on textual data. This study aims to analyze the performance of the Long Short-Term Memory (LSTM) method in sentiment classification of Shopee user reviews. The dataset used in this study consists of Indonesian-language user reviews that have undergone preprocessing stages, including case folding, text cleaning, tokenization, and stopword removal. The LSTM model was trained using preprocessed text represented as word sequences. Model performance was evaluated using overall accuracy and class-wise classification results. The experimental results indicate that the LSTM method achieved an overall accuracy of 87.62%. In addition, the classification performance for the positive sentiment class reached 95.27%, the neutral class achieved 4.96%, and the negative class reached 74.26%. These results demonstrate that the LSTM method performs well in classifying sentiment in Shopee user reviews, particularly for positive sentiment. This study is expected to provide insights and references for the application of deep learning methods in sentiment analysis of Indonesian e-commerce review data.

Mahruzar, Mahruzar; Setiawan Assegaff; Jasmir Jasmir; Yosefina Venus

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

The increasing volume of online hotel reviews provides valuable insights into customer perceptions but poses challenges for manual analysis due to its unstructured nature. This study aims to compare the performance of Recurrent Neural Network (RNN) and Bidirectional Encoder Representations from Transformers (BERT) in hotel review sentiment analysis. A total of 20,491 TripAdvisor hotel reviews were classified into three sentiment categories: negative, neutral, and positive. The research methodology includes text preprocessing, stratified data splitting, class imbalance handling using Random Over-Sampling, tokenization, and supervised model training. Model performance was evaluated using a confusion matrix and classification metrics. The results indicate that BERT outperforms RNN, achieving an accuracy of 80.54%, while RNN reached 62.21%. BERT demonstrated superior capability in capturing contextual and semantic information in hotel reviews. These findings suggest that transformer-based models are more effective for sentiment analysis of complex textual data in the hospitality domain and can support data-driven service improvement strategies.    

Tasya Nurdin; Dodo Zaenal Abidin; Kurniabudi Kurniabudi

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

This study conducts sentiment analysis of Indonesian user reviews of the CapCut application using IndoBERT and compares two evaluation schemes: a single 80/20 train–test split and stratified 5-fold cross-validation (k=5). A total of 1,048,575 reviews were collected from the Google Play Store through web scraping and labeled into three sentiment classes based on rating: negative (1–2), neutral (3), and positive (4–5). After preprocessing—cleaning, case folding, banned-word removal, normalization—and duplicate removal, 517,962 reviews were retained. IndoBERT Base P1 was fine-tuned using fixed hyperparameters (batch size 32, learning rate 2e-5, up to 4 epochs, early stopping patience 2), while undersampling was applied to the training set to address class imbalance. Performance was assessed using accuracy, precision, recall, F1-score, and ROC-AUC, supported by confusion matrix and ROC-curve visualizations. The single split achieved an accuracy of 0.756, whereas cross-validation produced a mean accuracy of 0.740. Across both schemes, the positive class achieved the best performance (F1-score 0.850; ROC-AUC 0.918–0.919), while the neutral class remained the most challenging (precision 0.198–0.206; F1-score 0.280–0.283). Overall, cross-validation is recommended for reporting because it reduces dependence on a single partition and provides a more representative estimate across multiple splits.

Fransiskus Dapot Sihaloho; Jasmir Jasmir; Gunardi Gunardi

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

The rapid growth of e-commerce platforms in Indonesia, particularly Tokopedia, has resulted in a large volume of consumer reviews containing valuable information regarding customer perceptions and satisfaction. However, manual analysis of such reviews is inefficient and prone to subjectivity, necessitating an automated approach based on machine learning. This study aims to classify the sentiment of sports product reviews on Tokopedia into positive, negative, and neutral categories by applying Logistic Regression, Support Vector Machine (SVM), and Random Forest using the Term Frequency–Inverse Document Frequency (TF-IDF) approach. The data were collected through web scraping of Indonesian-language sports product reviews and processed through several preprocessing stages, including data cleaning, case folding, tokenization, stopword removal, and stemming. Feature representation was performed using TF-IDF to transform textual data into numerical vectors, after which the dataset was divided into training and testing sets with an 80:20 ratio. Model performance was evaluated using accuracy, precision, recall, and F1-score metrics. The results indicate that the application of TF-IDF significantly improves the performance of all models, with SVM consistently achieving the most optimal performance compared to Logistic Regression and Random Forest. These findings demonstrate that classical machine learning algorithms combined with TF-IDF remain highly effective for sentiment analysis of Indonesian-language text. The implications of this study are expected to assist sellers in understanding customer opinions, support consumers in making informed purchasing decisions, and serve as a foundation for the development of sentiment analysis and recommendation systems on e-commerce platforms.

Ary Ardiansyah; Pareza Alam Jusia; Rudolf Sinaga; Clarisa Putri Valentina; Pardede, Nadia

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

The Ministry of Social Affairs has made a new breakthrough in facilitating the public in checking social assistance recipients, namely the social assistance check application. User reviews can be used to find out whether the application provides benefits to the community or not. However, these reviews need to be processed using sentiment analysis. Then to do sentiment analysis requires machine learning. One method that includes machine learning is Naïve Bayes. The purpose of this research is to implement the Naïve Bayes method in conducting sentiment analysis and find out whether the social assistance check application is beneficial to society based on the results of sentiment analysis. In this study, two categories of sentiment are used, namely positive and negative. The author collects by crawling using the Google Play Scrapper library. The results of crawling data obtained as many as 4000 data. The results showed that the actual data that had been labeled using Textblob resulted in 987 negative label reviews and 628 positive label reviews. Meanwhile, the Naïve Bayes method is able to analyze the review sentiment of the social assistance check application with the results of 1181 negative sentiments and 434 positive sentiments. The Naïve Bayes model has a good accuracy rate of 0.77 or 77% in analyzing sentiment for social assistance check application reviews.

Srikandi Alifya; Jasmir Jasmir; Elvi yanti

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

The growth of e-commerce in Indonesia has led to an increase in product reviews, including for beauty products on Tokopedia and Shopee. These reviews serve as important sources of information to assess consumer satisfaction; however, manually analyzing thousands of reviews daily is impractical. This study applies Natural Language Processing (NLP) with Naive Bayes, C4.5, XGBoost algorithms to classify sentiment in Indonesian-language reviews. The dataset used consists of 76,256 reviews labeled as positive, negative, and neutral. The research stages include text preprocessing, feature representation using BoW and TF-IDF, data balancing through SMOTE, and model performance evaluation based on accuracy, precision, and recall. Differences in results among the algorithms were analyzed using ANOVA. The results show that Naive Bayes achieved the highest accuracy at 67.71%, followed by XGBoost at 65.91%, and C4.5 at 58.39%, with Naive Bayes performing best in identifying positive and negative sentiments, while XGBoost and C4.5 handled more complex data patterns effectively. These findings provide guidance for sentiment analysis in Indonesian and support businesses in obtaining automated insights from customer reviews to improve product quality and services.

Nanda Mediya Sari; Jasmir Jasmir; Elvi Yanti

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Sentiment analysis is a technique in Natural Language Processing (NLP) used to identify user opinion tendencies based on textual reviews. This study analyzer user reviews of the Maxim application on the Google Play Store and compares three Machine Learning algoritmhs-Naïve Bayes, Support Vector Machine (SVM), and CatBoost-in classifying sentiment. The research stages include data collection, text preprocessing, feature extraction using TF-IDF and Chi-Square, class balancing using SMOTE, and performance evaluation through Accuracy, Precision, Recall, and F1-Score. ANOVA is used to examine the influence of feature selection on model performance. The results show that each model exhibits different performance level across the tested feature combinations. The CatBoost achieved the highest accuracy of 99,26% and demonstrating the most stable performance. Meanwhile, the Naïve Bayes and SVM models experienced performance decreases experiments, especially after applying SMOTE. These findings indicate that the choise of algorithm, feature extraction method, and class balancing technique significantly affects classification outcomes. Overall, CatBoost is identified as the best-performing model, providing more consistenst classification result in accordance with the characteristics of the user reviews.

Noronha, Marcelino Caetano; Dwiasnati, Saruni; Helena P Panjaitan, Cherlina

Journal of Information Technology and Computer Science 2025 International Forum of Researchers and Lecturers

Abstract: The rapid diffusion of Generative Artificial Intelligence (AI) has intensified public debate regarding its benefits, risks, and societal implications. This study investigates public sentiment and thematic structures surrounding Generative AI by analyzing Twitter discourse as a representation of large-scale, real-time public perception. The research addresses two main problems: how public sentiment toward Generative AI is distributed and what dominant themes shape this perception. Accordingly, the objective is to map both emotional polarity and thematic narratives embedded in social media conversations. A computational mixed-methods approach was employed using a dataset of 12,470 tweets collected on 17 December 2024. Sentiment classification was conducted using a transformer-based DistilBERT model, while semantic representations were generated with Sentence-BERT. Topic modeling was performed using BERTopic, integrating HDBSCAN clustering and class-based TF-IDF to extract coherent and interpretable topics. Human-in-the-loop validation supported the interpretive robustness of topic labeling. The findings reveal that public sentiment toward Generative AI is predominantly positive (41.8%), particularly in relation to productivity enhancement, education, and creative applications. Neutral sentiment (31.4%) reflects informational discourse, while negative sentiment (26.8%) centers on ethical concerns, privacy risks, misinformation, and AI hallucinations. Seven dominant topics were identified, with clear topic–sentiment alignment showing optimism in utility-driven themes and skepticism in ethics- and risk-related discussions. In conclusion, public perception of Generative AI is dualistic—characterized by strong enthusiasm alongside persistent caution. These results provide empirical insights for AI governance, responsible innovation, and future research on socio-technical impacts of Generative AI. *