Integrasi Principal Component Analysis dan Logistic Regression untuk Analisis Sentimen Kepuasan Pelanggan Berdasarkan Ulasan Online
DOI:
https://doi.org/10.47065/bulletincsr.v6i4.1029Keywords:
Sentiment Analysis; Logistic Regression; Principal Component Analysis; d’Besto EBM; Google ReviewsAbstract
Customer reviews on digital platforms are an important source of information for evaluating service quality and customer satisfaction levels. However, the unstructured nature of review data and its high feature dimensionality pose challenges in the sentiment analysis process. This study aims to develop a customer sentiment analysis model by integrating Principal Component Analysis (PCA) and Logistic Regression. The data used are 679 Indonesian-language reviews obtained through web scraping techniques from Google Reviews at ten d'Besto EBM branches in Padang City. The research stages include text preprocessing, TF-IDF weighting, dimensionality reduction using PCA, and sentiment classification using Logistic Regression. The results show that PCA is able to reduce data complexity by producing two principal components that explain 85.7% of the total data variance. The Logistic Regression model built on the features resulting from PCA reduction achieved an accuracy of 82%, demonstrating the model's ability to effectively classify positive and negative sentiments. In addition to improving computational efficiency, the use of PCA also helps reduce feature redundancy in high-dimensional text data. The contribution of this research is to produce a simpler and more efficient sentiment analysis approach to process customer reviews and provide data-based information that can be used to support service quality evaluation and decision-making in the culinary industry.
Downloads
References
A. Pemasaran, “Affiliate Marketing and Smartphone Usage on Student Productivity,” Journal of Digital Marketing Research, vol. 5, no. 2, pp. 1–9, 2025.
K. L. Keller, Marketing Management: Strategic Brand Analysis, Pearson Education, 2022.
B. Harto, A. Y. Rukmana, R. Tahir, and E. Waty, “Digital Business Transformation in the Era of Information Technology,” Journal of Business Innovation, vol. 8, no. 1, pp. 45–56, 2023.
N. A. Rahma, A. Anggarani, and W. Prasetyoning, “The Role of Online Customer Review, Rating, and Brand Image on Purchase Decision,” Journal of Marketing Management, vol. 12, no. 4, pp. 210–218, 2024, doi: 10.47191/jefms/v6-i6-17.
J. Nugraha, “Branding Strategy in the Digital Era,” Journal of Business Communication, vol. 7, no. 1, pp. 33–41, 2024, doi: 10.4018/978-1-6684-8351-0.ch008
A. S. Sitio and F. A. Sianturi, “Machine Learning Application in Internet Usage Behavior Analysis,” Journal of Information Systems, vol. 2, no. 1, pp. 46–51, 2024.
APJII, “Indonesia Internet Usage Report 2023,” Association of Indonesian Internet Service Providers, 2023. https://apjii.or.id
S. Mishra et al., “Principal Component Analysis,” International Journal of Learning and Research, vol. 10, no. 1, pp. 1–10, 2022. https://doi.org/10.5455/ijlr.20170415115235
D. N. Ardelia, H. D. Arifin, S. Daniswara, and A. P. Sari, “Mobile Phone Price Classification Using Logistic Regression Algorithm,” Journal of Computer Science and Informatics, vol. 4, no. 1, pp. 37–43, 2024, doi: 10.1109/ISMSIT52890.2021.9604550.
I. Rahmawati and T. R. Fitriani, “Sentiment Analysis Using Logistic Regression on Lion Air Airline Reviews,” Journal of Information Technology, vol. 1, no. 1, pp. 11–16, 2023, doi: 10.58776/jriti.v1i1.60
S. H. Pratama, “Big Data Analytics and Predictive Modeling in Industry 4.0,” Journal of Data Science and Analytics, vol. 6, no. 2, pp. 55–63, 2023.
P. Yayasan and K. Menulis, “Application of Artificial Intelligence in Data Science,” Journal of AI Research, vol. 3, no. 1, pp. 22–30, 2025.
S. D. Anggita and F. F. Abdulloh, “Sentiment Analysis Optimization Using PSO and Information Gain Feature Selection with SVM,” Journal of Machine Learning Applications, vol. 4, no. 1, pp. 52–57, 2023, doi: 10.52158/jacost.v4i1.524
Z. Shen, “Feature Selection Using Fuzzy Methods and Meta Learning,” Journal of Computational Intelligence, vol. 9, no. 2, pp. 88–97, 2022, doi: 10.1109/FUZZ48607.2020.9177769.
A. K. Jean, M. Diarra, B. A. Bakary, G. Pierre, and A. K. Jérôme, “Hybrid CNN-SVM and PCA-SVM Approaches for Cocoa Beans Classification,” International Journal of Advanced Computer Science and Applications, vol. 13, no. 9, pp. 100–110, 2022. https://doi.org/10.14569/IJACSA.2022.0130927
H. Rachmi, “PCA and Genetic Algorithm for Sentiment Analysis Using SVM,” Journal of Data Mining and Knowledge Discovery, vol. 5, no. 2, pp. 97–104, 2022.
K. Srijiranon, “Hybrid PCA-EMD-LSTM Framework for Stock Market Prediction with Sentiment Analysis,” Applied Sciences, vol. 12, no. 1, pp. 1–15, 2022, doi: 10.3390/app122110823.
Z. Song, X. Gong, C. Zhang, and C. Yu, “Investor Sentiment Based on Scaled PCA Method for Stock Market Volatility Prediction,” International Review of Economics & Finance, vol. 83, pp. 528–545, 2023. https://doi.org/10.1016/j.iref.2022.10.007
E. O. Omuya, G. Okeyo, and M. Kimwele, “Sentiment Analysis on Social Media Tweets Using Dimensionality Reduction and NLP Techniques,” Engineering Reports, vol. 14, no. 2, pp. 1–14, 2022. https://doi.org/10.1002/eng2.12579
S. Srivastava, C. Chakraborty, and M. Kanti, “Leveraging Machine Learning and Dimensionality Reduction for Sentiment Analysis,” Measurement: Sensors, vol. 33, pp. 101182, 2024. https://doi.org/10.1016/j.measen.2024.101182
Y. Zhang and L. Wang, “Text Classification Using Logistic Regression and TF-IDF Feature Selection,” Journal of Big Data Research, vol. 11, no. 3, pp. 101–112, 2024, doi: 10.47467/visa.v4i3.2835.
R. Kumar, “Dimensionality Reduction Techniques in Text Mining: A Review,” Journal of Artificial Intelligence Review, vol. 15, no. 2, pp. 55–70, 2023, doi: 10.4018/978-1-5225-0489-4.ch003.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Integrasi Principal Component Analysis dan Logistic Regression untuk Analisis Sentimen Kepuasan Pelanggan Berdasarkan Ulasan Online
ARTICLE HISTORY
How to Cite
Issue
Section
Copyright (c) 2026 Tsalsabila Jilhan Haura, Rini Sovia, Gunadi Widi Nurcahyo

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).













