Integrasi Principal Component Analysis dan Logistic Regression untuk Analisis Sentimen Kepuasan Pelanggan Berdasarkan Ulasan Online


Authors

  • Tsalsabila Jilhan Haura Universitas Putra Indonesia YPTK Padang, Padang, Indonesia
  • Rini Sovia Universitas Putra Indonesia YPTK Padang, Padang, Indonesia
  • Gunadi Widi Nurcahyo Universitas Putra Indonesia YPTK Padang, Padang, Indonesia

DOI:

https://doi.org/10.47065/bulletincsr.v6i4.1029

Keywords:

Sentiment Analysis; Logistic Regression; Principal Component Analysis; d’Besto EBM; Google Reviews

Abstract

Customer reviews on digital platforms are an important source of information for evaluating service quality and customer satisfaction levels. However, the unstructured nature of review data and its high feature dimensionality pose challenges in the sentiment analysis process. This study aims to develop a customer sentiment analysis model by integrating Principal Component Analysis (PCA) and Logistic Regression. The data used are 679 Indonesian-language reviews obtained through web scraping techniques from Google Reviews at ten d'Besto EBM branches in Padang City. The research stages include text preprocessing, TF-IDF weighting, dimensionality reduction using PCA, and sentiment classification using Logistic Regression. The results show that PCA is able to reduce data complexity by producing two principal components that explain 85.7% of the total data variance. The Logistic Regression model built on the features resulting from PCA reduction achieved an accuracy of 82%, demonstrating the model's ability to effectively classify positive and negative sentiments. In addition to improving computational efficiency, the use of PCA also helps reduce feature redundancy in high-dimensional text data. The contribution of this research is to produce a simpler and more efficient sentiment analysis approach to process customer reviews and provide data-based information that can be used to support service quality evaluation and decision-making in the culinary industry.

Downloads

Download data is not yet available.

References

A. Pemasaran, “Affiliate Marketing and Smartphone Usage on Student Productivity,” Journal of Digital Marketing Research, vol. 5, no. 2, pp. 1–9, 2025.

K. L. Keller, Marketing Management: Strategic Brand Analysis, Pearson Education, 2022.

B. Harto, A. Y. Rukmana, R. Tahir, and E. Waty, “Digital Business Transformation in the Era of Information Technology,” Journal of Business Innovation, vol. 8, no. 1, pp. 45–56, 2023.

N. A. Rahma, A. Anggarani, and W. Prasetyoning, “The Role of Online Customer Review, Rating, and Brand Image on Purchase Decision,” Journal of Marketing Management, vol. 12, no. 4, pp. 210–218, 2024, doi: 10.47191/jefms/v6-i6-17.

J. Nugraha, “Branding Strategy in the Digital Era,” Journal of Business Communication, vol. 7, no. 1, pp. 33–41, 2024, doi: 10.4018/978-1-6684-8351-0.ch008

A. S. Sitio and F. A. Sianturi, “Machine Learning Application in Internet Usage Behavior Analysis,” Journal of Information Systems, vol. 2, no. 1, pp. 46–51, 2024.

APJII, “Indonesia Internet Usage Report 2023,” Association of Indonesian Internet Service Providers, 2023. https://apjii.or.id

S. Mishra et al., “Principal Component Analysis,” International Journal of Learning and Research, vol. 10, no. 1, pp. 1–10, 2022. https://doi.org/10.5455/ijlr.20170415115235

D. N. Ardelia, H. D. Arifin, S. Daniswara, and A. P. Sari, “Mobile Phone Price Classification Using Logistic Regression Algorithm,” Journal of Computer Science and Informatics, vol. 4, no. 1, pp. 37–43, 2024, doi: 10.1109/ISMSIT52890.2021.9604550.

I. Rahmawati and T. R. Fitriani, “Sentiment Analysis Using Logistic Regression on Lion Air Airline Reviews,” Journal of Information Technology, vol. 1, no. 1, pp. 11–16, 2023, doi: 10.58776/jriti.v1i1.60

S. H. Pratama, “Big Data Analytics and Predictive Modeling in Industry 4.0,” Journal of Data Science and Analytics, vol. 6, no. 2, pp. 55–63, 2023.

P. Yayasan and K. Menulis, “Application of Artificial Intelligence in Data Science,” Journal of AI Research, vol. 3, no. 1, pp. 22–30, 2025.

S. D. Anggita and F. F. Abdulloh, “Sentiment Analysis Optimization Using PSO and Information Gain Feature Selection with SVM,” Journal of Machine Learning Applications, vol. 4, no. 1, pp. 52–57, 2023, doi: 10.52158/jacost.v4i1.524

Z. Shen, “Feature Selection Using Fuzzy Methods and Meta Learning,” Journal of Computational Intelligence, vol. 9, no. 2, pp. 88–97, 2022, doi: 10.1109/FUZZ48607.2020.9177769.

A. K. Jean, M. Diarra, B. A. Bakary, G. Pierre, and A. K. Jérôme, “Hybrid CNN-SVM and PCA-SVM Approaches for Cocoa Beans Classification,” International Journal of Advanced Computer Science and Applications, vol. 13, no. 9, pp. 100–110, 2022. https://doi.org/10.14569/IJACSA.2022.0130927

H. Rachmi, “PCA and Genetic Algorithm for Sentiment Analysis Using SVM,” Journal of Data Mining and Knowledge Discovery, vol. 5, no. 2, pp. 97–104, 2022.

K. Srijiranon, “Hybrid PCA-EMD-LSTM Framework for Stock Market Prediction with Sentiment Analysis,” Applied Sciences, vol. 12, no. 1, pp. 1–15, 2022, doi: 10.3390/app122110823.

Z. Song, X. Gong, C. Zhang, and C. Yu, “Investor Sentiment Based on Scaled PCA Method for Stock Market Volatility Prediction,” International Review of Economics & Finance, vol. 83, pp. 528–545, 2023. https://doi.org/10.1016/j.iref.2022.10.007

E. O. Omuya, G. Okeyo, and M. Kimwele, “Sentiment Analysis on Social Media Tweets Using Dimensionality Reduction and NLP Techniques,” Engineering Reports, vol. 14, no. 2, pp. 1–14, 2022. https://doi.org/10.1002/eng2.12579

S. Srivastava, C. Chakraborty, and M. Kanti, “Leveraging Machine Learning and Dimensionality Reduction for Sentiment Analysis,” Measurement: Sensors, vol. 33, pp. 101182, 2024. https://doi.org/10.1016/j.measen.2024.101182

Y. Zhang and L. Wang, “Text Classification Using Logistic Regression and TF-IDF Feature Selection,” Journal of Big Data Research, vol. 11, no. 3, pp. 101–112, 2024, doi: 10.47467/visa.v4i3.2835.

R. Kumar, “Dimensionality Reduction Techniques in Text Mining: A Review,” Journal of Artificial Intelligence Review, vol. 15, no. 2, pp. 55–70, 2023, doi: 10.4018/978-1-5225-0489-4.ch003.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Integrasi Principal Component Analysis dan Logistic Regression untuk Analisis Sentimen Kepuasan Pelanggan Berdasarkan Ulasan Online

Dimensions Badge

ARTICLE HISTORY

Published: 2026-06-21

Abstract View: 9 times
PDF Download: 2 times

How to Cite

Haura, T. J., Sovia, R., & Nurcahyo, G. W. (2026). Integrasi Principal Component Analysis dan Logistic Regression untuk Analisis Sentimen Kepuasan Pelanggan Berdasarkan Ulasan Online. Bulletin of Computer Science Research, 6(4), 1381-1387. https://doi.org/10.47065/bulletincsr.v6i4.1029

Issue

Section

Articles

Most read articles by the same author(s)