Klasifikasi Sentimen Ulasan Produk Olahraga di Tokopedia Menggunakan Metode Machine Learning dengan Pendekatan TF-IDF
DOI:
https://doi.org/10.61132/prosemnasproit.v2i2.130Keywords:
Sentiment Analysis, Machine Learning, Sports Products, TF-IDF, TokopediaAbstract
The rapid growth of e-commerce platforms in Indonesia, particularly Tokopedia, has resulted in a large volume of consumer reviews containing valuable information regarding customer perceptions and satisfaction. However, manual analysis of such reviews is inefficient and prone to subjectivity, necessitating an automated approach based on machine learning. This study aims to classify the sentiment of sports product reviews on Tokopedia into positive, negative, and neutral categories by applying Logistic Regression, Support Vector Machine (SVM), and Random Forest using the Term Frequency–Inverse Document Frequency (TF-IDF) approach. The data were collected through web scraping of Indonesian-language sports product reviews and processed through several preprocessing stages, including data cleaning, case folding, tokenization, stopword removal, and stemming. Feature representation was performed using TF-IDF to transform textual data into numerical vectors, after which the dataset was divided into training and testing sets with an 80:20 ratio. Model performance was evaluated using accuracy, precision, recall, and F1-score metrics. The results indicate that the application of TF-IDF significantly improves the performance of all models, with SVM consistently achieving the most optimal performance compared to Logistic Regression and Random Forest. These findings demonstrate that classical machine learning algorithms combined with TF-IDF remain highly effective for sentiment analysis of Indonesian-language text. The implications of this study are expected to assist sellers in understanding customer opinions, support consumers in making informed purchasing decisions, and serve as a foundation for the development of sentiment analysis and recommendation systems on e-commerce platforms.
References
Ariatmanto, D., & Rifai, A. M. (2024). The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection. Jurnal RESTI, 8(6), 730–736. https://doi.org/10.29207/resti.v8i6.6017
Ashbaugh, L., & Zhang, Y. (2024). A Comparative Study of Sentiment Analysis on Customer Reviews Using Machine Learning and Deep Learning. Computers, 13(12). https://doi.org/10.3390/computers13120340
Azka, F., #1, W., Romadhony, A., & #3, H. (n.d.). Sentiment Analysis of University Social Media Using Support Vector Machine and Logistic Regression Methods. https://doi.org/10.34818/indojc.2022.7.2.638
Basri, H., Junianto, M. B. S., & Kusyadi, I. (2024). Enhancing Usability Testing Through Sentiment Analysis: A Comparative Study Using SVM, Naive Bayes, Decision Trees and Random Forest. Jurnal Teknologi Sistem Informasi Dan Aplikasi, 7(4), 1603–1610. https://doi.org/10.32493/jtsi.v7i4.45117
Dakwah, M. M., Firdaus, A. A., Furizal, F., & Faresta, R. (2024). Sentiment Analysis on Marketplace in Indonesia using Support Vector Machine and Naïve Bayes Method. Jurnal Ilmiah Teknik Elektro Komputer Dan Informatika, 10(1), 39. https://doi.org/10.26555/jiteki.v10i1.28070
Fadhillah, O. S. D., Jaman, J. H., & Carudin, C. (2025). PERBANDINGAN NAIVE BAYES, SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION DAN RANDOM FOREST DALAM MENGANALISIS SENTIMEN MENGENAI TIKTOKSHOP. Jurnal Informatika Dan Teknik Elektro Terapan, 13(1). https://doi.org/10.23960/jitet.v13i1.5746
Fiddin, F., & Hidayat, T. (n.d.). Jurnal SAINTIKOM (Jurnal Sains Manajemen Informatika dan Komputer) Analisis Sentimen Ulasan Produk Sayur di Tokopedia Menggunakan Model Support Vector Machine Dengan Representasi TF-IDF. https://ojs.trigunadharma.ac.id/index.php/jis/index
Hadiwijaya, M. A., Pirdaus, F. P., Andrews, D., Achmad, S., & Sutoyo, R. (2023). Sentiment Analysis on Tokopedia Product Reviews using Natural Language Processing. 2023 International Conference on Informatics, Multimedia, Cyber and Information Systems, ICIMCIS 2023, 380–386. https://doi.org/10.1109/ICIMCIS60089.2023.10348996
Komparasi Sentiment Analysis Pada Review Aplikasi Tokopedia Dan Shopee Menggunakan Algoritma Naïve Bayes Dan Support Vector Machine Dengan Metode Tf-Idf. (n.d.).
Lin, C. H., & Nuha, U. (2023). Sentiment analysis of Indonesian datasets based on a hybrid deep-learning strategy. Journal of Big Data, 10(1). https://doi.org/10.1186/s40537-023-00782-9
Malik, N., & Bilal, M. (2024). Natural language processing for analyzing online customer reviews: a survey, taxonomy, and open research challenges. PeerJ Computer Science, 10. https://doi.org/10.7717/PEERJ-CS.2203
Muslim Karo Karo, I., Arifin Karo Karo, J., Djasmayena, S., & Wahyudi, R. (2025). Sentiment Analysis of Tourist Reviews at Waterfront City Pangururan Using Naive Bayes and TF-IDF Algorithm. Journal of Software Engineering, Information and Communication Technology (SEICT), 6(1), 13–20. https://doi.org/10.17509/seict.v6i1.86210
Rahmawati, L., & Santoso, D. B. (2023). IMPLEMENTASI METODE NAIVE BAYES UNTUK KLASIFIKASI ULASAN APLIKASI E-COMMERCE TOKOPEDIA IMPLEMENTATION OF NAIVE BAYES METHOD FOR CLASSIFICATION OF TOKOPEDIA E-COMMERCE APPLICATION REVIEW. Journal of Information Technology and Computer Science (INTECOMS), 6(1).
Stephenie, Warsito, B., & Prahutama, A. (2020). Sentiment Analysis on Tokopedia Product Online Reviews Using Random Forest Method. E3S Web of Conferences, 202. https://doi.org/10.1051/e3sconf/202020216006
Tri Rizkya, A., & Irham Gufroni, A. (n.d.). Implementation of the Naive Bayes Classifier for Sentiment Analysis of Shopee E-Commerce Application Review Data on the Google Play Store.
Umar, N., & Nur, M. A. (2022). Application of Naïve Bayes Algorithm Variations On Indonesian General Analysis Dataset for Sentiment Analysis. Jurnal RESTI, 6(4), 585–590. https://doi.org/10.29207/resti.v6i4.4179
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Prosiding Seminar Nasional Ilmu Teknik

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.





