Klasifikasi Sentimen Ulasan Produk Olahraga di Tokopedia Menggunakan Metode Machine Learning dengan Pendekatan TF-IDF

Authors

  • Fransiskus Dapot Sihaloho Universitas Dinamika Bangsa
  • Jasmir Jasmir Universitas Dinamika Bangsa
  • Gunardi Gunardi Universitas Dinamika Bangsa

DOI:

https://doi.org/10.61132/prosemnasproit.v2i2.130

Keywords:

Sentiment Analysis, Machine Learning, Sports Products, TF-IDF, Tokopedia

Abstract

The rapid growth of e-commerce platforms in Indonesia, particularly Tokopedia, has resulted in a large volume of consumer reviews containing valuable information regarding customer perceptions and satisfaction. However, manual analysis of such reviews is inefficient and prone to subjectivity, necessitating an automated approach based on machine learning. This study aims to classify the sentiment of sports product reviews on Tokopedia into positive, negative, and neutral categories by applying Logistic Regression, Support Vector Machine (SVM), and Random Forest using the Term Frequency–Inverse Document Frequency (TF-IDF) approach. The data were collected through web scraping of Indonesian-language sports product reviews and processed through several preprocessing stages, including data cleaning, case folding, tokenization, stopword removal, and stemming. Feature representation was performed using TF-IDF to transform textual data into numerical vectors, after which the dataset was divided into training and testing sets with an 80:20 ratio. Model performance was evaluated using accuracy, precision, recall, and F1-score metrics. The results indicate that the application of TF-IDF significantly improves the performance of all models, with SVM consistently achieving the most optimal performance compared to Logistic Regression and Random Forest. These findings demonstrate that classical machine learning algorithms combined with TF-IDF remain highly effective for sentiment analysis of Indonesian-language text. The implications of this study are expected to assist sellers in understanding customer opinions, support consumers in making informed purchasing decisions, and serve as a foundation for the development of sentiment analysis and recommendation systems on e-commerce platforms.

References

Ariatmanto, D., & Rifai, A. M. (2024). The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection. Jurnal RESTI, 8(6), 730–736. https://doi.org/10.29207/resti.v8i6.6017

Ashbaugh, L., & Zhang, Y. (2024). A Comparative Study of Sentiment Analysis on Customer Reviews Using Machine Learning and Deep Learning. Computers, 13(12). https://doi.org/10.3390/computers13120340

Azka, F., #1, W., Romadhony, A., & #3, H. (n.d.). Sentiment Analysis of University Social Media Using Support Vector Machine and Logistic Regression Methods. https://doi.org/10.34818/indojc.2022.7.2.638

Basri, H., Junianto, M. B. S., & Kusyadi, I. (2024). Enhancing Usability Testing Through Sentiment Analysis: A Comparative Study Using SVM, Naive Bayes, Decision Trees and Random Forest. Jurnal Teknologi Sistem Informasi Dan Aplikasi, 7(4), 1603–1610. https://doi.org/10.32493/jtsi.v7i4.45117

Dakwah, M. M., Firdaus, A. A., Furizal, F., & Faresta, R. (2024). Sentiment Analysis on Marketplace in Indonesia using Support Vector Machine and Naïve Bayes Method. Jurnal Ilmiah Teknik Elektro Komputer Dan Informatika, 10(1), 39. https://doi.org/10.26555/jiteki.v10i1.28070

Fadhillah, O. S. D., Jaman, J. H., & Carudin, C. (2025). PERBANDINGAN NAIVE BAYES, SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION DAN RANDOM FOREST DALAM MENGANALISIS SENTIMEN MENGENAI TIKTOKSHOP. Jurnal Informatika Dan Teknik Elektro Terapan, 13(1). https://doi.org/10.23960/jitet.v13i1.5746

Fiddin, F., & Hidayat, T. (n.d.). Jurnal SAINTIKOM (Jurnal Sains Manajemen Informatika dan Komputer) Analisis Sentimen Ulasan Produk Sayur di Tokopedia Menggunakan Model Support Vector Machine Dengan Representasi TF-IDF. https://ojs.trigunadharma.ac.id/index.php/jis/index

Hadiwijaya, M. A., Pirdaus, F. P., Andrews, D., Achmad, S., & Sutoyo, R. (2023). Sentiment Analysis on Tokopedia Product Reviews using Natural Language Processing. 2023 International Conference on Informatics, Multimedia, Cyber and Information Systems, ICIMCIS 2023, 380–386. https://doi.org/10.1109/ICIMCIS60089.2023.10348996

Komparasi Sentiment Analysis Pada Review Aplikasi Tokopedia Dan Shopee Menggunakan Algoritma Naïve Bayes Dan Support Vector Machine Dengan Metode Tf-Idf. (n.d.).

Lin, C. H., & Nuha, U. (2023). Sentiment analysis of Indonesian datasets based on a hybrid deep-learning strategy. Journal of Big Data, 10(1). https://doi.org/10.1186/s40537-023-00782-9

Malik, N., & Bilal, M. (2024). Natural language processing for analyzing online customer reviews: a survey, taxonomy, and open research challenges. PeerJ Computer Science, 10. https://doi.org/10.7717/PEERJ-CS.2203

Muslim Karo Karo, I., Arifin Karo Karo, J., Djasmayena, S., & Wahyudi, R. (2025). Sentiment Analysis of Tourist Reviews at Waterfront City Pangururan Using Naive Bayes and TF-IDF Algorithm. Journal of Software Engineering, Information and Communication Technology (SEICT), 6(1), 13–20. https://doi.org/10.17509/seict.v6i1.86210

Rahmawati, L., & Santoso, D. B. (2023). IMPLEMENTASI METODE NAIVE BAYES UNTUK KLASIFIKASI ULASAN APLIKASI E-COMMERCE TOKOPEDIA IMPLEMENTATION OF NAIVE BAYES METHOD FOR CLASSIFICATION OF TOKOPEDIA E-COMMERCE APPLICATION REVIEW. Journal of Information Technology and Computer Science (INTECOMS), 6(1).

Stephenie, Warsito, B., & Prahutama, A. (2020). Sentiment Analysis on Tokopedia Product Online Reviews Using Random Forest Method. E3S Web of Conferences, 202. https://doi.org/10.1051/e3sconf/202020216006

Tri Rizkya, A., & Irham Gufroni, A. (n.d.). Implementation of the Naive Bayes Classifier for Sentiment Analysis of Shopee E-Commerce Application Review Data on the Google Play Store.

Umar, N., & Nur, M. A. (2022). Application of Naïve Bayes Algorithm Variations On Indonesian General Analysis Dataset for Sentiment Analysis. Jurnal RESTI, 6(4), 585–590. https://doi.org/10.29207/resti.v6i4.4179

Downloads

Published

2025-12-30

How to Cite

Fransiskus Dapot Sihaloho, Jasmir Jasmir, & Gunardi Gunardi. (2025). Klasifikasi Sentimen Ulasan Produk Olahraga di Tokopedia Menggunakan Metode Machine Learning dengan Pendekatan TF-IDF. Prosiding Seminar Nasional Ilmu Teknik, 2(2), 976–989. https://doi.org/10.61132/prosemnasproit.v2i2.130