ENSEMBLE MODEL NAIVE BAYES DAN RANDOM FOREST UNTUK KLASIFIKASI SENTIMEN DAN PREDIKSI ULASAN PENGGUNA APLIKASI GOOGLE GEMINI

HARYANTI, JESICA WALANDA (2025) ENSEMBLE MODEL NAIVE BAYES DAN RANDOM FOREST UNTUK KLASIFIKASI SENTIMEN DAN PREDIKSI ULASAN PENGGUNA APLIKASI GOOGLE GEMINI. S1 thesis, Universitas Mercu Buana Jakarta.

[img]
Preview
Text (HAL COVER)
01. COVER.pdf

Download (316kB) | Preview
[img] Text (BAB I)
02. BAB 1.pdf
Restricted to Registered users only

Download (242kB)
[img] Text (BAB II)
03. BAB 2.pdf
Restricted to Registered users only

Download (445kB)
[img] Text (BAB III)
04. BAB 3.pdf
Restricted to Registered users only

Download (198kB)
[img] Text (BAB IV)
05. BAB 4.pdf
Restricted to Registered users only

Download (393kB)
[img] Text (BAB V)
06. BAB 5.pdf
Restricted to Registered users only

Download (99kB)
[img] Text (DAFTAR PUSTAKA)
07. DAFTAR PUSTAKA.pdf
Restricted to Registered users only

Download (305kB)
[img] Text (LAMPIRAN)
08. LAMLPIRAN.pdf
Restricted to Registered users only

Download (178kB)

Abstract

This study aims to analyze user sentiment toward the Google Gemini application on the Google Play Store and to evaluate daily perception trends using a machine learning approach. The variable examined is user sentiment, categorized into positive and negative classes. A total of 3,000 reviews were collected using purposive sampling through web scraping with Python on Google Colaboratory. The data underwent preprocessing steps including text normalization, tokenization, stopword removal, and stemming. The initial classification used the Multinomial Naive Bayes algorithm, enhanced through data balancing with the SMOTE technique and an ensemble model combining Naive Bayes and Random Forest with soft voting. The Naive Bayes model achieved 80% accuracy but had low recall for negative reviews. The ensemble model improved accuracy to 85% and reached a macro F1-score of 0.80. Furthermore, the daily sentiment trend over the past month was successfully modeled with a prediction accuracy of 85.3%. The study concludes that ensemble and data balancing methods are effective in improving sentiment classification performance and are suitable for real-time user opinion monitoring. Keywords: Sentiment Analysis, Google Gemini, Naive Bayes, Ensemble Penelitian ini bertujuan untuk menganalisis sentimen pengguna terhadap aplikasi Google Gemini di Google Play Store dan mengevaluasi tren persepsi harian menggunakan pendekatan machine learning. Variabel yang diteliti adalah sentimen pengguna, dikategorikan menjadi positif dan negatif. Sebanyak 3.000 ulasan dikumpulkan menggunakan metode purposive sampling melalui teknik web scraping dengan Python di Google Colaboratory. Data diproses melalui tahapan preprocessing mencakup normalisasi teks, tokenisasi, penghapusan stopword, dan stemming. Algoritma Multinomial Naive Bayes digunakan untuk klasifikasi awal, kemudian ditingkatkan melalui teknik SMOTE untuk penyeimbangan data dan model ensemble berbasis soft voting yang menggabungkan Naive Bayes dan Random Forest. Hasil analisis menunjukkan bahwa model Naive Bayes menghasilkan akurasi 80% namun kurang sensitif terhadap kelas negatif. Model ensemble meningkatkan akurasi menjadi 85% dan F1-score makro sebesar 0.80. Tren sentimen harian selama satu bulan terakhir juga berhasil dimodelkan dengan akurasi prediksi 85,3%. Kesimpulan penelitian ini menunjukkan bahwa pendekatan ensemble dan balancing data efektif dalam meningkatkan performa klasifikasi dan relevan untuk pemantauan opini pengguna secara berkala. Kata Kunci: Analisis Sentimen, Google Gemini, Naive Bayes, Ensemble

Item Type: Thesis (S1)
Call Number CD: FIK/SI. 25 069
NIM/NIDN Creators: 41821010021
Uncontrolled Keywords: Analisis Sentimen, Google Gemini, Naive Bayes, Ensemble
Subjects: 000 Computer Science, Information and General Works/Ilmu Komputer, Informasi, dan Karya Umum > 000. Computer Science, Information and General Works/Ilmu Komputer, Informasi, dan Karya Umum > 005 Computer Programmming, Programs, Data/Pemprograman Komputer, Program, Data > 005.5 General Purpose Application Programs/Program Aplikasi dengan Kegunaan Khusus
000 Computer Science, Information and General Works/Ilmu Komputer, Informasi, dan Karya Umum > 000. Computer Science, Information and General Works/Ilmu Komputer, Informasi, dan Karya Umum > 006 Special Computer Methods/Metode Komputer Tertentu > 006.3 Artificial Intelligence/Kecerdasan Buatan > 006.31 Machine Learning/Pembelajaran Mesin
000 Computer Science, Information and General Works/Ilmu Komputer, Informasi, dan Karya Umum > 000. Computer Science, Information and General Works/Ilmu Komputer, Informasi, dan Karya Umum > 006 Special Computer Methods/Metode Komputer Tertentu > 006.3 Artificial Intelligence/Kecerdasan Buatan > 006.35 Natural Language Processing/Pengolahan Bahasa Alami
300 Social Science/Ilmu-ilmu Sosial > 300. Social Science/Ilmu-ilmu Sosial > 303 Social Process/Proses Sosial > 303.3 Coordination and Control/Koordinasi dan Kontrol > 303.38 Public Opinion/Opini Publik
500 Natural Science and Mathematics/Ilmu-ilmu Alam dan Matematika > 510 Mathematics/Matematika > 518 Numerical Analysis/Analisis Numerik, Analisa Numerik > 518.1 Algorithms/Algoritma
Divisions: Fakultas Ilmu Komputer > Sistem Informasi
Depositing User: khalimah
Date Deposited: 21 Aug 2025 08:37
Last Modified: 21 Aug 2025 08:37
URI: http://repository.mercubuana.ac.id/id/eprint/96961

Actions (login required)

View Item View Item