WEB-BASED DOCUMENT SIMILARITY CLASSIFICATION USING THE WINNOWING ALGORITHM WITH A JACCARD COEFFICIENT APPROACH

ZACHRIAS, UZDHA (2024) KLASIFIKASI SIMILARITAS DOKUMEN MENGGUNAKAN ALGORITMA WINNOWING DENGAN PENDEKATAN JACCARD COEFFICIENT BERBASIS WEB. S1 thesis, Universitas Mercu Buana Jakarta.

01 COVER.pdf: Cover page (568kB)
02 ABSTRAK.pdf: Abstract (32kB)
03 BAB 1.pdf: Chapter I (198kB), restricted to registered users
04 BAB 2.pdf: Chapter II (525kB), restricted to registered users
05 BAB 3.pdf: Chapter III (151kB), restricted to registered users
06 BAB 4.pdf: Chapter IV (627kB), restricted to registered users
07 BAB 5.pdf: Chapter V (28kB), restricted to registered users
08 DAFTAR PUSTAKA.pdf: Bibliography (92kB), restricted to registered users
09 LAMPIRAN.pdf: Appendices (993kB), restricted to registered users

Abstract

Advances in information technology have made a wide range of sources easily accessible over the internet, changing how students conduct research. This convenience brings significant benefits, but it also makes plagiarism easier, which harms academic work. Plagiarism is the copying or taking of ideas from someone else's work without proper attribution, in violation of academic guidelines. This research aims to develop an effective plagiarism detection system tailored to the Indonesian language. The system uses the Winnowing algorithm with a Jaccard Coefficient approach, combined with the removal of non-descriptive words (stopwords) in Indonesian. The sample consists of Indonesian-language final projects (tugas akhir) by Mercu Buana University students, collected from the university's repository and analyzed to measure the level of similarity between documents and the performance of the Winnowing algorithm in detecting plagiarism. The results indicate that the plagiarism detection system built on the Winnowing algorithm and the Jaccard Coefficient approach achieves high accuracy, and that its similarity index is accurate and relevant for Indonesian-language documents.

Keywords: Jaccard Coefficient, Plagiarism, Stopword, Winnowing
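
The pipeline named in the abstract (stopword removal, Winnowing fingerprints, Jaccard Coefficient) can be illustrated with a minimal Python sketch. This is not the thesis's implementation: the stopword list, the k-gram length `k`, the window size `w`, the use of CRC32 as the hash, and all function names are assumptions chosen only for illustration.

```python
import re
import zlib

# Hypothetical, deliberately tiny Indonesian stopword list (illustration only).
STOPWORDS = {"yang", "dan", "di", "ke", "dari", "ini", "itu", "untuk", "dengan", "pada"}


def preprocess(text):
    """Lowercase, keep alphanumeric words, drop stopwords, join without spaces."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return "".join(word for word in words if word not in STOPWORDS)


def kgram_hashes(s, k):
    """Hash every k-character substring; CRC32 stands in for a rolling hash."""
    return [zlib.crc32(s[i:i + k].encode("utf-8")) for i in range(len(s) - k + 1)]


def winnow(hashes, w):
    """Keep the minimum hash of every window of w consecutive hashes.

    Positions are not recorded here, because the Jaccard step below
    only needs the set of selected hash values.
    """
    if not hashes:
        return set()
    if len(hashes) < w:
        return {min(hashes)}
    return {min(hashes[i:i + w]) for i in range(len(hashes) - w + 1)}


def jaccard(a, b):
    """Jaccard Coefficient |A ∩ B| / |A ∪ B|; defined as 0.0 for two empty sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity(text_a, text_b, k=5, w=4):
    """Fingerprint both texts and compare the fingerprint sets (k and w are assumed)."""
    fp_a = winnow(kgram_hashes(preprocess(text_a), k), w)
    fp_b = winnow(kgram_hashes(preprocess(text_b), k), w)
    return jaccard(fp_a, fp_b)


if __name__ == "__main__":
    doc_a = "Sistem ini mendeteksi plagiarisme pada dokumen berbahasa Indonesia"
    doc_b = "Sistem mendeteksi plagiarisme dokumen tugas akhir mahasiswa"
    print(f"similarity: {similarity(doc_a, doc_b):.2f}")
```

Comparing fingerprint sets rather than full texts keeps the comparison cheap, and the window minimum means shared passages of sufficient length tend to produce shared fingerprints. The resulting score is sensitive to the choice of `k` and `w`, which this record does not state.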

Item Type: Thesis (S1)
Call Number CD: FIK/INFO. 24 070
Call Number: SIK/15/24/057
NIM/NIDN Creators: 41520010243
Uncontrolled Keywords: Jaccard Coefficient, Plagiarism, Stopword, Winnowing
Subjects: 000 Computer Science, Information and General Works > 004 Data Processing, Computer Science
500 Natural Science and Mathematics > 510 Mathematics > 518 Numerical Analysis
500 Natural Science and Mathematics > 510 Mathematics > 518 Numerical Analysis > 518.1 Algorithms
600 Technology > 650 Management, Public Relations, Business and Auxiliary Service > 652 Process of Written Communication > 652.4 Duplication of Records Methods
Divisions: Faculty of Computer Science > Informatics
Depositing User: khalimah
Date Deposited: 20 Mar 2024 03:31
Last Modified: 20 Mar 2024 03:31
URI: http://repository.mercubuana.ac.id/id/eprint/87302
