International Journal of Innovative Research in Electrical, Electronics, Instrumentation and Control Engineering
A monthly peer-reviewed and refereed journal
ISSN Online 2321-2004 | ISSN Print 2321-5526 | Since 2013
Volume 13, Issue 3, March 2025

A COMPARATIVE ANALYSIS OF JACCARD AND COSINE SIMILARITY FOR PLAGIARISM DETECTION

Kanishkaa. S, Santhi. K

Abstract: Plagiarism, the unauthorized use or imitation of another’s work without proper acknowledgment, poses a significant challenge in academia, research, and professional content creation, a challenge amplified by the widespread sharing of digital information. Reliable plagiarism detection systems are essential to ensure originality and maintain integrity. This paper investigates two widely used algorithms, Jaccard similarity and Cosine similarity, for their effectiveness in detecting textual similarities. Jaccard similarity excels at identifying exact or near-exact overlaps but struggles with rephrased content, whereas Cosine similarity captures deeper semantic similarities, including paraphrasing, but is computationally more demanding. Preprocessing techniques, such as tokenization, stop-word removal, and stemming, are applied to improve the algorithms’ performance. The research evaluates their strengths, limitations, and computational efficiency through a detailed comparative analysis, offering insights into their suitability for specific applications. The findings emphasize the importance of balancing detection accuracy against computational cost, guiding the selection of appropriate methods for plagiarism detection in various contexts.

Keywords: Plagiarism Detection, Cosine Similarity, Jaccard Similarity, Text Similarity, Text Preprocessing
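
As a rough illustration of the two measures named in the abstract, the sketch below computes Jaccard similarity on token sets and Cosine similarity on term-frequency vectors after a simple preprocessing pass. It is not the authors' implementation: the stop-word list, the suffix-stripping `crude_stem` helper, and all function names are assumptions made for this example, and a real system would likely use a proper stemmer and possibly TF-IDF weighting.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not taken from the paper).
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "in", "on", "and", "to"}


def crude_stem(token):
    """Rough suffix stripping; a hypothetical stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, drop stop words, and stem the remaining tokens."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [crude_stem(t) for t in tokens if t not in STOP_WORDS]


def jaccard_similarity(tokens_a, tokens_b):
    """|A intersect B| / |A union B| over the sets of distinct tokens."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    union = set_a | set_b
    if not union:
        return 0.0  # convention chosen here for two empty documents
    return len(set_a & set_b) / len(union)


def cosine_similarity(tokens_a, tokens_b):
    """Dot product of term-frequency vectors divided by their norms."""
    freq_a, freq_b = Counter(tokens_a), Counter(tokens_b)
    dot = sum(freq_a[t] * freq_b[t] for t in freq_a.keys() & freq_b.keys())
    norm_a = math.sqrt(sum(c * c for c in freq_a.values()))
    norm_b = math.sqrt(sum(c * c for c in freq_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    doc_a = preprocess("The cat sat on the mat")
    doc_b = preprocess("The cat is sitting on the mat")
    print("Jaccard:", jaccard_similarity(doc_a, doc_b))
    print("Cosine:", round(cosine_similarity(doc_a, doc_b), 4))
```

Jaccard works on sets, so repeated words do not change its score, while Cosine works on frequency vectors and therefore reflects how often each term occurs. Building and normalizing those vectors is the main source of the extra computation the abstract attributes to Cosine similarity.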

How to Cite:

[1] Kanishkaa. S, Santhi. K, “A COMPARATIVE ANALYSIS OF JACCARD AND COSINE SIMILARITY FOR PLAGIARISM DETECTION,” International Journal of Innovative Research in Electrical, Electronics, Instrumentation and Control Engineering (IJIREEICE), DOI: 10.17148/IJIREEICE.2025.13309

This work is licensed under a Creative Commons Attribution 4.0 International License.