Review on Plagiarism Detection Systems, Algorithms, Weakness Points

Authors

  • Khaled Omar Head of Artificail Intellegence Department, Information Technology Enginerring College, Damascus University
  • Nour Esmaeel Ph.D Student, Ph.D of Web Science Program, Syrian Virtual University
  • ZoAlfekar Ebrahim Ph.D Student, Ph.D of Web Science Program, Syrian Virtual University

Keywords:

Plagiarism Detection Systems, Plagiarism Detection Algorithms

Abstract

Plagiarism detection is the process of identifying instances of plagiarism in written or digital content. Plagiarism is the act of presenting someone else's work or ideas as one's own without proper attribution or permission, Plagiarism detection can be carried out using various techniques, including manual methods, such as reading and comparing texts, as well as automated methods, such as software-based systems, Automated plagiarism detection systems use algorithms to analyze the content of a document and compare it to other documents in their database or on the internet. Plagiarism detection is important field of natural language processing , artificial intelligence fields, many algorithms and systems have been developed to detect plagiarism, in this paper we will talk about plagiarism detection systems, plagiarism types, plagiarism detection algorithms types , and weakness point of plagiarism detection algorithms , we will talk in details about the most popular plagiarisms detection algorithms which contains string based algorithms, fingerprints based algorithms, semantic based algorithms, syntax based algorithms, deep learning based algorithms, and cross language plagiarism detection algorithms.

Downloads

Download data is not yet available.

References

Khan, Nosheena & Agrawal, Chetan & Ansari, Tehreem. (2018). A Review on Various Plagiarism Detection Systems Based on Exterior and Interior Method. IJARCCE. 7. 6-12. 10.17148/

IJARCCE.2018.792.

Citation-based Plagiarism Detection - Detecting Disguised and Cross-language Plagiarism using Citation Pattern Analysis - Scientific Figure on ResearchGate. Available from: https://www.researchgate.net/figure/Classification-of-Plagiarism-Detection-Approaches_fig2_262689913 [accessed 24 May, 2023]

7 Common Types of Plagiarism, With Examples. (2022, June 2). 7 Common Types of Plagiarism, With Examples | Grammarly Blog. https://www.grammarly.com/blog/types-of-plagiarism/

WHAT IS PLAGIARISM – CIBNP. (n.d.). https://www.cibnp.com/

what-is-plagiarism,(2022, June 2)

Direct Plagiarism and How to Avoid it. (2022, July 7). FixGerald.com. https://fixgerald.com/blog/direct-plagiarism

What is the difference between plagiarism and paraphrasing? (n.d.). Enago. https://www.enago.com/plagiarism-checker/resources/

difference-between-plagiarism-and-paraphrasing.htm

Writer. (2023, January 6). Prevent plagiarism before hitting publish - Writer. https://writer.com/guides/plagiarism/

Carmil. (2022). How to Understand and Avoid Accidental Plagiarism Using a Plagiarism Checker? Copyleaks. https://copyleaks.com/blog/accidental-plagiarism-understanding-and-avoiding-it

Tlitova A, Toschev A, Talanov M and Kurnosov V (2020) Meta-Analysis of Cross-Language Plagiarism and Self-Plagiarism Detection Methods for Russian-English Language Pair. Front. Comput. Sci. 2:523053. doi: 10.3389/fcomp.2020.523053

Abdi, A., Shamsuddin, S. M., Idris, N., Alguliev, R. M., & Aliguliyev, R. M. (2017). A linguistic treatment for automatic external plagiarism detection. Knowledge-based systems, 135, 135-146. https://doi.org/10.1016/j.knosys.2017.08.008

Zechner, Mario et al. “External and Intrinsic Plagiarism Detection Using Vector Space Models.” (2009).

Checkforplag. (n.d.). Internal and External Plagiarism. https://www.checkforplag.com/Internal-and-external-plagiarism

J. J. G. Adeva, N. L. Carroll and R. A. Calvo, "Applying Plagiarism Detection to Engineering Education," 2006 7th International Conference on Information Technology Based Higher Education and Training, Ultimo, Australia, 2006, pp. 722-731, doi: 10.1109/ITHET.2006.339692.

W. G. S. Parwita, I. G. A. A. D. Indradewi and I. N. S. W. Wijaya, "String Matching based Plagiarism Detection for Document in Bahasa Indonesia," 2019 5th International Conference on New Media Studies (CONMEDIA), Bali, Indonesia, 2019, pp. 54-58, doi: 10.1109/CONMEDIA46929.2019.8981821.

Pandey, K.L., Agarwal, S., Misra, S., Prasad, R. (2012). Plagiarism Detection in Software Using Efficient String Matching. In: , et al. Computational Science and Its Applications – ICCSA 2012. ICCSA 2012. Lecture Notes in Computer Science, vol 7336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31128-4_11

Importance of String Matching in Real World Problems - Scientific Figure on ResearchGate. Available from: https://www.researchgate.

net/figure/Plagiarism-Detection-System_fig8_304305210 [accessed 20 May, 2023] 26-WordNet, “About WordNet,” http://wordnet.

princeton.edu/2010

Narayanan, Sandhya & Surendran, Simi. (2012). Source code plagiarism detection and performance analysis using fingerprint based distance measure method. ICCSE 2012 - Proceedings of 2012 7th International Conference on Computer Science and Education. 1065-1068. 10.1109/ICCSE.2012.6295247.

E. G. Hasan, A. Wicaksana and S. Hansun, "The Implementation of Winnowing Algorithm for Plagiarism Detection in Moodle-based E-learning," 2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS), Singapore, 2018, pp. 321-325, doi: 10.1109/ICIS.2018.8466429.

Jaccard Similarity. (n.d.). https://www.learndatasci.com/glossary/

jaccard-similarity/

Adel Aljohani and Masnizah Mohd, 2014. Arabic-English Cross-language Plagiarism Detection using Winnowing Algorithm. Information Technology Journal, 13: 2349-2355.

Krause, Markus. (2015). Stylometry-based Fraud and Plagiarism Detection for Learning at Scale.

How to CiteSylvia Putri Gunawan, Lucia Dwi Krisnawati, & Chrismanto, A. R. (2020). Analysis of Stylometric Features and Segmentation Strategies in Intrinsic Plagiarism Detection System. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 4(5), 988-997. https://doi.org/10.29207/resti.v4i5.2486

Gipp, Bela & Meuschke, Norman. (2011). Citation Pattern Matching Algorithms for Citation-based Plagiarism Detection: Greedy Citation Tiling, Citation Chunking and Longest Common Citation Sequence.. DocEng 2011 - Proceedings of the 2011 ACM Symposium on Document Engineering. 249-258. 10.1145/

2034741.

Gipp, Bela & Beel, Joeran. (2010). Citation based Plagiarism detection: A new approach to identify plagiarized work language independently. HT'10 - Proceedings of the 21st ACM Conference on Hypertext and Hypermedia. 273-274. 10.1145/1810617.1810671.

Citation-based Plagiarism Detection – Idea, Implementation and Evaluation Gipp, B. (2012)

Bulletin of the IEEE Technical Committee on Digital Libraries, 8(1).

Sharma, Kamlesh & Garg, Nidhi & Pandey, Arun & Yadav, Daksh & Nikhil,. (2021). Plagiarism Detection Technique using www and Wordnet. Indian Journal of Artificial Intelligence and Neural Networking. 1. 1-6. 10.35940/ijainn.B1015.061321.

WordNet, “About WordNet,” http://wordnet.princeton.edu/2010

Shenoy, Manjula. (2012). Semantic Plagiarism Detection System Using Ontology Mapping. Advanced Computing: An International Journal. 3. 59-62. 10.5121/acij.2012.3306.

M. Duracik, P. Hrkut, E. Krsak and S. Toth, "Abstract Syntax Tree Based Source Code Antiplagiarism System for Large Projects Set," in IEEE Access, vol. 8, pp. 175347-175359, 2020, doi: 10.1109/ACCESS.2020.3026422.

Torres, R., Kunkel, J. M., Dolz, M. F. and Ludwig, T. (2018) Comparison of Clang Abstract Syntax Trees using string kernels. In: CADO 2018, 16-20 July, Orleans, France, pp. 106-113. Available at http://centaur.reading.ac.uk/79588/

Bamidis PD, Lithari C, Konstantinidis ST. Revisiting Information Technology tools serving authorship and editorship: a case-guided tutorial to statistical analysis and plagiarism detection. Hippokratia. 2010 Dec;14(Suppl 1):38-48. PMID: 21487489; PMCID: PMC3049420.

El-Rashidy, M.A., Mohamed, R.G., El-Fishawy, N.A. et al. Reliable plagiarism detection system based on deep learning approaches. Neural Comput & Applic 34, 18837–18858 (2022). https://doi.org/10.1007/s00521-022-07486-w

El Mostafa Hambi, Faouzia Benabbou,A New Online Plagiarism Detection System based on Deep Learning,(IJACSA) International Journal of Advanced Computer Science and Applications Vol. 11, No. 9, 2020

K. Omar and A. Hilal, "Plagiarism Detection in Arabic Documents using word2vector and Arabic WordNet," 2022 International Arab Conference on Information Technology (ACIT), Abu Dhabi, United Arab Emirates, 2022, pp. 1-5, doi: 10.1109/ACIT57182.

9994090.

Pennington, J. (n.d.). GloVe: Global Vectors for Word Representation. https://nlp.stanford.edu/projects/glove/

Khiled, Farah & Al-Tamimi, Mohammed. (2021). Hybrid System for Plagiarism Detection on A Scientific Paper. Turkish Journal of Computer and Mathematics Education (TURCOMAT). 12. 5707-5719.

Adel Aljohani and Masnizah Mohd, 2014. Arabic-English Cross-language Plagiarism Detection using Winnowing Algorithm. Information Technology Journal, 13: 2349-2355.

Downloads

Published

24.03.2024

How to Cite

Omar, K. ., Esmaeel, N. ., & Ebrahim, Z. . (2024). Review on Plagiarism Detection Systems, Algorithms, Weakness Points. International Journal of Intelligent Systems and Applications in Engineering, 12(18s), 693–699. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/5024

Issue

Section

Research Article