A linguistic treatment for automatic external plagiarism detection

Asad Abdi, Siti Mariyam Shamsuddin, Norisma Idris, Rasim M. Alguliev, Ramiz M. Aliguliyev

Research output: Contribution to journalArticleAcademicpeer-review

8 Citations (Scopus)

Abstract

Plagiarism is the unauthorized use of the ideas, presentation of someone else's words or work as your own. This paper presents an External Plagiarism Detection System (EPDS), which employs a combination of the Semantic Role Labeling (SRL) technique, the semantic and syntactic information. Most of the available methods fail to capture the meaning in the comparison between a source document sentence and a suspicious document sentence when two sentences have same surface text. Therefore, it leads to incorrect or even unnecessary matching results. However, the proposed method is able to avoid selecting the source text sentence whose similarity with suspicious text sentence is high but its meaning is different. On the other hand, an author may change the sentence from: active to passive and vice versa; hence, the method also employed the SRL technique to tackle the aforementioned challenge. Furthermore, the method used the content word expansion approach to bridge the lexical gaps and identify the similar ideas that are expressed using different wording. The proposed method is able to detect different types of plagiarism such as the exact verbatim copying, paraphrasing, transformation of sentences, changing of word structure. As a result, the experimental results have displayed that the proposed method is able to improve the performance compared with the participating systems in PAN-PC-11 and other existing techniques.
Original languageEnglish
Pages (from-to)135-146
JournalKnowledge-based systems
Volume135
DOIs
Publication statusPublished - 2017
Externally publishedYes

Keywords

  • Paraphrase recognition
  • Syntax-semantic
  • Extrinsic plagiarism
  • Semantic analysis

Fingerprint Dive into the research topics of 'A linguistic treatment for automatic external plagiarism detection'. Together they form a unique fingerprint.

Cite this