From: An academic Arabic corpus for plagiarism detection: design, construction and experimentation
N-gram Segments | Retrieved Dissertations | Segments in the Test Dataset | Segments with Simulated Plagiarism | Segments Identified as Plagiarized | Reported Plagiarism Ratio |
---|---|---|---|---|---|
3 | 4 | 718 | 99 | 114 | 15.88% |
4 | 2 | 725 | 97 | 99 | 13.93% |
5 | 2 | 730 | 95 | 95 | 13.01% |
6 | 2 | 731 | 93 | 93 | 12.72% |
7 | 2 | 729 | 91 | 91 | 12.48% |