Table 17 Results of experiment III where the dataset is injected with plagiarism-simulated sentences and run against the subcorpus

From: An academic Arabic corpus for plagiarism detection: design, construction and experimentation

| N-gram Segments | Retrieved Dissertations | Segments in the Test Dataset | Segments with Simulated Plagiarism | Segments Identified as Plagiarized | Reported Plagiarism Ratio |
|---|---|---|---|---|---|
| 3 | 4 | 662 | 28 | 15 | 2.27% |
| 4 | 2 | 672 | 19 | 2 | 0.30% |
| 5 | 0 | 673 | 13 | 0 | 0.00% |
| 6 | 0 | 673 | 8 | 0 | 0.00% |
| 7 | 0 | 672 | 4 | 0 | 0.00% |
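
The reported ratios appear consistent with dividing the number of segments identified as plagiarized by the number of segments in the test dataset (for example, 15 / 662 ≈ 2.27%). That definition is inferred from the figures above and is not stated in the table itself. A minimal Python sketch, with the hypothetical names `plagiarism_ratio` and `check_rows`, that recomputes each row under this assumption:

```python
# Rows copied from Table 17:
# (n-gram size, retrieved dissertations, test segments,
#  simulated-plagiarism segments, identified segments, reported ratio in %)
ROWS = [
    (3, 4, 662, 28, 15, 2.27),
    (4, 2, 672, 19, 2, 0.30),
    (5, 0, 673, 13, 0, 0.00),
    (6, 0, 673, 8, 0, 0.00),
    (7, 0, 672, 4, 0, 0.00),
]


def plagiarism_ratio(identified: int, total_segments: int) -> float:
    """Assumed definition: identified segments as a percentage of all test segments."""
    return 100.0 * identified / total_segments


def check_rows(rows):
    """Return, per row, whether the recomputed ratio matches the reported one to two decimals."""
    results = []
    for _n, _retrieved, total, _simulated, identified, reported in rows:
        computed = plagiarism_ratio(identified, total)
        results.append(abs(computed - reported) < 0.005)
    return results


if __name__ == "__main__":
    for row, ok in zip(ROWS, check_rows(ROWS)):
        n, total, identified = row[0], row[2], row[4]
        print(f"n={n}: {plagiarism_ratio(identified, total):.2f}% matches reported: {ok}")
```

Under this assumed definition the ratio is measured against the whole test dataset, not against the segments that carried simulated plagiarism, so it should not be read as a detection rate.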