header
Local cover image
Local cover image
Image from OpenLibrary

Novel classification feature sets for source code plagiarism detection of java files / Eman Hosam Adel Elsayed ; Supervised Magda B. Fayek , Amir F. Sorial , Mayada M. Ali

By: Contributor(s): Material type: TextTextLanguage: English Publication details: Cairo : Eman Hosam Adel Elsayed , 2021Description: 85 P. ; 30cmOther title:
  • مجموعات مبتكرة من الخصائص لتصنيف و اكتشاف السرقة الأدبية لبرامج الجافا [Added title page title]
Subject(s): Online resources: Available additional physical forms:
  • Issued also as CD
Dissertation note: Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Computer Engineering Summary: In programming learning environments, the pressure of delivering many assignments makes plagiarism become the easiest solution. This problem of plagiarism threatens the learning process and obstructs the evaluation fairness. Therefore, fast, automatic and accurate detection of source code plagiarism becomes of the essence.This research proposes novel classification feature sets to detect whether a Java file is plagiarized.The proposed feature sets are based on using histograms to summarize the similarity matrix of function signatures and comparing the lexical code similarity of each individual class pair. For testing the effectiveness, a source code plagiarism dataset that consists of 12K Java files was used. The results show a 4% improvement in F-Measure. A re-annotation to the dataset is performed and improves F-Measure by 7.5%.
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Home library Call number Copy number Status Barcode
Thesis Thesis قاعة الرسائل الجامعية - الدور الاول المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.13.06.M.Sc.2021.Em.N (Browse shelf(Opens below)) Not for loan 01010110084524000
CD - Rom CD - Rom مخـــزن الرســائل الجـــامعية - البدروم المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.13.06.M.Sc.2021.Em.N (Browse shelf(Opens below)) 84524.CD Not for loan 01020110084524000

Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Computer Engineering

In programming learning environments, the pressure of delivering many assignments makes plagiarism become the easiest solution. This problem of plagiarism threatens the learning process and obstructs the evaluation fairness. Therefore, fast, automatic and accurate detection of source code plagiarism becomes of the essence.This research proposes novel classification feature sets to detect whether a Java file is plagiarized.The proposed feature sets are based on using histograms to summarize the similarity matrix of function signatures and comparing the lexical code similarity of each individual class pair. For testing the effectiveness, a source code plagiarism dataset that consists of 12K Java files was used. The results show a 4% improvement in F-Measure. A re-annotation to the dataset is performed and improves F-Measure by 7.5%.

Issued also as CD

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer

Local cover image