Evaluation of similarity metrics for programming code plagiarism detection method (CROSBI ID 594949)
Prilog sa skupa u časopisu | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Juričić, Vedran
engleski
Evaluation of similarity metrics for programming code plagiarism detection method
This paper shortly presents source code plagiarism detection method based on the low-level language. The similarity or distance metric that is used to calculate similarity coefficient between two source files has great impact on method's performance and results. This paper analyzes precision and recall of four most commonly used metrics, Levenstein distance, Cosine similarity, NGram similarity and Greedy String Tilling. Testing is based on various test cases that represent the most frequent code modification techniques.
agiarism detection; similarity; source code; similarity metric
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
83-88.
2011.
nije evidentirano
objavljeno
Podaci o matičnoj publikaciji
Central European conference on information and intelligent systems
Tihomir Hunjak, Sandra Lovrenčić, Igor Tomičić
Varaždin: Fakultet organizacije i informatike Sveučilišta u Zagrebu
1847-2001
Podaci o skupu
Central European Conference on Information and Intelligent Systems
predavanje
21.09.2011-23.09.2011
Varaždin, Hrvatska