CitePlag : A Citation-based Plagiarism Detection System Prototype


MEUSCHKE, Norman, Bela GIPP, Corinna BREITINGER, 2012. CitePlag : A Citation-based Plagiarism Detection System Prototype. 5th International Plagiarism Conference IIP. Newcastle, 16 Jul 2012 - 18 Jul 2012. In: Proceedings of the 5th International Plagiarism Conference ’12

@inproceedings{Meuschke2012CiteP-31362,
  title={CitePlag : A Citation-based Plagiarism Detection System Prototype},
  url={},
  year={2012},
  booktitle={Proceedings of the 5th International Plagiarism Conference ’12},
  author={Meuschke, Norman and Gipp, Bela and Breitinger, Corinna}
}

<rdf:RDF xmlns:dcterms="" xmlns:dc="" xmlns:rdf="" xmlns:bibo="" xmlns:dspace="" xmlns:foaf="" xmlns:void="" xmlns:xsd="" >
  <rdf:Description rdf:about="">
    <dcterms:isPartOf rdf:resource=""/>
    <dc:contributor>Breitinger, Corinna</dc:contributor>
    <dcterms:available rdf:datatype="">2015-07-06T13:25:41Z</dcterms:available>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dcterms:issued>2012</dcterms:issued>
    <dcterms:title>CitePlag : A Citation-based Plagiarism Detection System Prototype</dcterms:title>
    <dc:contributor>Gipp, Bela</dc:contributor>
    <dc:date rdf:datatype="">2015-07-06T13:25:41Z</dc:date>
    <dc:language>eng</dc:language>
    <dc:creator>Breitinger, Corinna</dc:creator>
    <dc:creator>Gipp, Bela</dc:creator>
    <dcterms:abstract xml:lang="eng">This paper presents an open-source prototype of a citation-based plagiarism detection system called CitePlag. The underlying idea of the system is to evaluate the citations of academic documents as language independent markers to detect plagiarism. CitePlag uses three different detection algorithms that analyze the citation sequence of academic documents for similar patterns that may indicate unduly used foreign text or ideas. The algorithms consider multiple citation-related factors such as proximity and order of citations within the text, or their probability of co-occurrence in order to compute document similarity scores. We present technical details of CitePlag’s detection algorithms and the acquisition of test data from the PubMed Central Open Access Subset. Future advancement of the prototype lies in increasing the reference database by enabling the system to process more document and citation formats. Improving CitePlag’s detection algorithms and scoring functions to reduce the number of false positives is another major goal. Eventually, we plan to integrate text-based detection algorithms in addition to the citation-based detection algorithms within CitePlag.</dcterms:abstract>
    <foaf:homepage rdf:resource="http://localhost:8080/jspui"/>
    <bibo:uri rdf:resource=""/>
    <dc:contributor>Meuschke, Norman</dc:contributor>
    <dspace:isPartOfCollection rdf:resource=""/>
    <dc:creator>Meuschke, Norman</dc:creator>
  </rdf:Description>
</rdf:RDF>
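
The abstract says the detection algorithms compare the order of citations in two documents to compute a similarity score. As a rough illustration of that general idea only, the Python sketch below scores two citation sequences by their longest common subsequence. This is an assumed stand-in, not one of CitePlag's three algorithms: the function names, the normalization by the shorter sequence, and the reference identifiers are all hypothetical.

```python
from typing import Sequence


def longest_common_citation_sequence(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest common subsequence of two citation sequences.

    Each sequence lists reference identifiers in the order they are cited.
    Shared citations count only if they appear in the same relative order.
    """
    # prev[j] is the LCS length of the already processed prefix of `a` and b[:j].
    prev = [0] * (len(b) + 1)
    for cited_a in a:
        curr = [0]
        for j, cited_b in enumerate(b, start=1):
            if cited_a == cited_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def order_similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Hypothetical score in [0, 1]: LCS length divided by the shorter length."""
    shorter = min(len(a), len(b))
    if shorter == 0:
        return 0.0
    return longest_common_citation_sequence(a, b) / shorter


# Made-up reference identifiers, for illustration only.
doc_a = ["r1", "r2", "r3", "r4"]
doc_b = ["r1", "r5", "r3", "r4", "r6"]
score = order_similarity(doc_a, doc_b)
```

Because the comparison uses reference identifiers rather than words, a measure of this kind does not depend on the language of the text, which is the property the abstract highlights. It ignores the proximity and co-occurrence factors that the abstract also mentions.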

