Publikation: CitePlag : A Citation-based Plagiarism Detection System Prototype
Dateien
Datum
Herausgeber:innen
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
Internationale Patentnummer
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Core Facility der Universität Konstanz
Titel in einer weiteren Sprache
Publikationstyp
Publikationsstatus
Erschienen in
Zusammenfassung
This paper presents an open-source prototype of a citation-based plagiarism detection system called CitePlag. The underlying idea of the system is to evaluate the citations of academic documents as language independent markers to detect plagiarism. CitePlag uses three different detection algorithms that analyze the citation sequence of academic documents for similar patterns that may indicate unduly used foreign text or ideas. The algorithms consider multiple citation-related factors such as proximity and order of citations within the text, or their probability of co-occurrence in order to compute document similarity scores. We present technical details of CitePlag’s detection algorithms and the acquisition of test data from the PubMed Central Open Access Subset. Future advancement of the prototype lies in increasing the reference database by enabling the system to process more document and citation formats. Improving CitePlag’s detection algorithms and scoring functions to reduce the number of false positives is another major goal. Eventually, we plan to integrate text-based detection algorithms in addition to the citation-based detection algorithms within CitePlag.
Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
Schlagwörter
Konferenz
Rezension
Zitieren
ISO 690
MEUSCHKE, Norman, Bela GIPP, Corinna BREITINGER, 2012. CitePlag : A Citation-based Plagiarism Detection System Prototype. 5th International Plagiarism Conference IIP. Newcastle, 16. Juli 2012 - 18. Juli 2012. In: Proceedings of the 5th International Plagiarism Conference ‘12. 2012BibTex
@inproceedings{Meuschke2012CiteP-31362, year={2012}, title={CitePlag : A Citation-based Plagiarism Detection System Prototype}, url={http://www.plagiarismadvice.org/research-papers/item/a-citation-based-plagiarism-detection-system-prototype}, booktitle={Proceedings of the 5th International Plagiarism Conference ‘12}, author={Meuschke, Norman and Gipp, Bela and Breitinger, Corinna} }
RDF
<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/31362"> <dc:contributor>Meuschke, Norman</dc:contributor> <dc:creator>Breitinger, Corinna</dc:creator> <dcterms:issued>2012</dcterms:issued> <dc:language>eng</dc:language> <dc:creator>Meuschke, Norman</dc:creator> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dcterms:title>CitePlag : A Citation-based Plagiarism Detection System Prototype</dcterms:title> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-07-06T13:25:41Z</dcterms:available> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dc:contributor>Gipp, Bela</dc:contributor> <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/31362"/> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-07-06T13:25:41Z</dc:date> <dc:contributor>Breitinger, Corinna</dc:contributor> <dc:creator>Gipp, Bela</dc:creator> <dcterms:abstract xml:lang="eng">This paper presents an open-source prototype of a citation-based plagiarism detection system called CitePlag. The underlying idea of the system is to evaluate the citations of academic documents as language independent markers to detect plagiarism. CitePlag uses three different detection algorithms that analyze the citation sequence of academic documents for similar patterns that may indicate unduly used foreign text or ideas. The algorithms consider multiple citation-related factors such as proximity and order of citations within the text, or their probability of co-occurrence in order to compute document similarity scores. We present technical details of CitePlag’s detection algorithms and the acquisition of test data from the PubMed Central Open Access Subset. Future advancement of the prototype lies in increasing the reference database by enabling the system to process more document and citation formats. Improving CitePlag’s detection algorithms and scoring functions to reduce the number of false positives is another major goal. Eventually, we plan to integrate text-based detection algorithms in addition to the citation-based detection algorithms within CitePlag.</dcterms:abstract> </rdf:Description> </rdf:RDF>