Towards reproducibility in recommender-systems research

dc.contributor.authorBeel, Joeran
dc.contributor.authorBreitinger, Corinna
dc.contributor.authorLanger, Stefan
dc.contributor.authorLommatzsch, Andreas
dc.contributor.authorGipp, Bela
dc.date.accessioned2016-04-01T08:21:57Z
dc.date.available2016-04-01T08:21:57Z
dc.date.issued2016-03-12eng
dc.description.abstractNumerous recommendation approaches are in use today. However, comparing their effectiveness is a challenging task because evaluation results are rarely reproducible. In this article, we examine the challenge of reproducibility in recommender-system research. We conduct experiments using Plista’s news recommender system, and Docear’s research-paper recommender system. The experiments show that there are large discrepancies in the effectiveness of identical recommendation approaches in only slightly different scenarios, as well as large discrepancies for slightly different approaches in identical scenarios. For example, in one news-recommendation scenario, the performance of a content-based filtering approach was twice as high as the second-best approach, while in another scenario the same content-based filtering approach was the worst performing approach. We found several determinants that may contribute to the large discrepancies observed in recommendation effectiveness. Determinants we examined include user characteristics (gender and age), datasets, weighting schemes, the time at which recommendations were shown, and user-model size. Some of the determinants have interdependencies. For instance, the optimal size of an algorithms’ user model depended on users’ age. Since minor variations in approaches and scenarios can lead to significant changes in a recommendation approach’s performance, ensuring reproducibility of experimental results is difficult. We discuss these findings and conclude that to ensure reproducibility, the recommender-system community needs to (1) survey other research fields and learn from them, (2) find a common understanding of reproducibility, (3) identify and understand the determinants that affect reproducibility, (4) conduct more comprehensive experiments, (5) modernize publication practices, (6) foster the development and use of recommendation frameworks, and (7) establish best-practice guidelines for recommender-systems research.eng
dc.description.versionpublishedeng
dc.identifier.doi10.1007/s11257-016-9174-xeng
dc.identifier.ppn475012798
dc.identifier.urihttps://kops.uni-konstanz.de/handle/123456789/33528
dc.language.isoengeng
dc.rightsterms-of-use
dc.rights.urihttps://rightsstatements.org/page/InC/1.0/
dc.subject.ddc004eng
dc.titleTowards reproducibility in recommender-systems researcheng
dc.typeJOURNAL_ARTICLEeng
dspace.entity.typePublication
kops.citation.bibtex
@article{Beel2016-03-12Towar-33528,
  year={2016},
  doi={10.1007/s11257-016-9174-x},
  title={Towards reproducibility in recommender-systems research},
  number={1},
  volume={26},
  issn={0924-1868},
  journal={User Modeling and User-Adapted Interaction : umuai},
  pages={69--101},
  author={Beel, Joeran and Breitinger, Corinna and Langer, Stefan and Lommatzsch, Andreas and Gipp, Bela}
}
kops.citation.iso690BEEL, Joeran, Corinna BREITINGER, Stefan LANGER, Andreas LOMMATZSCH, Bela GIPP, 2016. Towards reproducibility in recommender-systems research. In: User Modeling and User-Adapted Interaction : umuai. 2016, 26(1), pp. 69-101. ISSN 0924-1868. eISSN 1573-1391. Available under: doi: 10.1007/s11257-016-9174-xdeu
kops.citation.iso690BEEL, Joeran, Corinna BREITINGER, Stefan LANGER, Andreas LOMMATZSCH, Bela GIPP, 2016. Towards reproducibility in recommender-systems research. In: User Modeling and User-Adapted Interaction : umuai. 2016, 26(1), pp. 69-101. ISSN 0924-1868. eISSN 1573-1391. Available under: doi: 10.1007/s11257-016-9174-xeng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/33528">
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:contributor>Gipp, Bela</dc:contributor>
    <dc:creator>Lommatzsch, Andreas</dc:creator>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:contributor>Breitinger, Corinna</dc:contributor>
    <dcterms:title>Towards reproducibility in recommender-systems research</dcterms:title>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2016-04-01T08:21:57Z</dc:date>
    <dcterms:abstract xml:lang="eng">Numerous recommendation approaches are in use today. However, comparing their effectiveness is a challenging task because evaluation results are rarely reproducible. In this article, we examine the challenge of reproducibility in recommender-system research. We conduct experiments using Plista’s news recommender system, and Docear’s research-paper recommender system. The experiments show that there are large discrepancies in the effectiveness of identical recommendation approaches in only slightly different scenarios, as well as large discrepancies for slightly different approaches in identical scenarios. For example, in one news-recommendation scenario, the performance of a content-based filtering approach was twice as high as the second-best approach, while in another scenario the same content-based filtering approach was the worst performing approach. We found several determinants that may contribute to the large discrepancies observed in recommendation effectiveness. Determinants we examined include user characteristics (gender and age), datasets, weighting schemes, the time at which recommendations were shown, and user-model size. Some of the determinants have interdependencies. For instance, the optimal size of an algorithms’ user model depended on users’ age. Since minor variations in approaches and scenarios can lead to significant changes in a recommendation approach’s performance, ensuring reproducibility of experimental results is difficult. We discuss these findings and conclude that to ensure reproducibility, the recommender-system community needs to (1) survey other research fields and learn from them, (2) find a common understanding of reproducibility, (3) identify and understand the determinants that affect reproducibility, (4) conduct more comprehensive experiments, (5) modernize publication practices, (6) foster the development and use of recommendation frameworks, and (7) establish best-practice guidelines for recommender-systems research.</dcterms:abstract>
    <dc:creator>Gipp, Bela</dc:creator>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/33528/1/Beel_0-324818.pdf"/>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/33528/1/Beel_0-324818.pdf"/>
    <dc:contributor>Langer, Stefan</dc:contributor>
    <dc:rights>terms-of-use</dc:rights>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/33528"/>
    <dc:contributor>Beel, Joeran</dc:contributor>
    <dc:creator>Breitinger, Corinna</dc:creator>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:creator>Langer, Stefan</dc:creator>
    <dc:contributor>Lommatzsch, Andreas</dc:contributor>
    <dc:creator>Beel, Joeran</dc:creator>
    <dcterms:issued>2016-03-12</dcterms:issued>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2016-04-01T08:21:57Z</dcterms:available>
    <dc:language>eng</dc:language>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
  </rdf:Description>
</rdf:RDF>
kops.description.openAccessopenaccessgreen
kops.flag.knbibliographytrue
kops.identifier.nbnurn:nbn:de:bsz:352-0-324818
kops.sourcefieldUser Modeling and User-Adapted Interaction : umuai. 2016, <b>26</b>(1), pp. 69-101. ISSN 0924-1868. eISSN 1573-1391. Available under: doi: 10.1007/s11257-016-9174-xdeu
kops.sourcefield.plainUser Modeling and User-Adapted Interaction : umuai. 2016, 26(1), pp. 69-101. ISSN 0924-1868. eISSN 1573-1391. Available under: doi: 10.1007/s11257-016-9174-xdeu
kops.sourcefield.plainUser Modeling and User-Adapted Interaction : umuai. 2016, 26(1), pp. 69-101. ISSN 0924-1868. eISSN 1573-1391. Available under: doi: 10.1007/s11257-016-9174-xeng
relation.isAuthorOfPublicationebdceabd-fdd9-44b2-b7bb-57ebea6b5574
relation.isAuthorOfPublication358ad52f-dab7-4582-bf8e-8adcf477a2d4
relation.isAuthorOfPublication.latestForDiscoveryebdceabd-fdd9-44b2-b7bb-57ebea6b5574
source.bibliographicInfo.fromPage69eng
source.bibliographicInfo.issue1eng
source.bibliographicInfo.toPage101eng
source.bibliographicInfo.volume26eng
source.identifier.eissn1573-1391eng
source.identifier.issn0924-1868eng
source.periodicalTitleUser Modeling and User-Adapted Interaction : umuaieng

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
Beel_0-324818.pdf
Größe:
1.76 MB
Format:
Adobe Portable Document Format
Beschreibung:
Beel_0-324818.pdf
Beel_0-324818.pdfGröße: 1.76 MBDownloads: 1551