Publikation: Comparative Exploration of Document Collections : a Visual Analytics Approach
Dateien
Datum
Autor:innen
Herausgeber:innen
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
URI (zitierfähiger Link)
DOI (zitierfähiger Link)
Internationale Patentnummer
Link zur Lizenz
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Core Facility der Universität Konstanz
Titel in einer weiteren Sprache
Publikationstyp
Publikationsstatus
Erschienen in
Zusammenfassung
We present an analysis and visualization method for computing what distinguishes a given document collection from others. We determine topics that discriminate a subset of collections from the remaining ones by applying probabilistic topic modeling and subsequently approximating the two relevant criteria distinctiveness and characteristicness algorithmically through a set of heuristics. Furthermore, we suggest a novel visualization method called DiTop-View, in which topics are represented by glyphs (topic coins) that are arranged on a 2D plane. Topic coins are designed to encode all information necessary for performing comparative analyses such as the class membership of a topic, its most probable terms and the discriminative relations. We evaluate our topic analysis using statistical measures and a small user experiment and present an expert case study with researchers from political sciences analyzing two real-world datasets.
Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
Schlagwörter
Konferenz
Rezension
Zitieren
ISO 690
OELKE, Daniela, Hendrik STROBELT, Christian ROHRDANTZ, Iryna GUREVYCH, Oliver DEUSSEN, 2014. Comparative Exploration of Document Collections : a Visual Analytics Approach. In: Computer Graphics Forum. 2014, 33(3), pp. 201-210. ISSN 0167-7055. eISSN 1467-8659. Available under: doi: 10.1111/cgf.12376BibTex
@article{Oelke2014Compa-29163, year={2014}, doi={10.1111/cgf.12376}, title={Comparative Exploration of Document Collections : a Visual Analytics Approach}, number={3}, volume={33}, issn={0167-7055}, journal={Computer Graphics Forum}, pages={201--210}, author={Oelke, Daniela and Strobelt, Hendrik and Rohrdantz, Christian and Gurevych, Iryna and Deussen, Oliver} }
RDF
<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/29163"> <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/29163"/> <dc:creator>Strobelt, Hendrik</dc:creator> <dcterms:issued>2014</dcterms:issued> <dc:creator>Deussen, Oliver</dc:creator> <dc:contributor>Strobelt, Hendrik</dc:contributor> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dc:contributor>Gurevych, Iryna</dc:contributor> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dcterms:title>Comparative Exploration of Document Collections : a Visual Analytics Approach</dcterms:title> <dcterms:bibliographicCitation>Computer Graphics Forum, 33 (2014), 3. - S.</dcterms:bibliographicCitation> <dc:contributor>Rohrdantz, Christian</dc:contributor> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-10-22T09:57:37Z</dcterms:available> <dc:contributor>Deussen, Oliver</dc:contributor> <dc:language>eng</dc:language> <dcterms:abstract xml:lang="eng">We present an analysis and visualization method for computing what distinguishes a given document collection from others. We determine topics that discriminate a subset of collections from the remaining ones by applying probabilistic topic modeling and subsequently approximating the two relevant criteria distinctiveness and characteristicness algorithmically through a set of heuristics. Furthermore, we suggest a novel visualization method called DiTop-View, in which topics are represented by glyphs (topic coins) that are arranged on a 2D plane. Topic coins are designed to encode all information necessary for performing comparative analyses such as the class membership of a topic, its most probable terms and the discriminative relations. We evaluate our topic analysis using statistical measures and a small user experiment and present an expert case study with researchers from political sciences analyzing two real-world datasets.</dcterms:abstract> <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/29163/3/Oelke_0-253295.pdf"/> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-10-22T09:57:37Z</dc:date> <dc:creator>Gurevych, Iryna</dc:creator> <dc:creator>Rohrdantz, Christian</dc:creator> <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/> <dc:contributor>Oelke, Daniela</dc:contributor> <dc:creator>Oelke, Daniela</dc:creator> <dc:rights>terms-of-use</dc:rights> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/29163/3/Oelke_0-253295.pdf"/> </rdf:Description> </rdf:RDF>