KOPS - The Institutional Repository of the University of Konstanz

Interpretable and Comparative Textual Dataset Exploration Using Near-Identity Mention Relations

Interpretable and Comparative Textual Dataset Exploration Using Near-Identity Mention Relations

Cite This

Files in this item

Files Size Format View

There are no files associated with this item.

ZHUKOVA, Anastasia, Felix HAMBORG, Bela GIPP, 2020. Interpretable and Comparative Textual Dataset Exploration Using Near-Identity Mention Relations. JCDL '20. China (Virtual Event), Aug 1, 2020 - Aug 5, 2020. In: HUANG, Ruhua, ed. and others. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020 (JCDL '20). New York:ACM, pp. 457-458. ISBN 978-1-4503-7585-6. Available under: doi: 10.1145/3383583.3398562

@inproceedings{Zhukova2020Inter-51923, title={Interpretable and Comparative Textual Dataset Exploration Using Near-Identity Mention Relations}, year={2020}, doi={10.1145/3383583.3398562}, isbn={978-1-4503-7585-6}, address={New York}, publisher={ACM}, booktitle={Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020 (JCDL '20)}, pages={457--458}, editor={Huang, Ruhua}, author={Zhukova, Anastasia and Hamborg, Felix and Gipp, Bela} }

<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/rdf/resource/123456789/51923"> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/rdf/resource/123456789/36"/> <foaf:homepage rdf:resource="http://localhost:8080/jspui"/> <dc:contributor>Gipp, Bela</dc:contributor> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/rdf/resource/123456789/36"/> <dcterms:title>Interpretable and Comparative Textual Dataset Exploration Using Near-Identity Mention Relations</dcterms:title> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2020-11-25T14:24:20Z</dcterms:available> <dc:creator>Zhukova, Anastasia</dc:creator> <dc:creator>Hamborg, Felix</dc:creator> <dc:contributor>Zhukova, Anastasia</dc:contributor> <dc:contributor>Hamborg, Felix</dc:contributor> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2020-11-25T14:24:20Z</dc:date> <dcterms:issued>2020</dcterms:issued> <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/51923"/> <dcterms:abstract xml:lang="eng">Dataset exploration is a set of techniques crucial in many research and data science projects. For textual datasets, commonly used techniques include topic modeling, document summarization, and methods related to dimension reduction. Despite their robustness, these techniques suffer from at least one of the following drawbacks: document summarization does not explicitly set documents in relation, the others yield summaries or topics that often are difficult to interpret and yield poor results for topics that consist of context-dependent terms. We propose a method for dataset exploration that employs cross-document near-identity resolution of mentions of semantic concepts, such as persons, other named entity types, events, actions. The method not only sets documents in relation and thus allows for comparative dataset exploration, but also yields well interpretable document representations. Additionally, due to the underlying approach for cross-document resolution of concept mentions, the method is able to set documents in relation as to their near-identity terms, e.g., synonyms that are not universally valid but only in the given dataset.</dcterms:abstract> <dc:language>eng</dc:language> <dc:creator>Gipp, Bela</dc:creator> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/> <dc:rights>terms-of-use</dc:rights> </rdf:Description> </rdf:RDF>

This item appears in the following Collection(s)

Search KOPS


Browse

My Account