Measure-Driven Visual Analytics of Categorical Data

Lade...
Vorschaubild
Dateien
Dennig_2-nj5k6k8ri1t02.pdf
Dennig_2-nj5k6k8ri1t02.pdfGröße: 21.6 MBDownloads: 34
Datum
2024
Herausgeber:innen
Kontakt
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
DOI (zitierfähiger Link)
ArXiv-ID
Internationale Patentnummer
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Open Access Green
Core Facility der Universität Konstanz
Gesperrt bis
Titel in einer weiteren Sprache
Publikationstyp
Dissertation
Publikationsstatus
Published
Erschienen in
Zusammenfassung

Visual Analytics (VA) enables data analysts and domain experts to engage in analytical reasoning through interactive visual interfaces. One type of data often encountered in data analysis tasks is categorical data. Unlike numerical data, categorical data with nominal attributes has no inherent order or scale and, therefore, does not lend itself to the application of common arithmetic operations. However, many data mining and visualization techniques are predominantly based on numerical data. Notwithstanding these challenges, the analysis of categorical data is crucial in various domains, including linguistics and software engineering. This dissertation addresses the challenges posed by categorical data, including difficulties in establishing an order of attributes for visualization and defining numerical abstractions. This work bridges the qualitative-quantitative divide in the visual analysis of categorical data by introducing abstractions that improve the readability of categorical data visualizations, developing new strategies for applying methods typically designed for numerical data, and exploring their interplay with numerical data. This thesis is structured in three parts: The first part introduces quality measures for the Parallel Sets visualization. In addition, we present measures that guide the exploration of categorical data projections by suggesting attributes that differentiate groups of data items. The second part presents measure-driven approaches for expressing categorical data properties and deriving numerical representations for the domains of linguistics and software engineering, demonstrating the power of measure-driven approaches in real-world applications. The third part addresses the joint analysis of categorical attributes and numerical data dimensions. It offers strategies for the use of categorical data for model training and exploratory data analysis in supervised and unsupervised frameworks. Finally, this thesis outlines the limitations and lessons learned from the explored measure-driven approaches and suggests future directions for more effectively integrating categorical data into VA with the goal of improving the readability of visualization, pattern quantification and user guidance. In conclusion, this work improves the analysis and visualization of categorical data by proposing new measure-driven approaches, improving readability and interpretability of visualizations, providing domain-agnostic and domain-specific support for exploratory data analysis, and their integration into supervised and unsupervised VA frameworks.

Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
004 Informatik
Schlagwörter
Konferenz
Rezension
undefined / . - undefined, undefined
Forschungsvorhaben
Organisationseinheiten
Zeitschriftenheft
Datensätze
Zitieren
ISO 690DENNIG, Frederik L., 2024. Measure-Driven Visual Analytics of Categorical Data [Dissertation]. Konstanz: Universität Konstanz
BibTex
@phdthesis{Dennig2024-09-30Measu-70870,
  year={2024},
  title={Measure-Driven Visual Analytics of Categorical Data},
  author={Dennig, Frederik L.},
  address={Konstanz},
  school={Universität Konstanz}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/70870">
    <dc:contributor>Dennig, Frederik L.</dc:contributor>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dcterms:title>Measure-Driven Visual Analytics of Categorical Data</dcterms:title>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/70870/3/Dennig_2-nj5k6k8ri1t02.pdf"/>
    <dcterms:issued>2024-09-30</dcterms:issued>
    <dcterms:abstract>Visual Analytics (VA) enables data analysts and domain experts to engage in analytical reasoning through interactive visual interfaces.  One type of data often encountered in data analysis tasks is categorical data. Unlike numerical data, categorical data with nominal attributes has no inherent order or scale and, therefore, does not lend itself to the application of common arithmetic operations. However, many data mining and visualization techniques are predominantly based on numerical data. Notwithstanding these challenges, the analysis of categorical data is crucial in various domains, including linguistics and software engineering. This dissertation addresses the challenges posed by categorical data, including difficulties in establishing an order of attributes for visualization and defining numerical abstractions. This work bridges the qualitative-quantitative divide in the visual analysis of categorical data by introducing abstractions that improve the readability of categorical data visualizations, developing new strategies for applying methods typically designed for numerical data, and exploring their interplay with numerical data. This thesis is structured in three parts: The first part introduces quality measures for the Parallel Sets visualization. In addition, we present measures that guide the exploration of categorical data projections by suggesting attributes that differentiate groups of data items. The second part presents measure-driven approaches for expressing categorical data properties and deriving numerical representations for the domains of linguistics and software engineering, demonstrating the power of measure-driven approaches in real-world applications. The third part addresses the joint analysis of categorical attributes and numerical data dimensions. It offers strategies for the use of categorical data for model training and exploratory data analysis in supervised and unsupervised frameworks. Finally, this thesis outlines the limitations and lessons learned from the explored measure-driven approaches and suggests future directions for more effectively integrating categorical data into VA with the goal of improving the readability of visualization, pattern quantification and user guidance. In conclusion, this work improves the analysis and visualization of categorical data by proposing new measure-driven approaches, improving readability and interpretability of visualizations, providing domain-agnostic and domain-specific support for exploratory data analysis, and their integration into supervised and unsupervised VA frameworks.</dcterms:abstract>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:rights>terms-of-use</dc:rights>
    <dc:language>eng</dc:language>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2024-09-30T11:07:29Z</dc:date>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2024-09-30T11:07:29Z</dcterms:available>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/70870/3/Dennig_2-nj5k6k8ri1t02.pdf"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/70870"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:creator>Dennig, Frederik L.</dc:creator>
  </rdf:Description>
</rdf:RDF>
Interner Vermerk
xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter
Kontakt
URL der Originalveröffentl.
Prüfdatum der URL
Prüfungsdatum der Dissertation
July 17, 2024
Hochschulschriftenvermerk
Konstanz, Univ., Diss., 2024
Finanzierungsart
Kommentar zur Publikation
Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Begutachtet
Diese Publikation teilen