Measure-Driven Visual Analytics of Categorical Data
Dateien
Datum
Autor:innen
Herausgeber:innen
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
URI (zitierfähiger Link)
Internationale Patentnummer
Link zur Lizenz
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Sammlungen
Core Facility der Universität Konstanz
Titel in einer weiteren Sprache
Publikationstyp
Publikationsstatus
Erschienen in
Zusammenfassung
Visual Analytics (VA) enables data analysts and domain experts to engage in analytical reasoning through interactive visual interfaces. One type of data often encountered in data analysis tasks is categorical data. Unlike numerical data, categorical data with nominal attributes has no inherent order or scale and, therefore, does not lend itself to the application of common arithmetic operations. However, many data mining and visualization techniques are predominantly based on numerical data. Notwithstanding these challenges, the analysis of categorical data is crucial in various domains, including linguistics and software engineering. This dissertation addresses the challenges posed by categorical data, including difficulties in establishing an order of attributes for visualization and defining numerical abstractions. This work bridges the qualitative-quantitative divide in the visual analysis of categorical data by introducing abstractions that improve the readability of categorical data visualizations, developing new strategies for applying methods typically designed for numerical data, and exploring their interplay with numerical data. This thesis is structured in three parts: The first part introduces quality measures for the Parallel Sets visualization. In addition, we present measures that guide the exploration of categorical data projections by suggesting attributes that differentiate groups of data items. The second part presents measure-driven approaches for expressing categorical data properties and deriving numerical representations for the domains of linguistics and software engineering, demonstrating the power of measure-driven approaches in real-world applications. The third part addresses the joint analysis of categorical attributes and numerical data dimensions. It offers strategies for the use of categorical data for model training and exploratory data analysis in supervised and unsupervised frameworks. Finally, this thesis outlines the limitations and lessons learned from the explored measure-driven approaches and suggests future directions for more effectively integrating categorical data into VA with the goal of improving the readability of visualization, pattern quantification and user guidance. In conclusion, this work improves the analysis and visualization of categorical data by proposing new measure-driven approaches, improving readability and interpretability of visualizations, providing domain-agnostic and domain-specific support for exploratory data analysis, and their integration into supervised and unsupervised VA frameworks.
Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
Schlagwörter
Konferenz
Rezension
Zitieren
ISO 690
DENNIG, Frederik L., 2024. Measure-Driven Visual Analytics of Categorical Data [Dissertation]. Konstanz: Universität KonstanzBibTex
@phdthesis{Dennig2024-09-30Measu-70870, year={2024}, title={Measure-Driven Visual Analytics of Categorical Data}, author={Dennig, Frederik L.}, address={Konstanz}, school={Universität Konstanz} }
RDF
<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/70870"> <dc:contributor>Dennig, Frederik L.</dc:contributor> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dcterms:title>Measure-Driven Visual Analytics of Categorical Data</dcterms:title> <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/70870/3/Dennig_2-nj5k6k8ri1t02.pdf"/> <dcterms:issued>2024-09-30</dcterms:issued> <dcterms:abstract>Visual Analytics (VA) enables data analysts and domain experts to engage in analytical reasoning through interactive visual interfaces. One type of data often encountered in data analysis tasks is categorical data. Unlike numerical data, categorical data with nominal attributes has no inherent order or scale and, therefore, does not lend itself to the application of common arithmetic operations. However, many data mining and visualization techniques are predominantly based on numerical data. Notwithstanding these challenges, the analysis of categorical data is crucial in various domains, including linguistics and software engineering. This dissertation addresses the challenges posed by categorical data, including difficulties in establishing an order of attributes for visualization and defining numerical abstractions. This work bridges the qualitative-quantitative divide in the visual analysis of categorical data by introducing abstractions that improve the readability of categorical data visualizations, developing new strategies for applying methods typically designed for numerical data, and exploring their interplay with numerical data. This thesis is structured in three parts: The first part introduces quality measures for the Parallel Sets visualization. In addition, we present measures that guide the exploration of categorical data projections by suggesting attributes that differentiate groups of data items. The second part presents measure-driven approaches for expressing categorical data properties and deriving numerical representations for the domains of linguistics and software engineering, demonstrating the power of measure-driven approaches in real-world applications. The third part addresses the joint analysis of categorical attributes and numerical data dimensions. It offers strategies for the use of categorical data for model training and exploratory data analysis in supervised and unsupervised frameworks. Finally, this thesis outlines the limitations and lessons learned from the explored measure-driven approaches and suggests future directions for more effectively integrating categorical data into VA with the goal of improving the readability of visualization, pattern quantification and user guidance. In conclusion, this work improves the analysis and visualization of categorical data by proposing new measure-driven approaches, improving readability and interpretability of visualizations, providing domain-agnostic and domain-specific support for exploratory data analysis, and their integration into supervised and unsupervised VA frameworks.</dcterms:abstract> <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/> <dc:rights>terms-of-use</dc:rights> <dc:language>eng</dc:language> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2024-09-30T11:07:29Z</dc:date> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2024-09-30T11:07:29Z</dcterms:available> <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/70870/3/Dennig_2-nj5k6k8ri1t02.pdf"/> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/70870"/> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dc:creator>Dennig, Frederik L.</dc:creator> </rdf:Description> </rdf:RDF>