Visual Analytics of Co-Occurrences to Discover Subspaces in Structured Data

dc.contributor.authorJentner, Wolfgang
dc.contributor.authorLindholz, Giuliana
dc.contributor.authorSchäfer, Hanna
dc.contributor.authorEl-Assady, Mennatallah
dc.contributor.authorMa, Kwan-Liu
dc.contributor.authorKeim, Daniel A.
dc.date.accessioned2023-01-30T08:31:02Z
dc.date.available2023-01-30T08:31:02Z
dc.date.issued2023eng
dc.description.abstractWe present an approach that shows all relevant subspaces of categorical data condensed in a single picture. We model the categorical values of the attributes as co-occurrences with data partitions generated from structured data using pattern mining. We show that these co-occurrences are a-priori allowing us to greatly reduce the search space effectively generating the condensed picture where conventional approaches filter out several subspaces as these are deemed insignificant. The task of identifying interesting subspaces is common but difficult due to exponential search spaces and the curse of dimensionality. One application of such a task might be identifying a cohort of patients defined by attributes such as gender, age, and diabetes type that share a common patient history, which is modeled as event sequences. Filtering the data by these attributes is common but cumbersome and often does not allow a comparison of subspaces. We contribute a powerful multi-dimensional pattern exploration approach (MDPE-approach) agnostic to the structured data type that models multiple attributes and their characteristics as co-occurrences, allowing the user to identify and compare thousands of subspaces of interest in a single picture. In our MDPE-approach, we introduce two methods to dramatically reduce the search space, outputting only the boundaries of the search space in the form of two tables. We implement the MDPE-approach in an interactive visual interface (MDPE-vis) that provides a scalable, pixel-based visualization design allowing the identification, comparison, and sense-making of subspaces in structured data. Our case studies using a gold-standard dataset and external domain experts confirm our approach’s and implementation’s applicability. A third use case sheds light on the scalability of our approach and a user study with 15 participants underlines its usefulness and power.eng
dc.description.versionpublishedeng
dc.identifier.doi10.1145/3579031eng
dc.identifier.ppn1857261380
dc.identifier.urihttps://kops.uni-konstanz.de/handle/123456789/59965
dc.language.isoengeng
dc.rightsterms-of-use
dc.rights.urihttps://rightsstatements.org/page/InC/1.0/
dc.subjectstructured data mining, pattern mining, subspace searcheng
dc.subject.ccsHuman-centered computing
dc.subject.ccsVisualization
dc.subject.ccsVisualization application domains
dc.subject.ccsVisual analytics
dc.subject.ddc004eng
dc.titleVisual Analytics of Co-Occurrences to Discover Subspaces in Structured Dataeng
dc.typeJOURNAL_ARTICLEeng
dspace.entity.typePublication
kops.citation.bibtex
@article{Jentner2023Visua-59965,
  year={2023},
  doi={10.1145/3579031},
  title={Visual Analytics of Co-Occurrences to Discover Subspaces in Structured Data},
  url={https://dl.acm.org/doi/10.1145/3579031},
  number={2},
  volume={13},
  issn={2160-6455},
  journal={ACM Transactions on Interactive Intelligent Systems},
  author={Jentner, Wolfgang and Lindholz, Giuliana and Schäfer, Hanna and El-Assady, Mennatallah and Ma, Kwan-Liu and Keim, Daniel A.},
  note={Article Number: 10}
}
kops.citation.iso690JENTNER, Wolfgang, Giuliana LINDHOLZ, Hanna SCHÄFER, Mennatallah EL-ASSADY, Kwan-Liu MA, Daniel A. KEIM, 2023. Visual Analytics of Co-Occurrences to Discover Subspaces in Structured Data. In: ACM Transactions on Interactive Intelligent Systems. ACM. 2023, 13(2), 10. ISSN 2160-6455. eISSN 2160-6463. Available under: doi: 10.1145/3579031deu
kops.citation.iso690JENTNER, Wolfgang, Giuliana LINDHOLZ, Hanna SCHÄFER, Mennatallah EL-ASSADY, Kwan-Liu MA, Daniel A. KEIM, 2023. Visual Analytics of Co-Occurrences to Discover Subspaces in Structured Data. In: ACM Transactions on Interactive Intelligent Systems. ACM. 2023, 13(2), 10. ISSN 2160-6455. eISSN 2160-6463. Available under: doi: 10.1145/3579031eng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/59965">
    <dcterms:abstract xml:lang="eng">We present an approach that shows all relevant subspaces of categorical data condensed in a single picture. We model the categorical values of the attributes as co-occurrences with data partitions generated from structured data using pattern mining. We show that these co-occurrences are a-priori allowing us to greatly reduce the search space effectively generating the condensed picture where conventional approaches filter out several subspaces as these are deemed insignificant. The task of identifying interesting subspaces is common but difficult due to exponential search spaces and the curse of dimensionality. One application of such a task might be identifying a cohort of patients defined by attributes such as gender, age, and diabetes type that share a common patient history, which is modeled as event sequences. Filtering the data by these attributes is common but cumbersome and often does not allow a comparison of subspaces. We contribute a powerful multi-dimensional pattern exploration approach (MDPE-approach) agnostic to the structured data type that models multiple attributes and their characteristics as co-occurrences, allowing the user to identify and compare thousands of subspaces of interest in a single picture. In our MDPE-approach, we introduce two methods to dramatically reduce the search space, outputting only the boundaries of the search space in the form of two tables. We implement the MDPE-approach in an interactive visual interface (MDPE-vis) that provides a scalable, pixel-based visualization design allowing the identification, comparison, and sense-making of subspaces in structured data. Our case studies using a gold-standard dataset and external domain experts confirm our approach’s and implementation’s applicability. A third use case sheds light on the scalability of our approach and a user study with 15 participants underlines its usefulness and power.</dcterms:abstract>
    <dc:contributor>El-Assady, Mennatallah</dc:contributor>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-01-30T08:31:02Z</dcterms:available>
    <dc:contributor>Jentner, Wolfgang</dc:contributor>
    <dcterms:issued>2023</dcterms:issued>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dcterms:title>Visual Analytics of Co-Occurrences to Discover Subspaces in Structured Data</dcterms:title>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:creator>Schäfer, Hanna</dc:creator>
    <dc:rights>terms-of-use</dc:rights>
    <dc:creator>El-Assady, Mennatallah</dc:creator>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/59965"/>
    <dc:contributor>Schäfer, Hanna</dc:contributor>
    <dc:creator>Jentner, Wolfgang</dc:creator>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/59965/3/Jentner_2-1e8x9clmy6c6v0.pdf"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/59965/3/Jentner_2-1e8x9clmy6c6v0.pdf"/>
    <dc:creator>Ma, Kwan-Liu</dc:creator>
    <dc:contributor>Keim, Daniel A.</dc:contributor>
    <dc:contributor>Lindholz, Giuliana</dc:contributor>
    <dc:contributor>Ma, Kwan-Liu</dc:contributor>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:creator>Keim, Daniel A.</dc:creator>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-01-30T08:31:02Z</dc:date>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:creator>Lindholz, Giuliana</dc:creator>
    <dc:language>eng</dc:language>
  </rdf:Description>
</rdf:RDF>
kops.description.funding{"first": "eu", "second": "826494"}
kops.description.openAccessopenaccesshybrid
kops.flag.isPeerReviewedtrueeng
kops.flag.knbibliographytrue
kops.identifier.nbnurn:nbn:de:bsz:352-2-1e8x9clmy6c6v0
kops.relation.euProjectID826494eng
kops.relation.uniknProjectTitlePRIMAGE - PRedictive In-silico Multiscale Analytics to support cancer personalized diaGnosis and prognosis, Empowered by imaging biomarkers
kops.sourcefieldACM Transactions on Interactive Intelligent Systems. ACM. 2023, <b>13</b>(2), 10. ISSN 2160-6455. eISSN 2160-6463. Available under: doi: 10.1145/3579031deu
kops.sourcefield.plainACM Transactions on Interactive Intelligent Systems. ACM. 2023, 13(2), 10. ISSN 2160-6455. eISSN 2160-6463. Available under: doi: 10.1145/3579031deu
kops.sourcefield.plainACM Transactions on Interactive Intelligent Systems. ACM. 2023, 13(2), 10. ISSN 2160-6455. eISSN 2160-6463. Available under: doi: 10.1145/3579031eng
kops.urlhttps://dl.acm.org/doi/10.1145/3579031eng
relation.isAuthorOfPublication4e437f35-db84-47a0-bf20-94a9801aeb18
relation.isAuthorOfPublication5d29a894-1518-44b8-b42b-10cbcb037506
relation.isAuthorOfPublicationb5024762-2f04-44ad-9c05-c6305f6e7f8b
relation.isAuthorOfPublicationda7dafb0-6003-4fd4-803c-11e1e72d621a
relation.isAuthorOfPublication.latestForDiscovery4e437f35-db84-47a0-bf20-94a9801aeb18
source.bibliographicInfo.articleNumber10
source.bibliographicInfo.issue2
source.bibliographicInfo.volume13
source.identifier.eissn2160-6463eng
source.identifier.issn2160-6455eng
source.periodicalTitleACM Transactions on Interactive Intelligent Systemseng
source.publisherACMeng

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
Jentner_2-1e8x9clmy6c6v0.pdf
Größe:
8.88 MB
Format:
Adobe Portable Document Format
Jentner_2-1e8x9clmy6c6v0.pdf
Jentner_2-1e8x9clmy6c6v0.pdfGröße: 8.88 MBDownloads: 201

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
license.txt
Größe:
3.96 KB
Format:
Item-specific license agreed upon to submission
Beschreibung:
license.txt
license.txtGröße: 3.96 KBDownloads: 0