Relating interesting quantitative time series patterns with text events and text features
Dateien
Datum
Herausgeber:innen
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
URI (zitierfähiger Link)
DOI (zitierfähiger Link)
Internationale Patentnummer
Link zur Lizenz
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Sammlungen
Core Facility der Universität Konstanz
Titel in einer weiteren Sprache
Publikationstyp
Publikationsstatus
Erschienen in
Zusammenfassung
In many application areas, the key to successful data analysis is the integrated analysis of heterogeneous data. One example is the financial domain, where time-dependent and highly frequent quantitative data (e.g., trading volume and price information) and textual data (e.g., economic and political news reports) need to be considered jointly. Data analysis tools need to support an integrated analysis, which allows studying the relationships between textual news documents and quantitative properties of the stock market price series. In this paper, we describe a workflow and tool that allows a flexible formation of hypotheses about text features and their combinations, which reflect quantitative phenomena observed in stock data. To support such an analysis, we combine the analysis steps of frequent quantitative and text-oriented data using an existing a-priori method. First, based on heuristics we extract interesting intervals and patterns in large time series data. The visual analysis supports the analyst in exploring parameter combinations and their results. The identified time series patterns are then input for the second analysis step, in which all identified intervals of interest are analyzed for frequent patterns co-occurring with financial news. An a-priori method supports the discovery of such sequential temporal patterns. Then, various text features like the degree of sentence nesting, noun phrase complexity, the vocabulary richness, etc. are extracted from the news to obtain meta patterns. Meta patterns are defined by a specific combination of text features which significantly differ from the text features of the remaining news data. Our approach combines a portfolio of visualization and analysis techniques, including time-, cluster- and sequence visualization and analysis functionality. We provide two case studies, showing the effectiveness of our combined quantitative and textual analysis work flow. The workflow can also be generalized to other application domains such as data analysis of smart grids, cyber physical systems or the security of critical infrastructure, where the data consists of a combination of quantitative and textual time series data.
Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
Schlagwörter
Konferenz
Rezension
Zitieren
ISO 690
WANNER, Franz, Tobias SCHRECK, Wolfgang JENTNER, Lyubka SHARALIEVA, Daniel A. KEIM, 2014. Relating interesting quantitative time series patterns with text events and text features. IS&T/SPIE Electronic Imaging. San Francisco, California, USA. In: WONG, Pak Chung, ed. and others. Visualization and Data Analysis 2014. SPIE, 2014, 90170G. SPIE Proceedings. 9017. Available under: doi: 10.1117/12.2039639BibTex
@inproceedings{Wanner2014-02-03Relat-26414, year={2014}, doi={10.1117/12.2039639}, title={Relating interesting quantitative time series patterns with text events and text features}, number={9017}, publisher={SPIE}, series={SPIE Proceedings}, booktitle={Visualization and Data Analysis 2014}, editor={Wong, Pak Chung}, author={Wanner, Franz and Schreck, Tobias and Jentner, Wolfgang and Sharalieva, Lyubka and Keim, Daniel A.}, note={Article Number: 90170G} }
RDF
<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/26414"> <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/26414/2/Wanner-264141.pdf"/> <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/26414/2/Wanner-264141.pdf"/> <dc:creator>Wanner, Franz</dc:creator> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dc:contributor>Schreck, Tobias</dc:contributor> <dcterms:bibliographicCitation>Visualization and data analysis 2014 : 3 - 5 February 2014, San Francisco, California, United States ; proceedings IS&T/SPIE electronic imaging, science and technology / sponsored and publ. by IS&T - the Society for Imaging Science and Technology; SPIE. Pak Chung Wong ... (Ed.). - Bellingham, Wash. : SPIE [u.a.], 2014. - (Proceedings of SPIE ; 9017). - ISBN 978-0-8194-9934-9</dcterms:bibliographicCitation> <dc:creator>Sharalieva, Lyubka</dc:creator> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-03-05T20:18:43Z</dcterms:available> <dcterms:abstract xml:lang="eng">In many application areas, the key to successful data analysis is the integrated analysis of heterogeneous data. One example is the financial domain, where time-dependent and highly frequent quantitative data (e.g., trading volume and price information) and textual data (e.g., economic and political news reports) need to be considered jointly. Data analysis tools need to support an integrated analysis, which allows studying the relationships between textual news documents and quantitative properties of the stock market price series. In this paper, we describe a workflow and tool that allows a flexible formation of hypotheses about text features and their combinations, which reflect quantitative phenomena observed in stock data. To support such an analysis, we combine the analysis steps of frequent quantitative and text-oriented data using an existing a-priori method. First, based on heuristics we extract interesting intervals and patterns in large time series data. The visual analysis supports the analyst in exploring parameter combinations and their results. The identified time series patterns are then input for the second analysis step, in which all identified intervals of interest are analyzed for frequent patterns co-occurring with financial news. An a-priori method supports the discovery of such sequential temporal patterns. Then, various text features like the degree of sentence nesting, noun phrase complexity, the vocabulary richness, etc. are extracted from the news to obtain meta patterns. Meta patterns are defined by a specific combination of text features which significantly differ from the text features of the remaining news data. Our approach combines a portfolio of visualization and analysis techniques, including time-, cluster- and sequence visualization and analysis functionality. We provide two case studies, showing the effectiveness of our combined quantitative and textual analysis work flow. The workflow can also be generalized to other application domains such as data analysis of smart grids, cyber physical systems or the security of critical infrastructure, where the data consists of a combination of quantitative and textual time series data.</dcterms:abstract> <dc:contributor>Jentner, Wolfgang</dc:contributor> <dc:language>eng</dc:language> <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/26414"/> <dcterms:title>Relating interesting quantitative time series patterns with text events and text features</dcterms:title> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dc:creator>Keim, Daniel A.</dc:creator> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-03-05T20:18:43Z</dc:date> <dc:creator>Jentner, Wolfgang</dc:creator> <dcterms:issued>2014-02-03</dcterms:issued> <dc:rights>terms-of-use</dc:rights> <dc:contributor>Sharalieva, Lyubka</dc:contributor> <dc:contributor>Wanner, Franz</dc:contributor> <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dc:creator>Schreck, Tobias</dc:creator> <dc:contributor>Keim, Daniel A.</dc:contributor> </rdf:Description> </rdf:RDF>