Visual analysis of news streams with article threads

dc.contributor.authorKrstajic, Milos
dc.contributor.authorBertini, Enrico
dc.contributor.authorMansmann, Florian
dc.contributor.authorKeim, Daniel A.
dc.date.accessioned2011-09-13T06:49:11Zdeu
dc.date.available2011-09-13T06:49:11Zdeu
dc.date.issued2010
dc.description.abstractThe analysis of large quantities of news is an emerging area in the field of data analysis and visualization. International agencies collect thousands of news every day from a large number of sources and making sense of them is becoming increasingly complex due to the rate of the incoming news, as well as the inherent complexity of analyzing large quantities of evolving text corpora. Current visual techniques that deal with temporal evolution of such complex datasets, together with research efforts in related domains like text mining and topic detection and tracking, represent early attempts to understand, gain insight and make sense of these data. Despite these initial propositions, there is still a lack of techniques dealing directly with the problem of visualizing news streams in a "on-line" fashion, that is, in a way that the evolution of news can be monitored in real-time by the operator. In this paper we propose a purely visual technique that permits to see the evolution of news in real-time. The technique permits to show the stream of news as they enter into the system as well as a series of important threads which are computed on the fly. By merging single articles into threads, the technique permits to offload the visualization and retain only the most relevant information. The proposed technique is applied to the visualization of news streams generated by a news aggregation system that monitors over 4000 sites from 1600 key news portals world-wide and retrieves over 80000 reports per day in 43 languages.eng
dc.description.versionpublished
dc.identifier.citationFirst publ. in: Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques : KDD '10, The 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining / Margaret H. Dunham (Ed.). New York : ACM, 2010, pp. 39-46deu
dc.identifier.doi10.1145/1833280.1833286deu
dc.identifier.ppn350333114deu
dc.identifier.urihttp://kops.uni-konstanz.de/handle/123456789/12706
dc.language.isoengdeu
dc.legacy.dateIssued2011-09-13deu
dc.rightsterms-of-usedeu
dc.rights.urihttps://rightsstatements.org/page/InC/1.0/deu
dc.subjectVisual Analyticsdeu
dc.subjectNews Analysisdeu
dc.subjectData Streamingdeu
dc.subject.ddc004deu
dc.titleVisual analysis of news streams with article threadseng
dc.typeINPROCEEDINGSdeu
dspace.entity.typePublication
kops.citation.bibtex
@inproceedings{Krstajic2010Visua-12706,
  year={2010},
  doi={10.1145/1833280.1833286},
  title={Visual analysis of news streams with article threads},
  isbn={978-1-4503-0226-5},
  publisher={ACM Press},
  address={New York, New York, USA},
  booktitle={Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques - StreamKDD '10},
  pages={39--46},
  author={Krstajic, Milos and Bertini, Enrico and Mansmann, Florian and Keim, Daniel A.}
}
kops.citation.iso690KRSTAJIC, Milos, Enrico BERTINI, Florian MANSMANN, Daniel A. KEIM, 2010. Visual analysis of news streams with article threads. the First International Workshop. Washington, D.C., 25. Juli 2010 - 25. Juli 2010. In: Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques - StreamKDD '10. New York, New York, USA: ACM Press, 2010, pp. 39-46. ISBN 978-1-4503-0226-5. Available under: doi: 10.1145/1833280.1833286deu
kops.citation.iso690KRSTAJIC, Milos, Enrico BERTINI, Florian MANSMANN, Daniel A. KEIM, 2010. Visual analysis of news streams with article threads. the First International Workshop. Washington, D.C., Jul 25, 2010 - Jul 25, 2010. In: Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques - StreamKDD '10. New York, New York, USA: ACM Press, 2010, pp. 39-46. ISBN 978-1-4503-0226-5. Available under: doi: 10.1145/1833280.1833286eng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/12706">
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/12706/1/krstajic_visual.pdf"/>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-09-13T06:49:11Z</dc:date>
    <dc:contributor>Mansmann, Florian</dc:contributor>
    <dc:contributor>Keim, Daniel A.</dc:contributor>
    <dc:rights>terms-of-use</dc:rights>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/12706/1/krstajic_visual.pdf"/>
    <dc:creator>Bertini, Enrico</dc:creator>
    <dcterms:bibliographicCitation>First publ. in: Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques : KDD '10, The 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining / 	Margaret H. Dunham (Ed.). New York : ACM, 2010, pp. 39-46</dcterms:bibliographicCitation>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:issued>2010</dcterms:issued>
    <dc:creator>Keim, Daniel A.</dc:creator>
    <dc:creator>Krstajic, Milos</dc:creator>
    <dcterms:abstract xml:lang="eng">The analysis of large quantities of news is an emerging area in the field of data analysis and visualization. International agencies collect thousands of news every day from a large number of sources and making sense of them is becoming increasingly complex due to the rate of the incoming news, as well as the inherent complexity of analyzing large quantities of evolving text corpora. Current visual techniques that deal with temporal evolution of such complex datasets, together with research efforts in related domains like text mining and topic detection and tracking, represent early attempts to understand, gain insight and make sense of these data. Despite these initial propositions, there is still a lack of techniques dealing directly with the problem of visualizing news streams in a "on-line" fashion, that is, in a way that the evolution of news can be monitored in real-time by the operator. In this paper we propose a purely visual technique that permits to see the evolution of news in real-time. The technique permits to show the stream of news as they enter into the system as well as a series of important threads which are computed on the fly. By merging single articles into threads, the technique permits to offload the visualization and retain only the most relevant information. The proposed technique is applied to the visualization of news streams generated by a news aggregation system that monitors over 4000 sites from 1600 key news portals world-wide and retrieves over 80000 reports per day in 43 languages.</dcterms:abstract>
    <dcterms:title>Visual analysis of news streams with article threads</dcterms:title>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-09-13T06:49:11Z</dcterms:available>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/12706"/>
    <dc:contributor>Bertini, Enrico</dc:contributor>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Krstajic, Milos</dc:contributor>
    <dc:language>eng</dc:language>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:creator>Mansmann, Florian</dc:creator>
  </rdf:Description>
</rdf:RDF>
kops.conferencefieldthe First International Workshop, 25. Juli 2010 - 25. Juli 2010, Washington, D.C.deu
kops.date.conferenceEnd2010-07-25
kops.date.conferenceStart2010-07-25
kops.description.openAccessopenaccessgreen
kops.flag.knbibliographytrue
kops.identifier.nbnurn:nbn:de:bsz:352-127061deu
kops.location.conferenceWashington, D.C.
kops.sourcefield<i>Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques - StreamKDD '10</i>. New York, New York, USA: ACM Press, 2010, pp. 39-46. ISBN 978-1-4503-0226-5. Available under: doi: 10.1145/1833280.1833286deu
kops.sourcefield.plainProceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques - StreamKDD '10. New York, New York, USA: ACM Press, 2010, pp. 39-46. ISBN 978-1-4503-0226-5. Available under: doi: 10.1145/1833280.1833286deu
kops.sourcefield.plainProceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques - StreamKDD '10. New York, New York, USA: ACM Press, 2010, pp. 39-46. ISBN 978-1-4503-0226-5. Available under: doi: 10.1145/1833280.1833286eng
kops.submitter.emailmichael.ketzer@uni-konstanz.dedeu
kops.title.conferencethe First International Workshop
relation.isAuthorOfPublication51744407-f0ac-47af-bf4e-f85821560ef5
relation.isAuthorOfPublicationac932161-40c6-4b8c-ba2a-900f97afb0e3
relation.isAuthorOfPublication90244953-4003-4a15-ae6e-0b9d164ea2a3
relation.isAuthorOfPublicationda7dafb0-6003-4fd4-803c-11e1e72d621a
relation.isAuthorOfPublication.latestForDiscovery51744407-f0ac-47af-bf4e-f85821560ef5
source.bibliographicInfo.fromPage39
source.bibliographicInfo.toPage46
source.identifier.isbn978-1-4503-0226-5
source.publisherACM Press
source.publisher.locationNew York, New York, USA
source.titleProceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques - StreamKDD '10

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
krstajic_visual.pdf
Größe:
569.9 KB
Format:
Adobe Portable Document Format
krstajic_visual.pdf
krstajic_visual.pdfGröße: 569.9 KBDownloads: 752

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
license.txt
Größe:
1.92 KB
Format:
Plain Text
Beschreibung:
license.txt
license.txtGröße: 1.92 KBDownloads: 0