Publication:

Walking Wales : The Data Challenge

Files

Kolb_0-305478.pdf (Size: 6.94 MB, Downloads: 583)

Date

2015

Authors

Kolb, David

Open Access publication
Open Access Green

Publication type
Student research project (Studienarbeit)
Publication status
Published

Abstract

The handling and analysis of large amounts of data is no trivial task, especially if the data comes from diverse sources in various formats and is subject to inconsistencies and errors. This project deals with such data, collected during a 3-month walk around the perimeter of Wales in the UK. It details the difficulties of processing and ultimately making sense of real-world data, including GPS, ECG and free text, and shows how problems in the raw data were identified and resolved using open-source tools or special tools written by the author. The importance of understanding the data is emphasised, and assessing data quality is a major part of this. Unstructured text from blog posts was analysed to extract a sentiment score, which involved creating a domain-specific sentiment dictionary. A small study highlighted some of the problems of assessing sentiment in this context. Finally, several multivariate visualisations were created to allow browsing and visual exploration of the data. These included the results of the sentiment analysis, GPS track, heart rate, elevation, acceleration and skin conductivity on a zoomable timeline. A zoomable map was also created, showing the walked track with an indication of the sentiment score. The use of the visualisations to find interesting artefacts is demonstrated.

Subject area (DDC)
004 Computer science

Keywords

Information visualisation, data cleaning, sentiment analysis, long distance walking, data cleansing

Cite

ISO 690
KOLB, David, 2015. Walking Wales : The Data Challenge
BibTeX
@misc{Kolb2015Walki-32085,
  year={2015},
  title={Walking Wales : The Data Challenge},
  author={Kolb, David},
  note={This is a report on a bachelor's project.}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/32085">
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/32085/3/Kolb_0-305478.pdf"/>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/32085"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/32085/3/Kolb_0-305478.pdf"/>
    <dcterms:abstract xml:lang="eng">The handling and analysis of large amounts of data is no trivial task, especially if the data comes from diverse sources in various formats and is subject to inconsistencies and errors. This project deals with such data, collected during a 3-month walk around the perimeter of Wales in the UK. It details the difficulties of processing and ultimately making sense of real-world data, including GPS, ECG and free text, and shows how problems in the raw data were identified and resolved using open-source tools or special tools written by the author. The importance of understanding the data is emphasised, and assessing data quality is a major part of this. Unstructured text from blog posts was analysed to extract a sentiment score, which involved creating a domain-specific sentiment dictionary. A small study highlighted some of the problems of assessing sentiment in this context. Finally, several multivariate visualisations were created to allow browsing and visual exploration of the data. These included the results of the sentiment analysis, GPS track, heart rate, elevation, acceleration and skin conductivity on a zoomable timeline. A zoomable map was also created, showing the walked track with an indication of the sentiment score. The use of the visualisations to find interesting artefacts is demonstrated.</dcterms:abstract>
    <dcterms:title>Walking Wales : The Data Challenge</dcterms:title>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:contributor>Kolb, David</dc:contributor>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-11-09T08:36:19Z</dcterms:available>
    <dcterms:issued>2015</dcterms:issued>
    <dc:rights>terms-of-use</dc:rights>
    <dc:creator>Kolb, David</dc:creator>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-11-09T08:36:19Z</dc:date>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:language>eng</dc:language>
  </rdf:Description>
</rdf:RDF>

Comment on the publication

This is a report on a bachelor's project.