Walking Wales : The Data Challenge

dc.contributor.authorKolb, David
dc.date.accessioned2015-11-09T08:36:19Z
dc.date.available2015-11-09T08:36:19Z
dc.date.issued2015eng
dc.description.abstractThe handling and analysis of large amounts of data is no trivial task, especially if the data comes from diverse sources with various data formats, and is subject to inconsistencies and errors. This project deals with such data, collected during a 3 month walk around the perimeter of Wales in the UK. It details the difficulties of processing and ultimately making sense of real world data, including GPS, ECG and free text, showing how problems in the raw data were identified and resolved through the use of open source tools or special tools written by the author. The importance of understand the data is emphasised, of which assessing the quality of the data is a major issue. Unstructured text from blog posts was analysed to extract a sentiment score, which involved creating a domain specific sentiment dictionary. A small study highlighted some of the problems of assessing sentiment in this context. Finally, several multivariate visualisations were created to allow browsing and a visual exploration of the data. This included the results of the sentiment analysis, GPS track, heart-rate, elevation, acceleration and skin conductivity on a zoomable timeline. A zoomable map was also created, showing the walked track with an indication of the sentiment score. The use of the visualisations to find interesting artefacts are demonstrated.eng
dc.description.versionpublished
dc.identifier.ppn451538498
dc.identifier.urihttp://kops.uni-konstanz.de/handle/123456789/32085
dc.language.isoengeng
dc.rightsterms-of-use
dc.rights.urihttps://rightsstatements.org/page/InC/1.0/
dc.subjectInformation visualisation, data cleaning, sentiment analysis, long distance walking, data cleansingeng
dc.subject.ccsinformation visualisation
dc.subject.ddc004eng
dc.titleWalking Wales : The Data Challengeeng
dc.typeSTUDENTTEXTeng
dspace.entity.typePublication
kops.citation.bibtex
@misc{Kolb2015Walki-32085,
  year={2015},
  title={Walking Wales : The Data Challenge},
  author={Kolb, David},
  note={Es handelt sich um einen Bericht von einem Bachelor-Projekt.}
}
kops.citation.iso690KOLB, David, 2015. Walking Wales : The Data Challengedeu
kops.citation.iso690KOLB, David, 2015. Walking Wales : The Data Challengeeng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/32085">
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/32085/3/Kolb_0-305478.pdf"/>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/32085"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/32085/3/Kolb_0-305478.pdf"/>
    <dcterms:abstract xml:lang="eng">The handling and analysis of large amounts of data is no trivial task, especially if the data comes from diverse sources with various data formats, and is subject to inconsistencies and errors. This project deals with such data, collected during a 3 month walk around the perimeter of Wales in the UK. It details the difficulties of processing and ultimately making sense of real world data, including GPS, ECG and free text, showing how problems in the raw data were identified and resolved through the use of open source tools or special tools written by the author. The importance of understand the data is emphasised, of which assessing the quality of the data is a major issue. Unstructured text from blog posts was analysed to extract a sentiment score, which involved creating a domain specific sentiment dictionary. A small study highlighted some of the problems of assessing sentiment in this context. Finally, several multivariate visualisations were created to allow browsing and a visual exploration of the data. This included the results of the sentiment analysis, GPS track, heart-rate, elevation, acceleration and skin conductivity on a zoomable timeline. A zoomable map was also created, showing the walked track with an indication of the sentiment score. The use of the visualisations to find interesting artefacts are demonstrated.</dcterms:abstract>
    <dcterms:title>Walking Wales : The Data Challenge</dcterms:title>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:contributor>Kolb, David</dc:contributor>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-11-09T08:36:19Z</dcterms:available>
    <dcterms:issued>2015</dcterms:issued>
    <dc:rights>terms-of-use</dc:rights>
    <dc:creator>Kolb, David</dc:creator>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-11-09T08:36:19Z</dc:date>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:language>eng</dc:language>
  </rdf:Description>
</rdf:RDF>
kops.description.commentEs handelt sich um einen Bericht von einem Bachelor-Projekt.eng
kops.description.openAccessopenaccessgreen
kops.identifier.nbnurn:nbn:de:bsz:352-0-305478
relation.isAuthorOfPublicationa278540f-1bd6-436c-83c6-5ffff83ae241
relation.isAuthorOfPublication.latestForDiscoverya278540f-1bd6-436c-83c6-5ffff83ae241
temp.internal.duplicates<p>Keine Dubletten gefunden. Letzte Überprüfung: 04.11.2015 14:37:25</p>deu

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
Kolb_0-305478.pdf
Größe:
6.94 MB
Format:
Adobe Portable Document Format
Kolb_0-305478.pdf
Kolb_0-305478.pdfGröße: 6.94 MBDownloads: 719

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
license.txt
Größe:
3.88 KB
Format:
Item-specific license agreed upon to submission
Beschreibung:
license.txt
license.txtGröße: 3.88 KBDownloads: 0