Walking Wales : The Data Challenge
| dc.contributor.author | Kolb, David | |
| dc.date.accessioned | 2015-11-09T08:36:19Z | |
| dc.date.available | 2015-11-09T08:36:19Z | |
| dc.date.issued | 2015 | eng |
| dc.description.abstract | The handling and analysis of large amounts of data is no trivial task, especially if the data comes from diverse sources with various data formats, and is subject to inconsistencies and errors. This project deals with such data, collected during a 3 month walk around the perimeter of Wales in the UK. It details the difficulties of processing and ultimately making sense of real world data, including GPS, ECG and free text, showing how problems in the raw data were identified and resolved through the use of open source tools or special tools written by the author. The importance of understand the data is emphasised, of which assessing the quality of the data is a major issue. Unstructured text from blog posts was analysed to extract a sentiment score, which involved creating a domain specific sentiment dictionary. A small study highlighted some of the problems of assessing sentiment in this context. Finally, several multivariate visualisations were created to allow browsing and a visual exploration of the data. This included the results of the sentiment analysis, GPS track, heart-rate, elevation, acceleration and skin conductivity on a zoomable timeline. A zoomable map was also created, showing the walked track with an indication of the sentiment score. The use of the visualisations to find interesting artefacts are demonstrated. | eng |
| dc.description.version | published | |
| dc.identifier.ppn | 451538498 | |
| dc.identifier.uri | http://kops.uni-konstanz.de/handle/123456789/32085 | |
| dc.language.iso | eng | eng |
| dc.rights | terms-of-use | |
| dc.rights.uri | https://rightsstatements.org/page/InC/1.0/ | |
| dc.subject | Information visualisation, data cleaning, sentiment analysis, long distance walking, data cleansing | eng |
| dc.subject.ccs | information visualisation | |
| dc.subject.ddc | 004 | eng |
| dc.title | Walking Wales : The Data Challenge | eng |
| dc.type | STUDENTTEXT | eng |
| dspace.entity.type | Publication | |
| kops.citation.bibtex | @misc{Kolb2015Walki-32085,
year={2015},
title={Walking Wales : The Data Challenge},
author={Kolb, David},
note={Es handelt sich um einen Bericht von einem Bachelor-Projekt.}
} | |
| kops.citation.iso690 | KOLB, David, 2015. Walking Wales : The Data Challenge | deu |
| kops.citation.iso690 | KOLB, David, 2015. Walking Wales : The Data Challenge | eng |
| kops.citation.rdf | <rdf:RDF
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:bibo="http://purl.org/ontology/bibo/"
xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:void="http://rdfs.org/ns/void#"
xmlns:xsd="http://www.w3.org/2001/XMLSchema#" >
<rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/32085">
<dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/32085/3/Kolb_0-305478.pdf"/>
<bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/32085"/>
<dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
<dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/32085/3/Kolb_0-305478.pdf"/>
<dcterms:abstract xml:lang="eng">The handling and analysis of large amounts of data is no trivial task, especially if the data comes from diverse sources with various data formats, and is subject to inconsistencies and errors. This project deals with such data, collected during a 3 month walk around the perimeter of Wales in the UK. It details the difficulties of processing and ultimately making sense of real world data, including GPS, ECG and free text, showing how problems in the raw data were identified and resolved through the use of open source tools or special tools written by the author. The importance of understand the data is emphasised, of which assessing the quality of the data is a major issue. Unstructured text from blog posts was analysed to extract a sentiment score, which involved creating a domain specific sentiment dictionary. A small study highlighted some of the problems of assessing sentiment in this context. Finally, several multivariate visualisations were created to allow browsing and a visual exploration of the data. This included the results of the sentiment analysis, GPS track, heart-rate, elevation, acceleration and skin conductivity on a zoomable timeline. A zoomable map was also created, showing the walked track with an indication of the sentiment score. The use of the visualisations to find interesting artefacts are demonstrated.</dcterms:abstract>
<dcterms:title>Walking Wales : The Data Challenge</dcterms:title>
<dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
<dc:contributor>Kolb, David</dc:contributor>
<dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
<dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-11-09T08:36:19Z</dcterms:available>
<dcterms:issued>2015</dcterms:issued>
<dc:rights>terms-of-use</dc:rights>
<dc:creator>Kolb, David</dc:creator>
<foaf:homepage rdf:resource="http://localhost:8080/"/>
<dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-11-09T08:36:19Z</dc:date>
<void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
<dc:language>eng</dc:language>
</rdf:Description>
</rdf:RDF> | |
| kops.description.comment | Es handelt sich um einen Bericht von einem Bachelor-Projekt. | eng |
| kops.description.openAccess | openaccessgreen | |
| kops.identifier.nbn | urn:nbn:de:bsz:352-0-305478 | |
| relation.isAuthorOfPublication | a278540f-1bd6-436c-83c6-5ffff83ae241 | |
| relation.isAuthorOfPublication.latestForDiscovery | a278540f-1bd6-436c-83c6-5ffff83ae241 | |
| temp.internal.duplicates | <p>Keine Dubletten gefunden. Letzte Überprüfung: 04.11.2015 14:37:25</p> | deu |
Dateien
Originalbündel
1 - 1 von 1
Vorschaubild nicht verfügbar
- Name:
- Kolb_0-305478.pdf
- Größe:
- 6.94 MB
- Format:
- Adobe Portable Document Format
Lizenzbündel
1 - 1 von 1
Vorschaubild nicht verfügbar
- Name:
- license.txt
- Größe:
- 3.88 KB
- Format:
- Item-specific license agreed upon to submission
- Beschreibung:

