On Clustering Time Series Using Euclidean Distance and Pearson Correlation

dc.contributor.authorBerthold, Michael R.
dc.contributor.authorHöppner, Frank
dc.date.accessioned2016-07-13T13:07:14Z
dc.date.available2016-07-13T13:07:14Z
dc.date.issued2016eng
dc.description.abstractFor time series comparisons, it has often been observed that z-score normalized Euclidean distances far outperform the unnormalized variant. In this paper we show that a z-score normalized, squared Euclidean Distance is, in fact, equal to a distance based on Pearson Correlation. This has profound impact on many distance-based classification or clustering methods. In addition to this theoretically sound result we also show that the often used k-Means algorithm formally needs a mod ification to keep the interpretation as Pearson correlation strictly valid. Experimental results demonstrate that in many cases the standard k-Means algorithm generally produces the same results.eng
dc.description.versionpublishedde
dc.identifier.arxiv1601.02213eng
dc.identifier.ppn474032485
dc.identifier.urihttps://kops.uni-konstanz.de/handle/123456789/34784
dc.language.isoengeng
dc.rightsterms-of-use
dc.rights.urihttps://rightsstatements.org/page/InC/1.0/
dc.subject.ddc004eng
dc.titleOn Clustering Time Series Using Euclidean Distance and Pearson Correlationeng
dc.typePREPRINTde
dspace.entity.typePublication
kops.citation.bibtex
@unpublished{Berthold2016Clust-34784,
  year={2016},
  title={On Clustering Time Series Using Euclidean Distance and Pearson Correlation},
  author={Berthold, Michael R. and Höppner, Frank}
}
kops.citation.iso690BERTHOLD, Michael R., Frank HÖPPNER, 2016. On Clustering Time Series Using Euclidean Distance and Pearson Correlationdeu
kops.citation.iso690BERTHOLD, Michael R., Frank HÖPPNER, 2016. On Clustering Time Series Using Euclidean Distance and Pearson Correlationeng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/34784">
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/34784/1/Berthold_0-347884.pdf"/>
    <dcterms:issued>2016</dcterms:issued>
    <dc:contributor>Berthold, Michael R.</dc:contributor>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:abstract xml:lang="eng">For time series comparisons, it has often been observed that z-score normalized Euclidean distances far outperform the unnormalized variant. In this paper we show that a z-score normalized, squared Euclidean Distance is, in fact, equal to a distance based on Pearson Correlation. This has profound impact on many distance-based classification or clustering methods. In addition to this theoretically sound result we also show that the often used k-Means algorithm formally needs a mod ification to keep the interpretation as Pearson correlation strictly valid. Experimental results demonstrate that in many cases the standard k-Means algorithm generally produces the same results.</dcterms:abstract>
    <dc:language>eng</dc:language>
    <dc:contributor>Höppner, Frank</dc:contributor>
    <dcterms:title>On Clustering Time Series Using Euclidean Distance and Pearson Correlation</dcterms:title>
    <dc:creator>Berthold, Michael R.</dc:creator>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/34784"/>
    <dc:creator>Höppner, Frank</dc:creator>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2016-07-13T13:07:14Z</dcterms:available>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:rights>terms-of-use</dc:rights>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2016-07-13T13:07:14Z</dc:date>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/34784/1/Berthold_0-347884.pdf"/>
  </rdf:Description>
</rdf:RDF>
kops.description.openAccessopenaccessgreen
kops.flag.knbibliographytrue
kops.identifier.nbnurn:nbn:de:bsz:352-0-347884
relation.isAuthorOfPublication56ea9ab6-14a4-493e-8ef1-3c064e0c50a1
relation.isAuthorOfPublication.latestForDiscovery56ea9ab6-14a4-493e-8ef1-3c064e0c50a1

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
Berthold_0-347884.pdf
Größe:
312.08 KB
Format:
Adobe Portable Document Format
Beschreibung:
Berthold_0-347884.pdf
Berthold_0-347884.pdfGröße: 312.08 KBDownloads: 649

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
license.txt
Größe:
3.88 KB
Format:
Item-specific license agreed upon to submission
Beschreibung:
license.txt
license.txtGröße: 3.88 KBDownloads: 0