Automatic Detection of Prosodic Cues

Files
ProsAlign4.0.pdf (size: 2.55 MB, downloads: 717)
Date
2003
Authors
Braunschweiler, Norbert
Open Access publication
Open Access Green
Title in another language
Automatische Erkennung prosodischer Schlüsselparameter
Publication type
Dissertation
Publication status
Published
Abstract

This study presents an approach that formulates an explicit path from continuous acoustic parameters to discrete, abstract phonological entities. The method is implemented in a computer program and draws on a linguistic theory of the underlying structure of prosody in speech. The program is designed to automatically detect the position of prosodic events in acoustic speech signals. Such a program can be of great benefit to linguists working with large acoustic databases, because it enables them to process unlabeled speech material automatically and systematically. The program can search for specific intonational patterns in a given language, or test a theory about the underlying structure of prosody against the acoustic reality. Language learners can use it to get visual feedback on their newly acquired foreign-language abilities. Furthermore, the program can be used to label prosodic events in a speech synthesis corpus and thereby improve synthesis quality. Finally, there are possible applications in automatic speech recognition.

The process of information extraction from acoustic speech signals involves not only the recognition of segmental features, phonemes, syllables or words and subsequent linguistic processing, but also the recognition of prosodic events including the position of accented words, the type of pitch movement associated with them, the general trendline of pitch and also the grouping of information units, phrases or words.

Prosodic events are important conveyors of the information structure of utterances, which this work aims to uncover for improved speech analysis and recognition. To this end, the following tasks are carried out: (i) review and discussion of intonation models, (ii) development of a new approach for the automatic detection of prosodic cues, (iii) acoustic analysis of cues of prosodic events, (iv) implementation of algorithms for detecting these prosodic cues, and (v) evaluation of the new approach. Important aspects of the thesis include the integration and evaluation of linguistic theory and quantitative acoustic modeling.

Abstract in another language (translated from the German original)

The dissertation addresses the automatic detection of key prosodic features. It develops a method that automatically transforms an acoustic representation of a spoken utterance into a phonological one. The process of extracting information from acoustic speech signals comprises not only the perception of segmental features, phonemes, syllables, or words, but also the perception of prosodic events, such as the position of accented words, the pitch movement associated with them, the overall course of pitch, and the grouping of information units, phrases, and words. It is therefore of central importance to capture and describe these prosodic events adequately. Prosodic events are important carriers of the information structure of an utterance. The present work attempts to unfold this structure in order to enable improved speech analysis and recognition. To achieve these goals, the following work is carried out: (1) a review and critical discussion of existing intonation models, (2) development of a new method for the automatic detection of key prosodic parameters, (3) acoustic analyses of prosodic events, (4) computer implementation of a method for the automatic detection of prosodic events, and (5) evaluation of the new method. Important aspects of this work are the integration and evaluation of linguistic theories and the quantitative modeling of acoustic speech.

Subject (DDC)
400 Language, Linguistics
Keywords
Intonation, Prosody, Phonetics, Phonology, ToBI
Cite
ISO 690
BRAUNSCHWEILER, Norbert, 2003. Automatic Detection of Prosodic Cues [Dissertation]. Konstanz: University of Konstanz
BibTeX
@phdthesis{Braunschweiler2003Autom-3776,
  year={2003},
  title={Automatic Detection of Prosodic Cues},
  author={Braunschweiler, Norbert},
  address={Konstanz},
  school={Universität Konstanz}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/3776">
    <dc:format>application/pdf</dc:format>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/3776/1/ProsAlign4.0.pdf"/>
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:contributor>Braunschweiler, Norbert</dc:contributor>
    <dc:language>eng</dc:language>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dcterms:issued>2003</dcterms:issued>
    <dcterms:abstract xml:lang="eng">This study is about an approach that formulates an explicit way from continuous acoustic parameters to discrete and abstract phonological entities. The method is implemented in a computer program and uses a linguistic theory about the underlying structure of prosody in speech. The program is designed to automatically detect the position of prosodic events from acoustic speech signals. Such a program can be of great benefit for the linguist working with large acoustic databases. It enables the researcher to process unlabeled speech material automatically and systematically. The program can search for specific&lt;br /&gt;intonational patterns in a given language, or can test a theory about the underlying structure of prosody against the acoustic reality or the language learner can use it by seeing some visual feedback to his or her freshly acquired foreign language abilities. Furthermore the program can be used for labeling&lt;br /&gt;prosodic events in a spoken speech synthesis corpus and consequently improve the synthesis quality. Last but not least there are possible applications in the field of automatic speech recognition.&lt;br /&gt;&lt;br /&gt;The process of information extraction from acoustic speech signals involves not only the recognition of segmental features, phonemes, syllables or words and subsequent linguistic processing, but also the recognition of prosodic events including the position of accented words, the type of pitch movement associated with them, the general trendline of pitch and also the grouping of information units, phrases or words.&lt;br /&gt;&lt;br /&gt;The prosodic events are important conveyors of the information structure in utterances, which this work aims at unfolding for improved speech analysis and recognition. 
To fulfill these aims, the following tasks are done: (i) review and discussion of intonation models, (ii) development of a new approach for the automatic detection of prosodic cues, (iii) acoustic analysis of cues of prosodic events, (iv) implementation of algorithms for detecting these&lt;br /&gt;prosodic cues, and (v) evaluation of the new approach. Important aspects of the thesis include integration and evaluation of linguistic theory and quantitative&lt;br /&gt;acoustic modeling.</dcterms:abstract>
    <dcterms:title>Automatic Detection of Prosodic Cues</dcterms:title>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-03-24T10:06:29Z</dc:date>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-03-24T10:06:29Z</dcterms:available>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/3776"/>
    <dcterms:alternative>Automatische Erkennung prosodischer Schlüsselparameter</dcterms:alternative>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:creator>Braunschweiler, Norbert</dc:creator>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/3776/1/ProsAlign4.0.pdf"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
  </rdf:Description>
</rdf:RDF>
Date of dissertation examination
August 1, 2003