KOPS - The Institutional Repository of the University of Konstanz

Testing Acoustic Voice Quality Classification Across Languages and Speech Styles

Files in this item

Checksum: MD5:e6c78800c609e58b995e32fd36c7b00a

BRAUN, Bettina, Nicole DEHÉ, Marieke EINFELDT, Daniela WOCHNER, Katharina ZAHNER-RITTER, 2021. Testing Acoustic Voice Quality Classification Across Languages and Speech Styles. Interspeech 2021. Brno, Czechia, Aug 30, 2021 - Sep 3, 2021. In: HEŘMANSKÝ, Hynek, ed. and others. Proceedings of Interspeech 2021. Baixas, France: ISCA, pp. 3920-3924. Available under: doi: 10.21437/Interspeech.2021-315

@inproceedings{Braun2021Testi-59052,
  title={Testing Acoustic Voice Quality Classification Across Languages and Speech Styles},
  year={2021},
  doi={10.21437/Interspeech.2021-315},
  address={Baixas, France},
  publisher={ISCA},
  booktitle={Proceedings of Interspeech 2021},
  pages={3920--3924},
  editor={Heřmanský, Hynek},
  author={Braun, Bettina and Dehé, Nicole and Einfeldt, Marieke and Wochner, Daniela and Zahner-Ritter, Katharina}
}

<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#">
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/rdf/resource/123456789/59052">
    <dc:language>eng</dc:language>
    <dc:contributor>Braun, Bettina</dc:contributor>
    <dc:creator>Einfeldt, Marieke</dc:creator>
    <dcterms:title>Testing Acoustic Voice Quality Classification Across Languages and Speech Styles</dcterms:title>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2022-11-07T14:20:45Z</dcterms:available>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/rdf/resource/123456789/45"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/59052/3/Braun_2-1bz2rt3fbpory9.pdf"/>
    <dc:creator>Braun, Bettina</dc:creator>
    <dc:creator>Dehé, Nicole</dc:creator>
    <dc:contributor>Zahner-Ritter, Katharina</dc:contributor>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/59052"/>
    <foaf:homepage rdf:resource="http://localhost:8080/jspui"/>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/59052/3/Braun_2-1bz2rt3fbpory9.pdf"/>
    <dcterms:abstract xml:lang="eng">Many studies relate acoustic voice quality measures to perceptual classification. We extend this line of research by training a classifier on a balanced set of perceptually annotated voice quality categories with high inter-rater agreement, and test it on speech samples from a different language and on a different speech style. Annotations were done on continuous speech from different laboratory settings.
In Experiment 1, we trained a random forest with Standard Chinese and German recordings labelled as modal, breathy, or glottalized. The model had an accuracy of 78.7% on unseen data from the same sample (most important variables were harmonics-to-noise ratio, cepstral-peak prominence, and H1-A2). This model was then used to classify data from a different language (Icelandic, Experiment 2) and to classify a different speech style (German infant-directed speech (IDS), Experiment 3). Cross-linguistic generalizability was high for Icelandic (78.6% accuracy), but lower for German IDS (71.7% accuracy). Accuracy of recordings of adult-directed speech from the same speakers as in Experiment 3 (77%, Experiment 4) suggests that it is the special speech style of IDS, rather than the recording setting that led to lower performance. Results are discussed in terms of efficiency of coding and generalizability across languages and speech styles.</dcterms:abstract>
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/rdf/resource/123456789/45"/>
    <dcterms:issued>2021</dcterms:issued>
    <dc:contributor>Wochner, Daniela</dc:contributor>
    <dc:contributor>Einfeldt, Marieke</dc:contributor>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2022-11-07T14:20:45Z</dc:date>
    <dc:contributor>Dehé, Nicole</dc:contributor>
    <dc:creator>Wochner, Daniela</dc:creator>
    <dc:creator>Zahner-Ritter, Katharina</dc:creator>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
  </rdf:Description>
</rdf:RDF>
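
The evaluation design summarized in the abstract (train a random forest once, check it on held-out data from the same sample, then apply the unchanged model to a different corpus) can be illustrated with a minimal sketch. This is not the authors' code. It assumes scikit-learn and NumPy are available, and it uses synthetic placeholder data: the helper `make_synthetic_corpus`, the cluster centers, the `shift` parameter, and the hyperparameters are all hypothetical. Only the three labels and the three feature names are taken from the abstract.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

LABELS = ["modal", "breathy", "glottalized"]  # categories named in the abstract
FEATURES = ["HNR", "CPP", "H1-A2"]            # top-ranked measures named in the abstract


def make_synthetic_corpus(n_per_class, shift, seed):
    """Hypothetical stand-in for a corpus: one Gaussian cluster per label.

    These are NOT real acoustic measurements. `shift` moves every cluster
    to mimic a systematic difference between corpora.
    """
    rng = np.random.default_rng(seed)
    centers = np.array([[0.0, 0.0, 0.0], [3.0, -3.0, 3.0], [-3.0, 3.0, -3.0]])
    X = np.vstack(
        [rng.normal(c + shift, 1.0, size=(n_per_class, len(FEATURES))) for c in centers]
    )
    y = np.repeat(LABELS, n_per_class)  # same class order as the stacked clusters
    return X, y


# Training sample (balanced across the three categories).
X_src, y_src = make_synthetic_corpus(n_per_class=200, shift=0.0, seed=1)
X_train, X_test, y_train, y_test = train_test_split(
    X_src, y_src, test_size=0.2, stratify=y_src, random_state=0
)

clf = RandomForestClassifier(n_estimators=200, random_state=0)
clf.fit(X_train, y_train)

# Accuracy on unseen data from the same sample.
acc_same = clf.score(X_test, y_test)

# Apply the unchanged model to a different (here: shifted) corpus.
X_new, y_new = make_synthetic_corpus(n_per_class=100, shift=0.5, seed=2)
acc_transfer = clf.score(X_new, y_new)

# Impurity-based variable importance, one value per feature.
importances = dict(zip(FEATURES, clf.feature_importances_))

print(f"same-sample accuracy: {acc_same:.3f}")
print(f"transfer accuracy:    {acc_transfer:.3f}")
print(importances)
```

The key point of the design is that `clf` is fitted only once. Any drop between `acc_same` and `acc_transfer` then reflects how the new corpus differs from the training sample, not a change in the model.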

