A Penn-style Treebank of Middle Low German

Date
2020
Authors
Booth, Hannah
Breitbarth, Anne
Ecay, Aaron
Farasyn, Melissa
Publication type
Contribution to conference proceedings
Publication status
Published
Published in
CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775
Abstract

We outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200-1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected to be included. In particular, we focus on the decisions involved in the syntactic annotation of the corpus, specifically, the practical and linguistic reasons for adopting the Penn annotation scheme, the stages of the annotation process itself, and how we have adapted the Penn scheme for syntactic features specific to MLG. We also discuss the issue of data uncertainty, which is a major issue when building a corpus of an under-researched language stage like MLG, and some novel ways in which we capture this uncertainty in the annotation.

Subject (DDC)
400 Language, Linguistics
Conference
12th Language Resources and Evaluation Conference, 11 May 2020 - 16 May 2020, Marseille, France
Cite
ISO 690
BOOTH, Hannah, Anne BREITBARTH, Aaron ECAY, Melissa FARASYN, 2020. A Penn-style Treebank of Middle Low German. 12th Language Resources and Evaluation Conference. Marseille, France, 11 May 2020 - 16 May 2020. In: CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775
BibTex
@inproceedings{Booth2020Penns-54166,
  year={2020},
  title={A Penn-style Treebank of Middle Low German},
  url={https://www.aclweb.org/anthology/2020.lrec-1.96/},
  publisher={The European Language Resources Association},
  address={Paris},
  booktitle={Proceedings of the 12th Language Resources and Evaluation Conference},
  pages={766--775},
  editor={Calzolari, Nicoletta and Béchet, Frédéric and Blache, Philippe and Choukri, Khalid and Cieri, Christopher and Declerck, Thierry and Goggi, Sara and Isahara, Hitoshi and Maegaard, Bente and Mariani, Joseph},
  author={Booth, Hannah and Breitbarth, Anne and Ecay, Aaron and Farasyn, Melissa}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/54166">
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Breitbarth, Anne</dc:contributor>
    <dc:creator>Ecay, Aaron</dc:creator>
    <dc:creator>Breitbarth, Anne</dc:creator>
    <dcterms:abstract xml:lang="eng">We outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200-1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected to be included. In particular, we focus on the decisions involved in the syntactic annotation of the corpus, specifically, the practical and linguistic reasons for adopting the Penn annotation scheme, the stages of the annotation process itself, and how we have adapted the Penn scheme for syntactic features specific to MLG. We also discuss the issue of data uncertainty, which is a major issue when building a corpus of an under-researched language stage like MLG, and some novel ways in which we capture this uncertainty in the annotation.</dcterms:abstract>
    <dc:contributor>Ecay, Aaron</dc:contributor>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-07-01T09:42:54Z</dcterms:available>
    <dc:language>eng</dc:language>
    <dc:creator>Booth, Hannah</dc:creator>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:contributor>Booth, Hannah</dc:contributor>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-07-01T09:42:54Z</dc:date>
    <dcterms:title>A Penn-style Treebank of Middle Low German</dcterms:title>
    <dcterms:issued>2020</dcterms:issued>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/54166"/>
    <dc:contributor>Farasyn, Melissa</dc:contributor>
    <dc:creator>Farasyn, Melissa</dc:creator>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
  </rdf:Description>
</rdf:RDF>
URL checked on
2021-06-30
University bibliography
Yes