Publikation:

A Penn-style Treebank of Middle Low German

Lade...
Vorschaubild

Dateien

Zu diesem Dokument gibt es keine Dateien.

Datum

2020

Autor:innen

Breitbarth, Anne
Ecay, Aaron
Farasyn, Melissa

Herausgeber:innen

Kontakt

ISSN der Zeitschrift

Electronic ISSN

ISBN

Bibliografische Daten

Verlag

Schriftenreihe

Auflagebezeichnung

URI (zitierfähiger Link)
DOI (zitierfähiger Link)
ArXiv-ID

Internationale Patentnummer

Angaben zur Forschungsförderung

Projekt

Open Access-Veröffentlichung
Core Facility der Universität Konstanz

Gesperrt bis

Titel in einer weiteren Sprache

Publikationstyp
Beitrag zu einem Konferenzband
Publikationsstatus
Published

Erschienen in

CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775

Zusammenfassung

We outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200-1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected to be included. In particular, we focus on the decisions involved in the syntactic annotation of the corpus, specifically, the practical and linguistic reasons for adopting the Penn annotation scheme, the stages of the annotation process itself, and how we have adapted the Penn scheme for syntactic features specific to MLG. We also discuss the issue of data uncertainty, which is a major issue when building a corpus of an under-researched language stage like MLG, and some novel ways in which we capture this uncertainty in the annotation.

Zusammenfassung in einer weiteren Sprache

Fachgebiet (DDC)
400 Sprachwissenschaft, Linguistik

Schlagwörter

Konferenz

12th Language Resources and Evaluation Conference, 11. Mai 2020 - 16. Mai 2020, Marseille, France
Rezension
undefined / . - undefined, undefined

Forschungsvorhaben

Organisationseinheiten

Zeitschriftenheft

Zugehörige Datensätze in KOPS

Zitieren

ISO 690BOOTH, Hannah, Anne BREITBARTH, Aaron ECAY, Melissa FARASYN, 2020. A Penn-style Treebank of Middle Low German. 12th Language Resources and Evaluation Conference. Marseille, France, 11. Mai 2020 - 16. Mai 2020. In: CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775
BibTex
@inproceedings{Booth2020Penns-54166,
  year={2020},
  title={A Penn-style Treebank of Middle Low German},
  url={https://www.aclweb.org/anthology/2020.lrec-1.96/},
  publisher={The European Language Resources Association},
  address={Paris},
  booktitle={Proceedings of the 12th Language Resources and Evaluation Conference},
  pages={766--775},
  editor={Calzolari, Nicoletta and Béchet, Frédéric and Blache, Philippe and Choukri, Khalid and Cieri, Christopher and Declerck, Thierry and Goggi, Sara and Isahara, Hitoshi and Maegaard, Bente and Mariani, Joseph},
  author={Booth, Hannah and Breitbarth, Anne and Ecay, Aaron and Farasyn, Melissa}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/54166">
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Breitbarth, Anne</dc:contributor>
    <dc:creator>Ecay, Aaron</dc:creator>
    <dc:creator>Breitbarth, Anne</dc:creator>
    <dcterms:abstract xml:lang="eng">We outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200-1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected to be included. In particular, we focus on the decisions involved in the syntactic annotation of the corpus, specifically, the practical and linguistic reasons for adopting the Penn annotation scheme, the stages of the annotation process itself, and how we have adapted the Penn scheme for syntactic features specific to MLG. We also discuss the issue of data uncertainty, which is a major issue when building a corpus of an under-researched language stage like MLG, and some novel ways in which we capture this uncertainty in the annotation.</dcterms:abstract>
    <dc:contributor>Ecay, Aaron</dc:contributor>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-07-01T09:42:54Z</dcterms:available>
    <dc:language>eng</dc:language>
    <dc:creator>Booth, Hannah</dc:creator>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:contributor>Booth, Hannah</dc:contributor>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-07-01T09:42:54Z</dc:date>
    <dcterms:title>A Penn-style Treebank of Middle Low German</dcterms:title>
    <dcterms:issued>2020</dcterms:issued>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/54166"/>
    <dc:contributor>Farasyn, Melissa</dc:contributor>
    <dc:creator>Farasyn, Melissa</dc:creator>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
  </rdf:Description>
</rdf:RDF>

Interner Vermerk

xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter

Kontakt

Prüfdatum der URL

2021-06-30

Prüfungsdatum der Dissertation

Finanzierungsart

Kommentar zur Publikation

Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Ja
Begutachtet
Diese Publikation teilen