A Penn-style Treebank of Middle Low German

dc.contributor.authorBooth, Hannah
dc.contributor.authorBreitbarth, Anne
dc.contributor.authorEcay, Aaron
dc.contributor.authorFarasyn, Melissa
dc.date.accessioned2021-07-01T09:42:54Z
dc.date.available2021-07-01T09:42:54Z
dc.date.issued2020eng
dc.description.abstractWe outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200-1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected to be included. In particular, we focus on the decisions involved in the syntactic annotation of the corpus, specifically, the practical and linguistic reasons for adopting the Penn annotation scheme, the stages of the annotation process itself, and how we have adapted the Penn scheme for syntactic features specific to MLG. We also discuss the issue of data uncertainty, which is a major issue when building a corpus of an under-researched language stage like MLG, and some novel ways in which we capture this uncertainty in the annotation.eng
dc.description.versionpublishedeng
dc.identifier.urihttps://kops.uni-konstanz.de/handle/123456789/54166
dc.language.isoengeng
dc.rightsterms-of-use
dc.rights.urihttps://rightsstatements.org/page/InC/1.0/
dc.subject.ddc400eng
dc.titleA Penn-style Treebank of Middle Low Germaneng
dc.typeINPROCEEDINGSeng
dspace.entity.typePublication
kops.citation.bibtex
@inproceedings{Booth2020Penns-54166,
  year={2020},
  title={A Penn-style Treebank of Middle Low German},
  url={https://www.aclweb.org/anthology/2020.lrec-1.96/},
  publisher={The European Language Resources Association},
  address={Paris},
  booktitle={Proceedings of the 12th Language Resources and Evaluation Conference},
  pages={766--775},
  editor={Calzolari, Nicoletta and Béchet, Frédéric and Blache, Philippe and Choukri, Khalid and Cieri, Christopher and Declerck, Thierry and Goggi, Sara and Isahara, Hitoshi and Maegaard, Bente and Mariani, Joseph},
  author={Booth, Hannah and Breitbarth, Anne and Ecay, Aaron and Farasyn, Melissa}
}
kops.citation.iso690BOOTH, Hannah, Anne BREITBARTH, Aaron ECAY, Melissa FARASYN, 2020. A Penn-style Treebank of Middle Low German. 12th Language Resources and Evaluation Conference. Marseille, France, 11. Mai 2020 - 16. Mai 2020. In: CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775deu
kops.citation.iso690BOOTH, Hannah, Anne BREITBARTH, Aaron ECAY, Melissa FARASYN, 2020. A Penn-style Treebank of Middle Low German. 12th Language Resources and Evaluation Conference. Marseille, France, May 11, 2020 - May 16, 2020. In: CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775eng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/54166">
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Breitbarth, Anne</dc:contributor>
    <dc:creator>Ecay, Aaron</dc:creator>
    <dc:creator>Breitbarth, Anne</dc:creator>
    <dcterms:abstract xml:lang="eng">We outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200-1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected to be included. In particular, we focus on the decisions involved in the syntactic annotation of the corpus, specifically, the practical and linguistic reasons for adopting the Penn annotation scheme, the stages of the annotation process itself, and how we have adapted the Penn scheme for syntactic features specific to MLG. We also discuss the issue of data uncertainty, which is a major issue when building a corpus of an under-researched language stage like MLG, and some novel ways in which we capture this uncertainty in the annotation.</dcterms:abstract>
    <dc:contributor>Ecay, Aaron</dc:contributor>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-07-01T09:42:54Z</dcterms:available>
    <dc:language>eng</dc:language>
    <dc:creator>Booth, Hannah</dc:creator>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:contributor>Booth, Hannah</dc:contributor>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-07-01T09:42:54Z</dc:date>
    <dcterms:title>A Penn-style Treebank of Middle Low German</dcterms:title>
    <dcterms:issued>2020</dcterms:issued>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/54166"/>
    <dc:contributor>Farasyn, Melissa</dc:contributor>
    <dc:creator>Farasyn, Melissa</dc:creator>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
  </rdf:Description>
</rdf:RDF>
kops.conferencefield12th Language Resources and Evaluation Conference, 11. Mai 2020 - 16. Mai 2020, Marseille, Francedeu
kops.date.conferenceEnd2020-05-16eng
kops.date.conferenceStart2020-05-11eng
kops.flag.knbibliographytrue
kops.location.conferenceMarseille, Franceeng
kops.sourcefieldCALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. <i>Proceedings of the 12th Language Resources and Evaluation Conference</i>. Paris: The European Language Resources Association, 2020, pp. 766-775deu
kops.sourcefield.plainCALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775deu
kops.sourcefield.plainCALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775eng
kops.title.conference12th Language Resources and Evaluation Conferenceeng
kops.urlhttps://www.aclweb.org/anthology/2020.lrec-1.96/eng
kops.urlDate2021-06-30eng
relation.isAuthorOfPublication21cbbb96-60b0-4d00-95cb-94eecfc33d72
relation.isAuthorOfPublication.latestForDiscovery21cbbb96-60b0-4d00-95cb-94eecfc33d72
source.bibliographicInfo.fromPage766eng
source.bibliographicInfo.toPage775eng
source.contributor.editorCalzolari, Nicoletta
source.contributor.editorBéchet, Frédéric
source.contributor.editorBlache, Philippe
source.contributor.editorChoukri, Khalid
source.contributor.editorCieri, Christopher
source.contributor.editorDeclerck, Thierry
source.contributor.editorGoggi, Sara
source.contributor.editorIsahara, Hitoshi
source.contributor.editorMaegaard, Bente
source.contributor.editorMariani, Joseph
source.flag.etalEditortrueeng
source.publisherThe European Language Resources Associationeng
source.publisher.locationPariseng
source.titleProceedings of the 12th Language Resources and Evaluation Conferenceeng

Dateien