A Penn-style Treebank of Middle Low German
| dc.contributor.author | Booth, Hannah | |
| dc.contributor.author | Breitbarth, Anne | |
| dc.contributor.author | Ecay, Aaron | |
| dc.contributor.author | Farasyn, Melissa | |
| dc.date.accessioned | 2021-07-01T09:42:54Z | |
| dc.date.available | 2021-07-01T09:42:54Z | |
| dc.date.issued | 2020 | eng |
| dc.description.abstract | We outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200-1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected for inclusion. In particular, we focus on the decisions involved in the syntactic annotation of the corpus: the practical and linguistic reasons for adopting the Penn annotation scheme, the stages of the annotation process itself, and how we have adapted the Penn scheme for syntactic features specific to MLG. We also discuss data uncertainty, a major concern when building a corpus of an under-researched language stage like MLG, and some novel ways in which we capture this uncertainty in the annotation. | eng |
| dc.description.version | published | eng |
| dc.identifier.uri | https://kops.uni-konstanz.de/handle/123456789/54166 | |
| dc.language.iso | eng | eng |
| dc.rights | terms-of-use | |
| dc.rights.uri | https://rightsstatements.org/page/InC/1.0/ | |
| dc.subject.ddc | 400 | eng |
| dc.title | A Penn-style Treebank of Middle Low German | eng |
| dc.type | INPROCEEDINGS | eng |
| dspace.entity.type | Publication | |
| kops.citation.bibtex | @inproceedings{Booth2020Penns-54166,
year={2020},
title={A Penn-style Treebank of Middle Low German},
url={https://www.aclweb.org/anthology/2020.lrec-1.96/},
publisher={The European Language Resources Association},
address={Paris},
booktitle={Proceedings of the 12th Language Resources and Evaluation Conference},
pages={766--775},
editor={Calzolari, Nicoletta and Béchet, Frédéric and Blache, Philippe and Choukri, Khalid and Cieri, Christopher and Declerck, Thierry and Goggi, Sara and Isahara, Hitoshi and Maegaard, Bente and Mariani, Joseph},
author={Booth, Hannah and Breitbarth, Anne and Ecay, Aaron and Farasyn, Melissa}
} | |
| kops.citation.iso690 | BOOTH, Hannah, Anne BREITBARTH, Aaron ECAY, Melissa FARASYN, 2020. A Penn-style Treebank of Middle Low German. 12th Language Resources and Evaluation Conference. Marseille, France, May 11, 2020 - May 16, 2020. In: CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775 | eng |
| kops.citation.rdf | <rdf:RDF
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:bibo="http://purl.org/ontology/bibo/"
xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:void="http://rdfs.org/ns/void#"
xmlns:xsd="http://www.w3.org/2001/XMLSchema#" >
<rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/54166">
<dc:rights>terms-of-use</dc:rights>
<dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
<foaf:homepage rdf:resource="http://localhost:8080/"/>
<dc:contributor>Breitbarth, Anne</dc:contributor>
<dc:creator>Ecay, Aaron</dc:creator>
<dc:creator>Breitbarth, Anne</dc:creator>
<dcterms:abstract xml:lang="eng">We outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200-1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected for inclusion. In particular, we focus on the decisions involved in the syntactic annotation of the corpus: the practical and linguistic reasons for adopting the Penn annotation scheme, the stages of the annotation process itself, and how we have adapted the Penn scheme for syntactic features specific to MLG. We also discuss data uncertainty, a major concern when building a corpus of an under-researched language stage like MLG, and some novel ways in which we capture this uncertainty in the annotation.</dcterms:abstract>
<dc:contributor>Ecay, Aaron</dc:contributor>
<dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
<dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-07-01T09:42:54Z</dcterms:available>
<dc:language>eng</dc:language>
<dc:creator>Booth, Hannah</dc:creator>
<dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
<dc:contributor>Booth, Hannah</dc:contributor>
<dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-07-01T09:42:54Z</dc:date>
<dcterms:title>A Penn-style Treebank of Middle Low German</dcterms:title>
<dcterms:issued>2020</dcterms:issued>
<bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/54166"/>
<dc:contributor>Farasyn, Melissa</dc:contributor>
<dc:creator>Farasyn, Melissa</dc:creator>
<void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
</rdf:Description>
</rdf:RDF> | |
| kops.conferencefield | 12th Language Resources and Evaluation Conference, May 11, 2020 - May 16, 2020, Marseille, France | deu |
| kops.date.conferenceEnd | 2020-05-16 | eng |
| kops.date.conferenceStart | 2020-05-11 | eng |
| kops.flag.knbibliography | true | |
| kops.location.conference | Marseille, France | eng |
| kops.sourcefield | CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. <i>Proceedings of the 12th Language Resources and Evaluation Conference</i>. Paris: The European Language Resources Association, 2020, pp. 766-775 | deu |
| kops.sourcefield.plain | CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed., Khalid CHOUKRI, ed., Christopher CIERI, ed., Thierry DECLERCK, ed., Sara GOGGI, ed., Hitoshi ISAHARA, ed., Bente MAEGAARD, ed., Joseph MARIANI, ed. and others. Proceedings of the 12th Language Resources and Evaluation Conference. Paris: The European Language Resources Association, 2020, pp. 766-775 | eng |
| kops.title.conference | 12th Language Resources and Evaluation Conference | eng |
| kops.url | https://www.aclweb.org/anthology/2020.lrec-1.96/ | eng |
| kops.urlDate | 2021-06-30 | eng |
| relation.isAuthorOfPublication | 21cbbb96-60b0-4d00-95cb-94eecfc33d72 | |
| relation.isAuthorOfPublication.latestForDiscovery | 21cbbb96-60b0-4d00-95cb-94eecfc33d72 | |
| source.bibliographicInfo.fromPage | 766 | eng |
| source.bibliographicInfo.toPage | 775 | eng |
| source.contributor.editor | Calzolari, Nicoletta | |
| source.contributor.editor | Béchet, Frédéric | |
| source.contributor.editor | Blache, Philippe | |
| source.contributor.editor | Choukri, Khalid | |
| source.contributor.editor | Cieri, Christopher | |
| source.contributor.editor | Declerck, Thierry | |
| source.contributor.editor | Goggi, Sara | |
| source.contributor.editor | Isahara, Hitoshi | |
| source.contributor.editor | Maegaard, Bente | |
| source.contributor.editor | Mariani, Joseph | |
| source.flag.etalEditor | true | eng |
| source.publisher | The European Language Resources Association | eng |
| source.publisher.location | Paris | eng |
| source.title | Proceedings of the 12th Language Resources and Evaluation Conference | eng |