Publication: Transmission problems? : An embedded approach for unification of Latin prefixes and text variants during text matching
Abstract
The manuscript tradition of pre-modern texts poses a specific problem for scholars in the field of Digital Humanities: before printing made the production of standardized editions of texts feasible, copying texts by hand (and often by different people) was inherently an error-prone process, which not only led to differences in wording but also in spelling—across multiple transmitted variants. This applies especially to ancient texts, where the temporal distances to the archetypes tend to be fairly large. In computerized research, especially in the case of text matching within the field of citation research and text mining, these differences in wording and spelling—however small they might be—may prevent a successful matching of texts. This case study presents a solution for the problem of textual differences arising from (non-)assimilated prefixes in Latin, a feature where modern editions mostly differ from author to author, but sometimes even between two editions of the same text. With regard to the letters of the church father Jerome as well as Virgil’s Eclogues, Georgics, and Aeneid, two approaches are compared in terms of error rate and efficiency for a given set of prefixes: (1) performing and (2) reversing corpus-wide assimilation. Moreover, the broader implications of the (in-)accessibility of text-critical data in digital editions are discussed. Finally, general desiderata regarding text-critical data for computerized research on classical texts are elaborated.
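The two directions named in the abstract, performing and reversing assimilation, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the prefix table `PREFIX_PAIRS`, the function names `assimilate` and `deassimilate`, and the lowercasing step are assumptions made for illustration, and the table covers only a small hypothetical subset of Latin prefix variants.

```python
import re

# Hypothetical, illustrative subset of prefix spellings:
# (non-assimilated form, assimilated form). The prefix set used in the
# study is not reproduced here.
PREFIX_PAIRS = [
    ("adf", "aff"),    # adfero    / affero
    ("adp", "app"),    # adpono    / appono
    ("adt", "att"),    # adtendo   / attendo
    ("conl", "coll"),  # conloquor / colloquor
    ("conm", "comm"),  # conmitto  / committo
    ("inl", "ill"),    # inlustris / illustris
    ("inm", "imm"),    # inmitto   / immitto
    ("inp", "imp"),    # inpono    / impono
    ("inr", "irr"),    # inrumpo   / irrumpo
    ("subm", "summ"),  # submitto  / summitto
]


def _rewrite(text, pairs):
    """Replace each word-initial source spelling with its target spelling."""
    table = dict(pairs)
    # Longest alternatives first, so a longer prefix is never shadowed.
    alternatives = sorted(table, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(a) for a in alternatives) + r")(?=\w)"
    )
    return pattern.sub(lambda m: table[m.group(1)], text)


def assimilate(text):
    """Approach (1): rewrite non-assimilated prefixes as assimilated ones."""
    return _rewrite(text.lower(), PREFIX_PAIRS)


def deassimilate(text):
    """Approach (2): rewrite assimilated prefixes as non-assimilated ones."""
    reversed_pairs = [(assim, plain) for plain, assim in PREFIX_PAIRS]
    return _rewrite(text.lower(), reversed_pairs)


if __name__ == "__main__":
    # Two spellings of the same word compare equal after unification.
    print(assimilate("inmitto") == assimilate("immitto"))
    print(deassimilate("affero") == deassimilate("adfero"))
```

Because the sketch works purely on spelling and consults no lexicon, it also rewrites words that merely begin with the same letters as a prefix variant: `deassimilate("illa")` yields `inla`, for example. Any comparison of the two directions therefore has to account for such false rewrites.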
Cite
ISO 690
SCHROPP, Franziska, Thomas E. KONRAD, Marie REVELLIO, Barbara FEICHTINGER, 2024. Transmission problems? : An embedded approach for unification of Latin prefixes and text variants during text matching. In: Digital Scholarship in the Humanities. Oxford University Press. 2024, 39(1), pp. 348-353. ISSN 2055-7671. eISSN 2055-768X. Available under: doi: 10.1093/llc/fqad069
BibTeX
@article{Schropp2024-04-02Trans-67930,
  year={2024},
  doi={10.1093/llc/fqad069},
  title={Transmission problems? : An embedded approach for unification of Latin prefixes and text variants during text matching},
  number={1},
  volume={39},
  issn={2055-7671},
  journal={Digital Scholarship in the Humanities},
  pages={348--353},
  author={Schropp, Franziska and Konrad, Thomas E. and Revellio, Marie and Feichtinger, Barbara}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#">
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/67930">
    <dc:contributor>Revellio, Marie</dc:contributor>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/38"/>
    <dc:creator>Konrad, Thomas E.</dc:creator>
    <dcterms:title>Transmission problems? : An embedded approach for unification of Latin prefixes and text variants during text matching</dcterms:title>
    <dc:language>eng</dc:language>
    <dc:contributor>Feichtinger, Barbara</dc:contributor>
    <dcterms:abstract>The manuscript tradition of pre-modern texts poses a specific problem for scholars in the field of Digital Humanities: before printing made the production of standardized editions of texts feasible, copying texts by hand (and often by different people) was inherently an error-prone process, which not only led to differences in wording but also in spelling—across multiple transmitted variants. This applies especially to ancient texts, where the temporal distances to the archetypes tend to be fairly large. In computerized research, especially in the case of text matching within the field of citation research and text mining, these differences in wording and spelling—however small they might be—may prevent a successful matching of texts.
This case study presents a solution for the problem of textual differences arising from (non-)assimilated prefixes in Latin, a feature where modern editions mostly differ from author to author, but sometimes even between two editions of the same text. With regard to the letters of the church father Jerome as well as Virgil’s Eclogues, Georgics, and Aeneid, two approaches are compared in terms of error rate and efficiency for a given set of prefixes: (1) performing and (2) reversing corpus-wide assimilation. Moreover, the broader implications of the (in-)accessibility of text-critical data in digital editions are discussed. Finally, general desiderata regarding text-critical data for computerized research on classical texts are elaborated.</dcterms:abstract>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/67930"/>
    <dc:creator>Schropp, Franziska</dc:creator>
    <dc:contributor>Konrad, Thomas E.</dc:contributor>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-10-13T12:48:19Z</dcterms:available>
    <dc:creator>Revellio, Marie</dc:creator>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-10-13T12:48:19Z</dc:date>
    <dcterms:issued>2024-04-02</dcterms:issued>
    <dc:creator>Feichtinger, Barbara</dc:creator>
    <dc:contributor>Schropp, Franziska</dc:contributor>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/38"/>
  </rdf:Description>
</rdf:RDF>