Publikation:

Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context

Lade...
Vorschaubild

Dateien

Schubotz_2-jm8xs95avu406.pdf
Schubotz_2-jm8xs95avu406.pdfGröße: 600.03 KBDownloads: 322

Datum

2018

Herausgeber:innen

Kontakt

ISSN der Zeitschrift

Electronic ISSN

ISBN

Bibliografische Daten

Verlag

Schriftenreihe

Auflagebezeichnung

Internationale Patentnummer

Angaben zur Forschungsförderung

Projekt

Open Access-Veröffentlichung
Open Access Green
Core Facility der Universität Konstanz

Gesperrt bis

Titel in einer weiteren Sprache

Publikationstyp
Beitrag zu einem Konferenzband
Publikationsstatus
Published

Erschienen in

CHEN, Jiangping, ed. and others. Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries - JCDL '18. New York: ACM Press, 2018, pp. 233-242. ISBN 978-1-4503-5178-2. Available under: doi: 10.1145/3197026.3197058

Zusammenfassung

Mathematical formulae represent complex semantic information in a concise form. Especially in Science, Technology, Engineering, and Mathematics, mathematical formulae are crucial to communicate information, e.g., in scientific papers, and to perform computations using computer algebra systems. Enabling computers to access the information encoded in mathematical formulae requires machine-readable formats that can represent both the presentation and content, i.e., the semantics, of formulae. Exchanging such information between systems additionally requires conversion methods for mathematical representation formats. We analyze how the semantic enrichment of formulae improves the format conversion process and show that considering the textual context of formulae reduces the error rate of such conversions. Our main contributions are: (1) providing an openly available benchmark dataset for the mathematical format conversion task consisting of a newly created test collection, an extensive, manually curated gold standard and task-specific evaluation metrics; (2) performing a quantitative evaluation of state-of-the-art tools for mathematical format conversions; (3) presenting a new approach that considers the textual context of formulae to reduce the error rate for mathematical format conversions. Our benchmark dataset facilitates future research on mathematical format conversions as well as research on many problems in mathematical information retrieval. Because we annotated and linked all components of formulae, e.g., identifiers, operators and other entities, to Wikidata entries, the gold standard can, for instance, be used to train methods for formula concept discovery and recognition. Such methods can then be applied to improve mathematical information retrieval systems, e.g., for semantic formula search, recommendation of mathematical content, or detection of mathematical plagiarism.

Zusammenfassung in einer weiteren Sprache

Fachgebiet (DDC)
004 Informatik

Schlagwörter

Konferenz

18th ACM/IEEE on Joint Conference on Digital Libraries, 3. Juni 2018 - 7. Juni 2018, Fort Worth, USA
Rezension
undefined / . - undefined, undefined

Forschungsvorhaben

Organisationseinheiten

Zeitschriftenheft

Zugehörige Datensätze in KOPS

Zitieren

ISO 690SCHUBOTZ, Moritz, André GREINER-PETTER, Philipp SCHARPF, Norman MEUSCHKE, Howard S. COHL, Bela GIPP, 2018. Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context. 18th ACM/IEEE on Joint Conference on Digital Libraries. Fort Worth, USA, 3. Juni 2018 - 7. Juni 2018. In: CHEN, Jiangping, ed. and others. Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries - JCDL '18. New York: ACM Press, 2018, pp. 233-242. ISBN 978-1-4503-5178-2. Available under: doi: 10.1145/3197026.3197058
BibTex
@inproceedings{Schubotz2018-04-13Impro-43286,
  year={2018},
  doi={10.1145/3197026.3197058},
  title={Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context},
  isbn={978-1-4503-5178-2},
  publisher={ACM Press},
  address={New York},
  booktitle={Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries  - JCDL '18},
  pages={233--242},
  editor={Chen, Jiangping},
  author={Schubotz, Moritz and Greiner-Petter, André and Scharpf, Philipp and Meuschke, Norman and Cohl, Howard S. and Gipp, Bela}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/43286">
    <dc:creator>Scharpf, Philipp</dc:creator>
    <dc:creator>Gipp, Bela</dc:creator>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:contributor>Gipp, Bela</dc:contributor>
    <dcterms:issued>2018-04-13</dcterms:issued>
    <dc:creator>Greiner-Petter, André</dc:creator>
    <dc:contributor>Scharpf, Philipp</dc:contributor>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/43286"/>
    <dcterms:abstract xml:lang="eng">Mathematical formulae represent complex semantic information in a concise form. Especially in Science, Technology, Engineering, and Mathematics, mathematical formulae are crucial to communicate information, e.g., in scientific papers, and to perform computations using computer algebra systems. Enabling computers to access the information encoded in mathematical formulae requires machine-readable formats that can represent both the presentation and content, i.e., the semantics, of formulae. Exchanging such information between systems additionally requires conversion methods for mathematical representation formats. We analyze how the semantic enrichment of formulae improves the format conversion process and show that considering the textual context of formulae reduces the error rate of such conversions. Our main contributions are: (1) providing an openly available benchmark dataset for the mathematical format conversion task consisting of a newly created test collection, an extensive, manually curated gold standard and task-specific evaluation metrics; (2) performing a quantitative evaluation of state-of-the-art tools for mathematical format conversions; (3) presenting a new approach that considers the textual context of formulae to reduce the error rate for mathematical format conversions. Our benchmark dataset facilitates future research on mathematical format conversions as well as research on many problems in mathematical information retrieval. Because we annotated and linked all components of formulae, e.g., identifiers, operators and other entities, to Wikidata entries, the gold standard can, for instance, be used to train methods for formula concept discovery and recognition. Such methods can then be applied to improve mathematical information retrieval systems, e.g., for semantic formula search, recommendation of mathematical content, or detection of mathematical plagiarism.</dcterms:abstract>
    <dc:creator>Cohl, Howard S.</dc:creator>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2018-09-17T13:53:57Z</dc:date>
    <dc:creator>Schubotz, Moritz</dc:creator>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/43286/1/Schubotz_2-jm8xs95avu406.pdf"/>
    <dc:language>eng</dc:language>
    <dc:contributor>Cohl, Howard S.</dc:contributor>
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/43286/1/Schubotz_2-jm8xs95avu406.pdf"/>
    <dcterms:title>Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context</dcterms:title>
    <dc:contributor>Schubotz, Moritz</dc:contributor>
    <dc:creator>Meuschke, Norman</dc:creator>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2018-09-17T13:53:57Z</dcterms:available>
    <dc:contributor>Greiner-Petter, André</dc:contributor>
    <dc:contributor>Meuschke, Norman</dc:contributor>
  </rdf:Description>
</rdf:RDF>

Interner Vermerk

xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter

Kontakt
URL der Originalveröffentl.

Prüfdatum der URL

Prüfungsdatum der Dissertation

Finanzierungsart

Kommentar zur Publikation

Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Ja
Begutachtet
Diese Publikation teilen