Do the Math : Making Mathematics in Wikipedia Computable

Lade...
Vorschaubild
Dateien
Greiner-Petter_2-1nrkoqt9bxcoi0.pdf
Greiner-Petter_2-1nrkoqt9bxcoi0.pdfGröße: 1.15 MBDownloads: 18
Datum
2023
Herausgeber:innen
Kontakt
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
ArXiv-ID
Internationale Patentnummer
EU-Projektnummer
DFG-Projektnummer
Projekt
Open Access-Veröffentlichung
Gesperrt bis
Titel in einer weiteren Sprache
Forschungsvorhaben
Organisationseinheiten
Zeitschriftenheft
Publikationstyp
Zeitschriftenartikel
Publikationsstatus
Published
Erschienen in
IEEE Transactions on Pattern Analysis and Machine Intelligence. IEEE. 2023, 45(4), pp. 4384-4395. ISSN 0162-8828. eISSN 2160-9292. Available under: doi: 10.1109/tpami.2022.3195261
Zusammenfassung

Wikipedia combines the power of AI solutions and human reviewers to safeguard article quality. Quality control objectives include detecting malicious edits, fixing typos, and spotting inconsistent formatting. However, no automated quality control mechanisms currently exist for mathematical formulae. Spell checkers are widely used to highlight textual errors, yet no equivalent tool exists to detect algebraically incorrect formulae. Our paper addresses this shortcoming by making mathematical formulae computable. We present a method that (1) gathers the semantic information surrounding the context of each mathematical formulae, (2) provides access to the information in a graph-structured dependency hierarchy, and (3) performs automatic plausibility checks on equations. We evaluate the performance of our approach on 6,337 mathematical expressions contained in 104 Wikipedia articles on the topic of orthogonal polynomials and special functions. Our system, LACAST , verified 358 out of 1,516 equations as error-free. LACAST successfully translated 27% of the mathematical expressions and outperformed existing translation approaches by 16%. Additionally, LACAST achieved an F1 score of .495 for annotating mathematical expressions with relevant textual descriptions, which is a significant step towards advancing searchability, readability, and accessibility of mathematical formulae in Wikipedia. A prototype of LACAST and the semantically enhanced Wikipedia articles are available at: https://tpami.wmflabs.org .

Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
004 Informatik
Schlagwörter
Mathematical information retrieval, presentation to computation translation, mathematical objects of interest, mathematical representation transformation, computer algebra systems
Konferenz
Rezension
undefined / . - undefined, undefined
Zitieren
ISO 690GREINER-PETTER, Andre, Moritz SCHUBOTZ, Corinna BREITINGER, Philipp SCHARPF, Akiko AIZAWA, Bela GIPP, 2023. Do the Math : Making Mathematics in Wikipedia Computable. In: IEEE Transactions on Pattern Analysis and Machine Intelligence. IEEE. 2023, 45(4), pp. 4384-4395. ISSN 0162-8828. eISSN 2160-9292. Available under: doi: 10.1109/tpami.2022.3195261
BibTex
@article{GreinerPetter2023Makin-66873,
  year={2023},
  doi={10.1109/tpami.2022.3195261},
  title={Do the Math : Making Mathematics in Wikipedia Computable},
  number={4},
  volume={45},
  issn={0162-8828},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  pages={4384--4395},
  author={Greiner-Petter, Andre and Schubotz, Moritz and Breitinger, Corinna and Scharpf, Philipp and Aizawa, Akiko and Gipp, Bela}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/66873">
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:contributor>Breitinger, Corinna</dc:contributor>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-05-11T11:32:04Z</dcterms:available>
    <dcterms:title>Do the Math : Making Mathematics in Wikipedia Computable</dcterms:title>
    <dc:language>eng</dc:language>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:creator>Aizawa, Akiko</dc:creator>
    <dc:creator>Schubotz, Moritz</dc:creator>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-05-11T11:32:04Z</dc:date>
    <dcterms:issued>2023</dcterms:issued>
    <dcterms:abstract>Wikipedia combines the power of AI solutions and human reviewers to safeguard article quality. Quality control objectives include detecting malicious edits, fixing typos, and spotting inconsistent formatting. However, no automated quality control mechanisms currently exist for mathematical formulae. Spell checkers are widely used to highlight textual errors, yet no equivalent tool exists to detect algebraically incorrect formulae. Our paper addresses this shortcoming by making mathematical formulae computable. We present a method that (1) gathers the semantic information surrounding the context of each mathematical formulae, (2) provides access to the information in a graph-structured dependency hierarchy, and (3) performs automatic plausibility checks on equations. We evaluate the performance of our approach on 6,337 mathematical expressions contained in 104 Wikipedia articles on the topic of orthogonal polynomials and special functions. Our system, LACAST , verified 358 out of 1,516 equations as error-free. LACAST successfully translated 27% of the mathematical expressions and outperformed existing translation approaches by 16%. Additionally, LACAST achieved an F1 score of .495 for annotating mathematical expressions with relevant textual descriptions, which is a significant step towards advancing searchability, readability, and accessibility of mathematical formulae in Wikipedia. A prototype of LACAST and the semantically enhanced Wikipedia articles are available at: https://tpami.wmflabs.org .</dcterms:abstract>
    <dc:creator>Breitinger, Corinna</dc:creator>
    <dc:creator>Scharpf, Philipp</dc:creator>
    <dc:creator>Greiner-Petter, Andre</dc:creator>
    <dc:contributor>Gipp, Bela</dc:contributor>
    <dc:contributor>Scharpf, Philipp</dc:contributor>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/66873"/>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/66873/1/Greiner-Petter_2-1nrkoqt9bxcoi0.pdf"/>
    <dc:contributor>Schubotz, Moritz</dc:contributor>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:contributor>Greiner-Petter, Andre</dc:contributor>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/66873/1/Greiner-Petter_2-1nrkoqt9bxcoi0.pdf"/>
    <dc:creator>Gipp, Bela</dc:creator>
    <dc:contributor>Aizawa, Akiko</dc:contributor>
  </rdf:Description>
</rdf:RDF>
Interner Vermerk
xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter
Kontakt
URL der Originalveröffentl.
Prüfdatum der URL
Prüfungsdatum der Dissertation
Finanzierungsart
Kommentar zur Publikation
Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Begutachtet
Unbekannt