Extraction of Main Event Descriptors from News Articles by Answering the Journalistic Five W and One H Questions

Lade...
Vorschaubild
Dateien
Hamborg_2-cpytktbqkenn6.pdf
Hamborg_2-cpytktbqkenn6.pdfGröße: 114.9 KBDownloads: 614
Datum
2018
Herausgeber:innen
Kontakt
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
ArXiv-ID
Internationale Patentnummer
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Open Access Green
Core Facility der Universität Konstanz
Gesperrt bis
Titel in einer weiteren Sprache
Forschungsvorhaben
Organisationseinheiten
Zeitschriftenheft
Publikationstyp
Beitrag zu einem Konferenzband
Publikationsstatus
Published
Erschienen in
CHEN, Jiangping, ed. and others. Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries - JCDL '18. New York: ACM Press, 2018, pp. 339-340. ISBN 978-1-4503-5178-2. Available under: doi: 10.1145/3197026.3203899
Zusammenfassung

The identification and extraction of the events that news articles report on is a commonly performed task in the analysis workflow of various projects that analyze news articles. However, due to the lack of universally usable and publicly available methods for news articles, many researchers must redundantly implement methods for event extraction to be used within their projects. Answers to the journalistic five W and one H questions (5W1H) describe the main event of a news story, i.e., who did what, when, where, why, and how. We propose Giveme5W1H, an open-source system that uses syntactic and domain-specific rules to extract phrases answering the 5W1H. In our evaluation, we find that the extraction precision of 5W1H phrases is p=0.64, and p=0.79 for the first four W questions, which discretely describe an event.

Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
004 Informatik
Schlagwörter
Konferenz
18th ACM/IEEE on Joint Conference on Digital Libraries, 3. Juni 2018 - 7. Juni 2018, Fort Worth, USA
Rezension
undefined / . - undefined, undefined
Zitieren
ISO 690HAMBORG, Felix, Corinna BREITINGER, Moritz SCHUBOTZ, Soeren LACHNIT, Bela GIPP, 2018. Extraction of Main Event Descriptors from News Articles by Answering the Journalistic Five W and One H Questions. 18th ACM/IEEE on Joint Conference on Digital Libraries. Fort Worth, USA, 3. Juni 2018 - 7. Juni 2018. In: CHEN, Jiangping, ed. and others. Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries - JCDL '18. New York: ACM Press, 2018, pp. 339-340. ISBN 978-1-4503-5178-2. Available under: doi: 10.1145/3197026.3203899
BibTex
@inproceedings{Hamborg2018Extra-43285,
  year={2018},
  doi={10.1145/3197026.3203899},
  title={Extraction of Main Event Descriptors from News Articles by Answering the Journalistic Five W and One H Questions},
  isbn={978-1-4503-5178-2},
  publisher={ACM Press},
  address={New York},
  booktitle={Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries  - JCDL '18},
  pages={339--340},
  editor={Chen, Jiangping},
  author={Hamborg, Felix and Breitinger, Corinna and Schubotz, Moritz and Lachnit, Soeren and Gipp, Bela}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/43285">
    <dc:contributor>Lachnit, Soeren</dc:contributor>
    <dc:creator>Gipp, Bela</dc:creator>
    <dc:contributor>Hamborg, Felix</dc:contributor>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:contributor>Schubotz, Moritz</dc:contributor>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/43285/1/Hamborg_2-cpytktbqkenn6.pdf"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/43285/1/Hamborg_2-cpytktbqkenn6.pdf"/>
    <dcterms:title>Extraction of Main Event Descriptors from News Articles by Answering the Journalistic Five W and One H Questions</dcterms:title>
    <dcterms:issued>2018</dcterms:issued>
    <dc:contributor>Breitinger, Corinna</dc:contributor>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2018-09-17T13:49:06Z</dc:date>
    <dc:language>eng</dc:language>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2018-09-17T13:49:06Z</dcterms:available>
    <dc:creator>Hamborg, Felix</dc:creator>
    <dc:creator>Schubotz, Moritz</dc:creator>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:creator>Breitinger, Corinna</dc:creator>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dcterms:abstract xml:lang="eng">The identification and extraction of the events that news articles report on is a commonly performed task in the analysis workflow of various projects that analyze news articles. However, due to the lack of universally usable and publicly available methods for news articles, many researchers must redundantly implement methods for event extraction to be used within their projects. Answers to the journalistic five W and one H questions (5W1H) describe the main event of a news story, i.e., who did what, when, where, why, and how. We propose Giveme5W1H, an open-source system that uses syntactic and domain-specific rules to extract phrases answering the 5W1H. In our evaluation, we find that the extraction precision of 5W1H phrases is p=0.64, and p=0.79 for the first four W questions, which discretely describe an event.</dcterms:abstract>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/43285"/>
    <dc:contributor>Gipp, Bela</dc:contributor>
    <dc:creator>Lachnit, Soeren</dc:creator>
    <dc:rights>terms-of-use</dc:rights>
  </rdf:Description>
</rdf:RDF>
Interner Vermerk
xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter
Kontakt
URL der Originalveröffentl.
Prüfdatum der URL
Prüfungsdatum der Dissertation
Finanzierungsart
Kommentar zur Publikation
Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Ja
Begutachtet
Diese Publikation teilen