Extraction of Main Event Descriptors from News Articles by Answering the Journalistic Five W and One H Questions

Date
2018
Publication type
Contribution to a conference collection
Publication status
Published
Published in
Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries - JCDL '18 / Chen, Jiangping et al. (ed.). - New York : ACM Press, 2018. - pp. 339-340. - ISBN 978-1-4503-5178-2
Abstract
The identification and extraction of the events that news articles report on is a commonly performed task in the analysis workflow of various projects that analyze news articles. However, due to the lack of universally usable and publicly available methods for news articles, many researchers must redundantly implement methods for event extraction to be used within their projects. Answers to the journalistic five W and one H questions (5W1H) describe the main event of a news story, i.e., who did what, when, where, why, and how. We propose Giveme5W1H, an open-source system that uses syntactic and domain-specific rules to extract phrases answering the 5W1H. In our evaluation, we find that the extraction precision of 5W1H phrases is p=0.64, and p=0.79 for the first four W questions, which discretely describe an event.
Subject (DDC)
004 Computer Science
Conference
18th ACM/IEEE on Joint Conference on Digital Libraries, Jun 3, 2018 - Jun 7, 2018, Fort Worth, USA
Cite This
ISO 690
HAMBORG, Felix, Corinna BREITINGER, Moritz SCHUBOTZ, Soeren LACHNIT, Bela GIPP, 2018. Extraction of Main Event Descriptors from News Articles by Answering the Journalistic Five W and One H Questions. 18th ACM/IEEE on Joint Conference on Digital Libraries. Fort Worth, USA, Jun 3, 2018 - Jun 7, 2018. In: CHEN, Jiangping, ed. and others. Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries - JCDL '18. New York: ACM Press, pp. 339-340. ISBN 978-1-4503-5178-2. Available under: doi: 10.1145/3197026.3203899
BibTeX
@inproceedings{Hamborg2018Extra-43285,
  year={2018},
  doi={10.1145/3197026.3203899},
  title={Extraction of Main Event Descriptors from News Articles by Answering the Journalistic Five W and One H Questions},
  isbn={978-1-4503-5178-2},
  publisher={ACM Press},
  address={New York},
  booktitle={Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries - JCDL '18},
  pages={339--340},
  editor={Chen, Jiangping},
  author={Hamborg, Felix and Breitinger, Corinna and Schubotz, Moritz and Lachnit, Soeren and Gipp, Bela}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/43285">
    <dc:contributor>Lachnit, Soeren</dc:contributor>
    <dc:creator>Gipp, Bela</dc:creator>
    <dc:contributor>Hamborg, Felix</dc:contributor>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:contributor>Schubotz, Moritz</dc:contributor>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/43285/1/Hamborg_2-cpytktbqkenn6.pdf"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/43285/1/Hamborg_2-cpytktbqkenn6.pdf"/>
    <dcterms:title>Extraction of Main Event Descriptors from News Articles by Answering the Journalistic Five W and One H Questions</dcterms:title>
    <dcterms:issued>2018</dcterms:issued>
    <dc:contributor>Breitinger, Corinna</dc:contributor>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2018-09-17T13:49:06Z</dc:date>
    <dc:language>eng</dc:language>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2018-09-17T13:49:06Z</dcterms:available>
    <dc:creator>Hamborg, Felix</dc:creator>
    <dc:creator>Schubotz, Moritz</dc:creator>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:creator>Breitinger, Corinna</dc:creator>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dcterms:abstract xml:lang="eng">The identification and extraction of the events that news articles report on is a commonly performed task in the analysis workflow of various projects that analyze news articles. However, due to the lack of universally usable and publicly available methods for news articles, many researchers must redundantly implement methods for event extraction to be used within their projects. Answers to the journalistic five W and one H questions (5W1H) describe the main event of a news story, i.e., who did what, when, where, why, and how. We propose Giveme5W1H, an open-source system that uses syntactic and domain-specific rules to extract phrases answering the 5W1H. In our evaluation, we find that the extraction precision of 5W1H phrases is p=0.64, and p=0.79 for the first four W questions, which discretely describe an event.</dcterms:abstract>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/43285"/>
    <dc:contributor>Gipp, Bela</dc:contributor>
    <dc:creator>Lachnit, Soeren</dc:creator>
    <dc:rights>terms-of-use</dc:rights>
  </rdf:Description>
</rdf:RDF>
Bibliography of Konstanz
Yes
Refereed