Publication: Towards multi-lingual summarization : a comparative analysis of sentence extraction methods on English and Hebrew corpora
Abstract
The trend toward the growing multilinguality of the Internet requires text summarization techniques that work equally well in multiple languages. Only some of the automated summarization methods proposed in the literature, however, can be defined as “language-independent”, as they are not based on any morphological analysis of the summarized text. In this paper, we perform an in-depth comparative analysis of language-independent sentence scoring methods for extractive single-document summarization. We evaluate 15 published summarization methods proposed in the literature and 16 methods introduced in (Litvak et al., 2010). The evaluation is performed on English and Hebrew corpora. The results suggest that the performance ranking of the compared methods is quite similar in both languages. The top ten bilingual scoring methods include six methods introduced in (Litvak et al., 2010).
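To make the idea of a language-independent sentence scoring method concrete, the following is a minimal sketch of one generic scorer based on word frequency. It is an illustration only and is not claimed to be any of the methods evaluated in the paper; the function names (`split_sentences`, `tokenize`, `score_sentences`, `summarize`) and the naive punctuation-based sentence splitter are assumptions made for this example. It uses no morphological analysis, so the same code runs on English and Hebrew text.

```python
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter (assumption): break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    # \w+ is Unicode-aware in Python 3, so Hebrew letters are matched as well.
    return [w.lower() for w in re.findall(r"\w+", sentence)]


def score_sentences(sentences):
    # Score each sentence by the mean document frequency of its words.
    freq = Counter(w for s in sentences for w in tokenize(s))
    scores = []
    for s in sentences:
        words = tokenize(s)
        scores.append(sum(freq[w] for w in words) / len(words) if words else 0.0)
    return scores


def summarize(text, k=1):
    # Extract the k highest-scoring sentences, kept in their original order.
    sentences = split_sentences(text)
    scores = score_sentences(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(text, k=2)` returns the two highest-scoring sentences of `text` in document order. Averaging by sentence length is a design choice in this sketch so that long sentences are not favored merely for containing more words.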
Cite
ISO 690
LITVAK, Marina, Hagay LIPMAN, Assaf BEN GUR, Mark LAST, Slava KISILEVICH, Daniel A. KEIM, 2010. Towards multi-lingual summarization : a comparative analysis of sentence extraction methods on English and Hebrew corpora. In: Proceedings of the 4th Workshop on Cross Lingual Information Access. Beijing: COLING, 2010, pp. 61-69
BibTeX
@inproceedings{Litvak2010Towar-20227,
  year={2010},
  title={Towards multi-lingual summarization : a comparative analysis of sentence extraction methods on English and Hebrew corpora},
  publisher={COLING},
  address={Beijing},
  booktitle={Proceedings of the 4th Workshop on Cross Lingual Information Access},
  pages={61--69},
  author={Litvak, Marina and Lipman, Hagay and Ben Gur, Assaf and Last, Mark and Kisilevich, Slava and Keim, Daniel A.},
  note={Paper presented at: 4th International Workshop On Cross Lingual Information Access, Beijing, China, 2010, http://www.aclweb.org/anthology/W10-4010}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" >
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/20227">
    <dcterms:abstract xml:lang="eng">The trend toward the growing multilinguality of the Internet requires text summarization techniques that work equally well in multiple languages. Only some of the automated summarization methods proposed in the literature, however, can be defined as “language-independent”, as they are not based on any morphological analysis of the summarized text. In this paper, we perform an in-depth comparative analysis of language-independent sentence scoring methods for extractive single-document summarization. We evaluate 15 published summarization methods proposed in the literature and 16 methods introduced in (Litvak et al., 2010). The evaluation is performed on English and Hebrew corpora. The results suggest that the performance ranking of the compared methods is quite similar in both languages. The top ten bilingual scoring methods include six methods introduced in (Litvak et al., 2010).</dcterms:abstract>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dcterms:title>Towards multi-lingual summarization : a comparative analysis of sentence extraction methods on English and Hebrew corpora</dcterms:title>
    <dc:creator>Litvak, Marina</dc:creator>
    <dc:contributor>Last, Mark</dc:contributor>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-08-24T10:26:30Z</dc:date>
    <dc:contributor>Litvak, Marina</dc:contributor>
    <dc:creator>Lipman, Hagay</dc:creator>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:contributor>Lipman, Hagay</dc:contributor>
    <dc:contributor>Kisilevich, Slava</dc:contributor>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/20227"/>
    <dcterms:bibliographicCitation>Proceedings of the 4th Workshop on Cross Lingual Information Access. - Beijing : COLING, 2010. - S. 61-69</dcterms:bibliographicCitation>
    <dc:creator>Last, Mark</dc:creator>
    <dc:language>eng</dc:language>
    <dc:creator>Keim, Daniel A.</dc:creator>
    <dc:contributor>Keim, Daniel A.</dc:contributor>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-08-24T10:26:30Z</dcterms:available>
    <dc:contributor>Ben Gur, Assaf</dc:contributor>
    <dcterms:issued>2010</dcterms:issued>
    <dc:creator>Kisilevich, Slava</dc:creator>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/20227/1/Keim_219.pdf"/>
    <dc:creator>Ben Gur, Assaf</dc:creator>
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/20227/1/Keim_219.pdf"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
  </rdf:Description>
</rdf:RDF>