Generalizable Sarcasm Detection is Just Around the Corner, of Course!

Jang, Hyewon; Frassinelli, Diego

doi:10.18653/v1/2024.naacl-long.238

dc.contributor.author	Jang, Hyewon
dc.contributor.author	Frassinelli, Diego
dc.date.accessioned	2026-03-03T09:07:01Z
dc.date.available	2026-03-03T09:07:01Z
dc.date.issued	2024
dc.description.abstract	We tested the robustness of sarcasm detection models by examining their behavior when fine-tuned on four sarcasm datasets containing varying characteristics of sarcasm: label source (authors vs. third-party), domain (social media/online vs. offline conversations/dialogues), style (aggressive vs. humorous mocking). We tested their prediction performance on the same dataset (intra-dataset) and across different datasets (cross-dataset). For intra-dataset predictions, models consistently performed better when fine-tuned with third-party labels rather than with author labels. For cross-dataset predictions, most models failed to generalize well to the other datasets, implying that one type of dataset cannot represent all sorts of sarcasm with different styles and domains. Compared to the existing datasets, models fine-tuned on the new dataset we release in this work showed the highest generalizability to other datasets. With a manual inspection of the datasets and post-hoc analysis, we attributed the difficulty in generalization to the fact that sarcasm actually comes in different domains and styles. We argue that future sarcasm research should take the broad scope of sarcasm into account.
dc.description.version	published	deu
dc.identifier.doi	10.18653/v1/2024.naacl-long.238
dc.identifier.uri	https://kops.uni-konstanz.de/handle/123456789/76428
dc.language.iso	eng
dc.subject.ddc	400
dc.title	Generalizable Sarcasm Detection is Just Around the Corner, of Course!	eng
dc.type	INPROCEEDINGS
dspace.entity.type	Publication
kops.citation.bibtex	@inproceedings{Jang2024Gener-76428, title={Generalizable Sarcasm Detection is Just Around the Corner, of Course!}, year={2024}, doi={10.18653/v1/2024.naacl-long.238}, address={Stroudsburg, PA}, publisher={Association for Computational Linguistics ACL}, booktitle={Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)}, pages={4238--4249}, editor={Duh, Kevin and Gomez, Helena and Bethard, Steven}, author={Jang, Hyewon and Frassinelli, Diego} }
kops.citation.iso690	JANG, Hyewon, Diego FRASSINELLI, 2024. Generalizable Sarcasm Detection is Just Around the Corner, of Course!. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Mexico City, Mexico, 16. Juni 2024 - 21. Juni 2024. In: DUH, Kevin, Hrsg., Helena GOMEZ, Hrsg., Steven BETHARD, Hrsg.. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Stroudsburg, PA: Association for Computational Linguistics ACL, 2024, S. 4238-4249. Verfügbar unter: doi: 10.18653/v1/2024.naacl-long.238	deu
kops.citation.iso690	JANG, Hyewon, Diego FRASSINELLI, 2024. Generalizable Sarcasm Detection is Just Around the Corner, of Course!. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Mexico City, Mexico, Jun 16, 2024 - Jun 21, 2024. In: DUH, Kevin, ed., Helena GOMEZ, ed., Steven BETHARD, ed.. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Stroudsburg, PA: Association for Computational Linguistics ACL, 2024, pp. 4238-4249. Available under: doi: 10.18653/v1/2024.naacl-long.238	eng
kops.citation.rdf	<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/76428"> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dc:creator>Jang, Hyewon</dc:creator> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/> <dc:contributor>Jang, Hyewon</dc:contributor> <dc:contributor>Frassinelli, Diego</dc:contributor> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/> <dcterms:abstract>We tested the robustness of sarcasm detection models by examining their behavior when fine-tuned on four sarcasm datasets containing varying characteristics of sarcasm: label source (authors vs. third-party), domain (social media/online vs. offline conversations/dialogues), style (aggressive vs. humorous mocking). We tested their prediction performance on the same dataset (intra-dataset) and across different datasets (cross-dataset). For intra-dataset predictions, models consistently performed better when fine-tuned with third-party labels rather than with author labels. For cross-dataset predictions, most models failed to generalize well to the other datasets, implying that one type of dataset cannot represent all sorts of sarcasm with different styles and domains. Compared to the existing datasets, models fine-tuned on the new dataset we release in this work showed the highest generalizability to other datasets. 
With a manual inspection of the datasets and post-hoc analysis, we attributed the difficulty in generalization to the fact that sarcasm actually comes in different domains and styles. We argue that future sarcasm research should take the broad scope of sarcasm into account.</dcterms:abstract> <dc:creator>Frassinelli, Diego</dc:creator> <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/76428"/> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2026-03-03T09:07:01Z</dc:date> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dcterms:issued>2024</dcterms:issued> <dcterms:title>Generalizable Sarcasm Detection is Just Around the Corner, of Course!</dcterms:title> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2026-03-03T09:07:01Z</dcterms:available> <dc:language>eng</dc:language> </rdf:Description> </rdf:RDF>
kops.conferencefield	Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 16. Juni 2024 - 21. Juni 2024, Mexico City, Mexico	deu
kops.date.conferenceEnd	2024-06-21
kops.date.conferenceStart	2024-06-16
kops.flag.knbibliography	true
kops.location.conference	Mexico City, Mexico
kops.sourcefield	DUH, Kevin, Hrsg., Helena GOMEZ, Hrsg., Steven BETHARD, Hrsg.. <i>Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)</i>. Stroudsburg, PA: Association for Computational Linguistics ACL, 2024, S. 4238-4249. Verfügbar unter: doi: 10.18653/v1/2024.naacl-long.238	deu
kops.sourcefield.plain	DUH, Kevin, Hrsg., Helena GOMEZ, Hrsg., Steven BETHARD, Hrsg.. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Stroudsburg, PA: Association for Computational Linguistics ACL, 2024, S. 4238-4249. Verfügbar unter: doi: 10.18653/v1/2024.naacl-long.238	deu
kops.sourcefield.plain	DUH, Kevin, ed., Helena GOMEZ, ed., Steven BETHARD, ed.. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Stroudsburg, PA: Association for Computational Linguistics ACL, 2024, pp. 4238-4249. Available under: doi: 10.18653/v1/2024.naacl-long.238	eng
kops.title.conference	Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
relation.isAuthorOfPublication	bf0689a7-23f2-460a-8abb-42ea30bb2d29
relation.isAuthorOfPublication.latestForDiscovery	bf0689a7-23f2-460a-8abb-42ea30bb2d29
source.bibliographicInfo.fromPage	4238
source.bibliographicInfo.toPage	4249
source.contributor.editor	Duh, Kevin
source.contributor.editor	Gomez, Helena
source.contributor.editor	Bethard, Steven
source.publisher	Association for Computational Linguistics ACL
source.publisher.location	Stroudsburg, PA
source.title	Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)

Collections

Linguistik: Publikationen
