Publikation:

Collective moderation of hate, toxicity, and extremity in online discussions

Lade...
Vorschaubild

Dateien

Zu diesem Dokument gibt es keine Dateien.

Datum

2025

Autor:innen

Lasser, Jana
Herderich, Alina
Garland, Joshua
Aroyehun, Segun Taofeek
Galesic, Mirta

Herausgeber:innen

Kontakt

ISSN der Zeitschrift

Electronic ISSN

ISBN

Bibliografische Daten

Verlag

Schriftenreihe

Auflagebezeichnung

URI (zitierfähiger Link)
ArXiv-ID

Internationale Patentnummer

Angaben zur Forschungsförderung

U.S. National Science Foundation (NSF): 1757211
European Union (EU): 101020961
European Union (EU): 101140741

Projekt

Open Access-Veröffentlichung
Open Access Gold
Core Facility der Universität Konstanz

Gesperrt bis

Titel in einer weiteren Sprache

Publikationstyp
Zeitschriftenartikel
Publikationsstatus
Published

Erschienen in

PNAS Nexus. Oxford University Press (OUP). 2025, 4(11), pgaf369. eISSN 2752-6542. Verfügbar unter: doi: 10.1093/pnasnexus/pgaf369

Zusammenfassung

In the digital age, hate speech poses a threat to the functioning of social media platforms as spaces for public discourse. Top-down approaches to moderate hate speech encounter difficulties due to conflicts with freedom of expression and issues of scalability. Counter speech, a form of collective moderation by citizens, has emerged as a potential remedy. Here, we aim to investigate which counter speech strategies are most effective in reducing the prevalence of hate, toxicity, and extremity on online platforms. We analyze more than 130,000 discussions on German Twitter starting at the peak of the migrant crisis in 2015 and extending over 4 years. We use human annotation and machine learning classifiers to identify argumentation strategies, ingroup and outgroup references, emotional tone, and different measures of discourse quality. Using matching and time-series analyses we discern the effectiveness of naturally observed counter speech strategies on the microlevel (individual tweet pairs), mesolevel (entire discussions) and macrolevel (over days). We find that expressing straightforward opinions, even if not factual but devoid of insults, results in the least subsequent hate, toxicity, and extremity over all levels of analyses. This strategy complements currently recommended counter speech strategies and is easy for citizens to engage in. Sarcasm can also be effective in improving discourse quality, especially in the presence of organized extreme groups. Going beyond one-shot analyses on smaller samples prevalent in most prior studies, our findings have implications for the successful management of public online spaces through collective civic moderation.

Zusammenfassung in einer weiteren Sprache

Fachgebiet (DDC)
320 Politik

Schlagwörter

collective moderation, counter speech, outgroup thinking, emotion, political discussions

Konferenz

Rezension
undefined / . - undefined, undefined

Forschungsvorhaben

Organisationseinheiten

Zeitschriftenheft

Zugehörige Datensätze in KOPS

Zitieren

ISO 690LASSER, Jana, Alina HERDERICH, Joshua GARLAND, Segun Taofeek AROYEHUN, David GARCIA, Mirta GALESIC, 2025. Collective moderation of hate, toxicity, and extremity in online discussions. In: PNAS Nexus. Oxford University Press (OUP). 2025, 4(11), pgaf369. eISSN 2752-6542. Verfügbar unter: doi: 10.1093/pnasnexus/pgaf369
BibTex
@article{Lasser2025-10-31Colle-75570,
  title={Collective moderation of hate, toxicity, and extremity in online discussions},
  year={2025},
  doi={10.1093/pnasnexus/pgaf369},
  number={11},
  volume={4},
  journal={PNAS Nexus},
  author={Lasser, Jana and Herderich, Alina and Garland, Joshua and Aroyehun, Segun Taofeek and Garcia, David and Galesic, Mirta},
  note={Article Number: pgaf369}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/75570">
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:contributor>Garland, Joshua</dc:contributor>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/75570"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Galesic, Mirta</dc:contributor>
    <dc:contributor>Garcia, David</dc:contributor>
    <dc:contributor>Aroyehun, Segun Taofeek</dc:contributor>
    <dc:creator>Lasser, Jana</dc:creator>
    <dcterms:issued>2025-10-31</dcterms:issued>
    <dc:creator>Garland, Joshua</dc:creator>
    <dc:creator>Herderich, Alina</dc:creator>
    <dc:rights>Attribution 4.0 International</dc:rights>
    <dc:contributor>Herderich, Alina</dc:contributor>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/42"/>
    <dc:creator>Galesic, Mirta</dc:creator>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-12-19T12:07:03Z</dcterms:available>
    <dc:language>eng</dc:language>
    <dc:creator>Garcia, David</dc:creator>
    <dcterms:title>Collective moderation of hate, toxicity, and extremity in online discussions</dcterms:title>
    <dcterms:abstract>In the digital age, hate speech poses a threat to the functioning of social media platforms as spaces for public discourse. Top-down approaches to moderate hate speech encounter difficulties due to conflicts with freedom of expression and issues of scalability. Counter speech, a form of collective moderation by citizens, has emerged as a potential remedy. Here, we aim to investigate which counter speech strategies are most effective in reducing the prevalence of hate, toxicity, and extremity on online platforms. We analyze more than 130,000 discussions on German Twitter starting at the peak of the migrant crisis in 2015 and extending over 4 years. We use human annotation and machine learning classifiers to identify argumentation strategies, ingroup and outgroup references, emotional tone, and different measures of discourse quality. Using matching and time-series analyses we discern the effectiveness of naturally observed counter speech strategies on the microlevel (individual tweet pairs), mesolevel (entire discussions) and macrolevel (over days). We find that expressing straightforward opinions, even if not factual but devoid of insults, results in the least subsequent hate, toxicity, and extremity over all levels of analyses. This strategy complements currently recommended counter speech strategies and is easy for citizens to engage in. Sarcasm can also be effective in improving discourse quality, especially in the presence of organized extreme groups. Going beyond one-shot analyses on smaller samples prevalent in most prior studies, our findings have implications for the successful management of public online spaces through collective civic moderation.</dcterms:abstract>
    <dc:creator>Aroyehun, Segun Taofeek</dc:creator>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-12-19T12:07:03Z</dc:date>
    <dc:contributor>Lasser, Jana</dc:contributor>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/42"/>
    <dcterms:rights rdf:resource="http://creativecommons.org/licenses/by/4.0/"/>
  </rdf:Description>
</rdf:RDF>

Interner Vermerk

xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter

Kontakt
URL der Originalveröffentl.

Prüfdatum der URL

Prüfungsdatum der Dissertation

Finanzierungsart

Kommentar zur Publikation

Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Ja
Begutachtet
Ja
Diese Publikation teilen