Wikipedia Edit Event Data 2021 (WikiEvent.2021)

dc.contributor.author: Lerner, Jürgen
dc.date.accessioned: 2025-07-03T10:45:46Z
dc.date.available: 2025-07-03T10:45:46Z
dc.date.created: 2021-02-09T09:28:05Z
dc.date.issued: 2021
dc.description.abstract: The "Wikipedia Edit Event Data 2021 (WikiEvent.2021)" gives the time, user name, and article title of every edit that any registered and logged-in Wikipedia user performed on any article in the English-language edition of Wikipedia from January 15th, 2001 (the launch of Wikipedia) to January 2021. This dataset extends the older version WikiEvent.2018 (https://zenodo.org/record/1626323). The edit event data has been extracted from the file 'enwiki-20210101-stub-meta-history.xml.gz', which was at that time linked from 'https://dumps.wikimedia.org/enwiki/20210101/'. These files are deleted some months after data collection; however, the information is still available in any file 'enwiki-<date>-stub-meta-history.xml.gz' where <date> is 20210101 or later. These data are provided by the Wikimedia Foundation, licensed under the GNU Free Documentation License (GFDL) and the Creative Commons Attribution-Share-Alike 3.0 License. The Wikipedia Edit Event Data 2021 comprises the file ('WikiEvent.2021.csv') giving a table with 3 columns and more than 450 million rows in CSV format. The cell delimiter is a semicolon (';') and strings are quoted with double quotes ('"'). The table has a header in the first row, and the three columns are labeled 'time', 'user', and 'article', respectively. The uncompressed size of the file is about 23 GB. How to analyze the WikiEvent data with relational event models is explained in the eventnet tutorial at: https://github.com/juergenlerner/eventnet/wiki/Large-event-networks-(tutorial).
dc.description.version: published [deu]
dc.identifier.doi: 10.5281/zenodo.4522066
dc.identifier.uri: https://kops.uni-konstanz.de/handle/123456789/73793
dc.language.iso: eng
dc.rights: Creative Commons Attribution 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/legalcode
dc.subject: Wikipedia
dc.subject: online peer-production
dc.subject: social networks
dc.subject: dynamic networks
dc.subject: relational event networks
dc.subject.ddc: 004
dc.title: Wikipedia Edit Event Data 2021 (WikiEvent.2021) [eng]
dspace.entity.type: Dataset
kops.citation.bibtex:
kops.citation.iso690: LERNER, Jürgen, 2021. Wikipedia Edit Event Data 2021 (WikiEvent.2021) [deu]
kops.citation.iso690: LERNER, Jürgen, 2021. Wikipedia Edit Event Data 2021 (WikiEvent.2021) [eng]
kops.citation.rdf:
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/73793">
    <dc:language>eng</dc:language>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/71925"/>
    <dc:rights>Creative Commons Attribution 4.0 International</dc:rights>
    <dcterms:title>Wikipedia Edit Event Data 2021 (WikiEvent.2021)</dcterms:title>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/73793"/>
    <dc:creator>Lerner, Jürgen</dc:creator>
    <dcterms:abstract>The "Wikipedia Edit Event Data 2021 (WikiEvent.2021)" gives the time, user name, and article title of every edit that any registered and logged-in Wikipedia user performed on any article in the English-language edition of Wikipedia from January 15th, 2001 (the launch of Wikipedia) to January 2021. 
This dataset extends the older version WikiEvent.2018 (https://zenodo.org/record/1626323). 
The edit event data has been extracted from the file 'enwiki-20210101-stub-meta-history.xml.gz', which was at that time linked from 'https://dumps.wikimedia.org/enwiki/20210101/'. These files are deleted some months after data collection; however, the information is still available in any file 'enwiki-&lt;date&gt;-stub-meta-history.xml.gz' where &lt;date&gt; is 20210101 or later. These data are provided by the Wikimedia Foundation, licensed under the GNU Free Documentation License (GFDL) and the Creative Commons Attribution-Share-Alike 3.0 License. The Wikipedia Edit Event Data 2021 comprises the file ('WikiEvent.2021.csv') giving a table with 3 columns and more than 450 million rows in CSV format. The cell delimiter is a semicolon (';') and strings are quoted with double quotes ('"'). The table has a header in the first row, and the three columns are labeled 'time', 'user', and 'article', respectively. The uncompressed size of the file is about 23 GB. 

How to analyze the WikiEvent Data with relational event models is explained in the eventnet tutorial at: https://github.com/juergenlerner/eventnet/wiki/Large-event-networks-(tutorial).</dcterms:abstract>
    <dcterms:rights rdf:resource="https://creativecommons.org/licenses/by/4.0/legalcode"/>
    <dcterms:issued>2021</dcterms:issued>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-07-03T10:45:46Z</dcterms:available>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dcterms:created rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-02-09T09:28:05Z</dcterms:created>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-07-03T10:45:46Z</dc:date>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Lerner, Jürgen</dc:contributor>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/71925"/>
  </rdf:Description>
</rdf:RDF>
kops.datacite.repository: Zenodo
kops.flag.knbibliography: true
relation.isAuthorOfDataset: 90913c2c-3951-48c7-b33f-891bad2abfc1
relation.isAuthorOfDataset.latestForDiscovery: 90913c2c-3951-48c7-b33f-891bad2abfc1
relation.isPublicationOfDataset: 1aeec9f9-dd42-4412-90be-adf807b2ded2
relation.isPublicationOfDataset.latestForDiscovery: 1aeec9f9-dd42-4412-90be-adf807b2ded2
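
The abstract describes the CSV layout as semicolon-delimited, with double-quoted strings and a header row naming the columns 'time', 'user', and 'article'. Since the uncompressed file is about 23 GB, it is probably easier to stream it row by row than to load it into memory at once. The following is a minimal sketch of that idea using only the Python standard library. The helper name `count_edits_per_user` is hypothetical, the sample rows are made up for illustration (they are not taken from the dataset), and the format of the 'time' values is an assumption, so the sketch does not parse them.

```python
import csv
import io
from collections import Counter


def count_edits_per_user(lines):
    """Stream semicolon-delimited rows and count edits per user name.

    `lines` is any iterable of text lines, e.g. an open file object.
    Only the running counts are kept in memory, not the rows themselves.
    """
    reader = csv.DictReader(lines, delimiter=";", quotechar='"')
    counts = Counter()
    for row in reader:
        counts[row["user"]] += 1
    return counts


# Made-up sample in the layout described in the abstract (NOT real data).
# The numeric 'time' values are placeholders; the real format is not assumed.
SAMPLE = (
    '"time";"user";"article"\n'
    '1000;"Alice";"Graph theory"\n'
    '1005;"Bob";"Graph theory"\n'
    '1010;"Alice";"Semi;colon"\n'  # quoted title containing the delimiter
)

sample_counts = count_edits_per_user(io.StringIO(SAMPLE))
print(sample_counts.most_common())

# For the real file one might write (UTF-8 encoding is an assumption):
# with open("WikiEvent.2021.csv", newline="", encoding="utf-8") as f:
#     counts = count_edits_per_user(f)
```

Passing `quotechar='"'` explicitly mirrors the quoting described in the abstract, so an article title that itself contains a semicolon stays in one cell. With roughly 450 million rows, a pure-Python pass like this will be slow; it is meant to show the file layout, not to be an efficient analysis pipeline.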

Files