Datensatz:

DCASE 2021 Task 5: Few-shot Bioacoustic Event Detection Development Set

Lade...
Vorschaubild

Datum der Erstveröffentlichung

2021

Autor:innen

Morfi, Veronica
Stowell, Dan
Lostanlen, Vincent
Gill, Lisa
Pamula, Hanna
Benvent, David
Nolasco, Ines
Singh, Shubhr
Sridhar, Sripathi

Andere Beitragende

Repositorium der Erstveröffentlichung

Zenodo

Version des Datensatzes

Link zur Lizenz

Angaben zur Forschungsförderung

Projekt

Core Facility der Universität Konstanz
Bewerten Sie die FAIRness der Forschungsdaten

Gesperrt bis

Titel in einer weiteren Sprache

Publikationsstatus
Published

Zusammenfassung

General Description The development set for task 5 of DCASE 2021 "Few-shot Bioacoustic Event Detection" consists of 19 audio files acquired from different bioacoustic sources. The dataset is split into training and validation Sets. Multi-class annotations are provided for the training set with positive (POS), negative (NEG) and unkwown (UNK) values for each class. UNK indicates uncertainty about a class. Single-class (class of interest) annotations are provided for the validation set, with events marked as positive (POS) or unkwown (UNK) provided for the class of interest. Folder Structure Development_Set.zip |Development_Set/ |__Training_Set/ |BV/ |.wav |____.csv |HT/ |*.wav |*.csv |JD/ |*.wav |*.csv |MT/ |*.wav |*.csv |__Validation_Set/ |HV/ |*.wav |*.csv |PB/ |*.wav |___*.csv Development_Set_Audio.zip has the same structure but contains only the *.wav files. Development_Set_Annotations.zip has the same structure but contains only the *.csv files Dataset statistics Some statistics on this dataset are as follows, split between training and validation set and their sub-folders: -----------------------------------------------------
TRAINING SET
-----------------------------------------------------
Number of audio recordings | 11
Total duration | 14 hours and 20 mins
Total classes (excl. UNK) | 19
Total events (excl. UNK) | 4,686
-----------------------------------------------------
TRAINING SET/BV
-----------------------------------------------------
Number of audio recordings | 5
Total duration | 10 hours
Total classes (excl. UNK) | 11
Total events (excl. UNK) | 2,662
Sampling rate | 24,000 Hz
-----------------------------------------------------
TRAINING SET/HT
-----------------------------------------------------
Number of audio recordings | 3
Total duration | 3 hours
Total classes (excl. UNK) | 3
Total events (excl. UNK) | 435
Sampling rate | 6,000 Hz
-----------------------------------------------------
TRAINING SET/JD
-----------------------------------------------------
Number of audio recordings | 1
Total duration | 10 mins
Total classes (excl. UNK) | 1
Total events (excl. UNK) | 355
Sampling rate | 22,050 Hz
-----------------------------------------------------
TRAINING SET/MT
-----------------------------------------------------
Number of audio recordings | 2
Total duration | 1 hour and 10 mins
Total classes (excl. UNK) | 4
Total events (excl. UNK) | 1,234
Sampling rate | 8,000 Hz
-----------------------------------------------------
-----------------------------------------------------
VALIDATION SET
-----------------------------------------------------
Number of audio recordings | 8
Total duration | 5 hours
Total classes (excl. UNK) | 4
Total events (excl. UNK) | 310
-----------------------------------------------------
VALIDATION SET/HV
-----------------------------------------------------
Number of audio recordings | 2
Total duration | 2 hours
Total classes (excl. UNK) | 2
Total events (excl. UNK) | 50
Sampling rate | 6,000 Hz
-----------------------------------------------------
VALIDATION SET/PB
-----------------------------------------------------
Number of audio recordings | 6
Total duration | 3 hours
Total classes (excl. UNK) | 2
Total events (excl. UNK) | 260
Sampling rate | 44,100 Hz
----------------------------------------------------- Annotation structure Each line of the annotation csv represents an event in the audio file. The column descriptions are as follows: TRAINING SET
---------------------
Audiofilename, Starttime, Endtime, CLASS_1, CLASS_2, ...CLASS_N VALIDATION SET
---------------------
Audiofilename, Starttime, Endtime, Q Classes DCASE2021_task5_training_set_classes.csv and DCASE2021_task5_validation_set_classes.csv provide a table with class code correspondace to class name for all classes in the Development set. DCASE2021_task5_training_set_classes.csv
---------------------
dataset, class_code, class_name DCASE2021_task5_validation_set_classes.csv
---------------------
dataset, recording, class_code, class_name Evaluation Set The Evaluation set for the same task can be found at: https://doi.org/10.5281/zenodo.5413149 Open Access This dataset is available under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
Contact info Please send any feedback or questions to:
Veronica Morfi: g.v.morfi@qmul.ac.uk

Zusammenfassung in einer weiteren Sprache

Fachgebiet (DDC)
570 Biowissenschaften, Biologie

Schlagwörter

bioacoustics, few-shot learning, dcase2021, audio event detection

Zugehörige Publikationen in KOPS

Link zu zugehöriger Publikation
Link zu zugehörigem Datensatz

Zitieren

ISO 690MORFI, Veronica, Dan STOWELL, Vincent LOSTANLEN, Ariana STRANDBURG-PESHKIN, Lisa GILL, Hanna PAMULA, David BENVENT, Ines NOLASCO, Shubhr SINGH, Sripathi SRIDHAR, Mathieu DUTEIL, Andrew FARNSWORTH, 2021. DCASE 2021 Task 5: Few-shot Bioacoustic Event Detection Development Set
BibTex
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/73794">
    <dc:creator>Nolasco, Ines</dc:creator>
    <dc:contributor>Gill, Lisa</dc:contributor>
    <dc:creator>Pamula, Hanna</dc:creator>
    <dc:creator>Benvent, David</dc:creator>
    <dc:creator>Sridhar, Sripathi</dc:creator>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/71914"/>
    <dc:creator>Singh, Shubhr</dc:creator>
    <dc:contributor>Farnsworth, Andrew</dc:contributor>
    <dc:creator>Strandburg-Peshkin, Ariana</dc:creator>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/71914"/>
    <dc:contributor>Benvent, David</dc:contributor>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-07-03T10:51:56Z</dc:date>
    <dc:creator>Gill, Lisa</dc:creator>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-07-03T10:51:56Z</dcterms:available>
    <dcterms:title>DCASE 2021 Task 5: Few-shot Bioacoustic Event Detection Development Set</dcterms:title>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/73794"/>
    <dc:contributor>Strandburg-Peshkin, Ariana</dc:contributor>
    <dcterms:abstract>&lt;strong&gt;General Description&lt;/strong&gt; The development set for task 5 of DCASE 2021 "Few-shot Bioacoustic Event Detection" consists of 19 audio files acquired from different bioacoustic sources. The dataset is split into training and validation Sets. Multi-class annotations are provided for the training set with positive (POS), negative (NEG) and unkwown (UNK) values for each class. UNK indicates uncertainty about a class. Single-class (class of interest) annotations are provided for the validation set, with events marked as positive (POS) or unkwown (UNK) provided for the class of interest. &lt;strong&gt;Folder Structure&lt;/strong&gt; &lt;em&gt;Development_Set.zip&lt;/em&gt; |_Development_Set/ |__Training_Set/ |___BV/ |____*.wav |____*.csv |___HT/ |____*.wav |____*.csv |___JD/ |____*.wav |____*.csv |___MT/ |____*.wav |____*.csv |__Validation_Set/ |___HV/ |____*.wav |____*.csv |___PB/ |____*.wav |____*.csv &lt;em&gt;Development_Set_Audio.zip&lt;/em&gt; has the same structure but contains only the *.wav files. &lt;em&gt;Development_Set_Annotations.zip&lt;/em&gt; has the same structure but contains only the *.csv files &lt;strong&gt;Dataset statistics&lt;/strong&gt; Some statistics on this dataset are as follows, split between training and validation set and their sub-folders: -----------------------------------------------------&lt;br&gt; TRAINING SET&lt;br&gt; -----------------------------------------------------&lt;br&gt; Number of audio recordings | 11&lt;br&gt; Total duration | 14 hours and 20 mins&lt;br&gt; Total classes (excl. UNK) | 19&lt;br&gt; Total events (excl. UNK) | 4,686&lt;br&gt; -----------------------------------------------------&lt;br&gt; TRAINING SET/BV&lt;br&gt; -----------------------------------------------------&lt;br&gt; Number of audio recordings | 5&lt;br&gt; Total duration | 10 hours&lt;br&gt; Total classes (excl. UNK) | 11&lt;br&gt; Total events (excl. UNK) | 2,662&lt;br&gt; Sampling rate | 24,000 Hz&lt;br&gt; -----------------------------------------------------&lt;br&gt; TRAINING SET/HT&lt;br&gt; -----------------------------------------------------&lt;br&gt; Number of audio recordings | 3&lt;br&gt; Total duration | 3 hours&lt;br&gt; Total classes (excl. UNK) | 3&lt;br&gt; Total events (excl. UNK) | 435&lt;br&gt; Sampling rate | 6,000 Hz&lt;br&gt; -----------------------------------------------------&lt;br&gt; TRAINING SET/JD&lt;br&gt; -----------------------------------------------------&lt;br&gt; Number of audio recordings | 1&lt;br&gt; Total duration | 10 mins&lt;br&gt; Total classes (excl. UNK) | 1&lt;br&gt; Total events (excl. UNK) | 355&lt;br&gt; Sampling rate | 22,050 Hz&lt;br&gt; -----------------------------------------------------&lt;br&gt; TRAINING SET/MT&lt;br&gt; -----------------------------------------------------&lt;br&gt; Number of audio recordings | 2&lt;br&gt; Total duration | 1 hour and 10 mins&lt;br&gt; Total classes (excl. UNK) | 4&lt;br&gt; Total events (excl. UNK) | 1,234&lt;br&gt; Sampling rate | 8,000 Hz&lt;br&gt; ----------------------------------------------------- &lt;br&gt; -----------------------------------------------------&lt;br&gt; VALIDATION SET&lt;br&gt; -----------------------------------------------------&lt;br&gt; Number of audio recordings | 8&lt;br&gt; Total duration | 5 hours&lt;br&gt; Total classes (excl. UNK) | 4&lt;br&gt; Total events (excl. UNK) | 310&lt;br&gt; -----------------------------------------------------&lt;br&gt; VALIDATION SET/HV&lt;br&gt; -----------------------------------------------------&lt;br&gt; Number of audio recordings | 2&lt;br&gt; Total duration | 2 hours&lt;br&gt; Total classes (excl. UNK) | 2&lt;br&gt; Total events (excl. UNK) | 50&lt;br&gt; Sampling rate | 6,000 Hz&lt;br&gt; -----------------------------------------------------&lt;br&gt; VALIDATION SET/PB&lt;br&gt; -----------------------------------------------------&lt;br&gt; Number of audio recordings | 6&lt;br&gt; Total duration | 3 hours&lt;br&gt; Total classes (excl. UNK) | 2&lt;br&gt; Total events (excl. UNK) | 260&lt;br&gt; Sampling rate | 44,100 Hz&lt;br&gt; ----------------------------------------------------- &lt;strong&gt;Annotation structure&lt;/strong&gt; Each line of the annotation csv represents an event in the audio file. The column descriptions are as follows: TRAINING SET&lt;br&gt; ---------------------&lt;br&gt; Audiofilename, Starttime, Endtime, CLASS_1, CLASS_2, ...CLASS_N VALIDATION SET&lt;br&gt; ---------------------&lt;br&gt; Audiofilename, Starttime, Endtime, Q &lt;strong&gt;Classes&lt;/strong&gt; DCASE2021_task5_training_set_classes.csv and DCASE2021_task5_validation_set_classes.csv provide a table with class code correspondace to class name for all classes in the Development set. DCASE2021_task5_training_set_classes.csv&lt;br&gt; ---------------------&lt;br&gt; dataset, class_code, class_name DCASE2021_task5_validation_set_classes.csv&lt;br&gt; ---------------------&lt;br&gt; dataset, recording, class_code, class_name &lt;strong&gt;Evaluation Set&lt;/strong&gt; The Evaluation set for the same task can be found at: https://doi.org/10.5281/zenodo.5413149 &lt;strong&gt;Open Access&lt;/strong&gt; This dataset is available under a Creative Commons Attribution 4.0 International (CC BY 4.0) license. &lt;br&gt; &lt;strong&gt;Contact info&lt;/strong&gt; Please send any feedback or questions to:&lt;br&gt; Veronica Morfi: g.v.morfi@qmul.ac.uk&lt;br&gt;</dcterms:abstract>
    <dc:contributor>Pamula, Hanna</dc:contributor>
    <dc:contributor>Morfi, Veronica</dc:contributor>
    <dc:creator>Morfi, Veronica</dc:creator>
    <dc:contributor>Nolasco, Ines</dc:contributor>
    <dc:creator>Duteil, Mathieu</dc:creator>
    <dcterms:rights rdf:resource="https://creativecommons.org/licenses/by/4.0/legalcode"/>
    <dc:rights>Creative Commons Attribution 4.0 International</dc:rights>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:creator>Stowell, Dan</dc:creator>
    <dc:contributor>Sridhar, Sripathi</dc:contributor>
    <dc:creator>Lostanlen, Vincent</dc:creator>
    <dc:language>eng</dc:language>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:contributor>Lostanlen, Vincent</dc:contributor>
    <dc:contributor>Stowell, Dan</dc:contributor>
    <dc:contributor>Duteil, Mathieu</dc:contributor>
    <dc:contributor>Singh, Shubhr</dc:contributor>
    <dc:creator>Farnsworth, Andrew</dc:creator>
    <dcterms:issued>2021</dcterms:issued>
    <dcterms:created rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2021-02-19T10:33:15Z</dcterms:created>
  </rdf:Description>
</rdf:RDF>
URL (Link zu den Daten)

Prüfdatum der URL

Kommentar zur Publikation

Universitätsbibliographie
Ja
Diese Publikation teilen