Publikation: GlobalGeoTree : a multi-granular vision-language dataset for global tree species classification
Dateien
Datum
Autor:innen
Herausgeber:innen
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
URI (zitierfähiger Link)
DOI (zitierfähiger Link)
Internationale Patentnummer
Link zur Lizenz
Angaben zur Forschungsförderung
European Union (EU): 101130544
European Union (EU): 101101093
Institutionen der Bundesrepublik Deutschland: 50EE2201C
Institutionen der Bundesrepublik Deutschland: 67KI32002B
Projekt
Open Access-Veröffentlichung
Sammlungen
Core Facility der Universität Konstanz
Titel in einer weiteren Sprache
Publikationstyp
Publikationsstatus
Erschienen in
Zusammenfassung
Global tree species mapping using remote sensing data is vital for biodiversity monitoring, forest management, and ecological research. However, progress in this field has been constrained by the scarcity of large-scale, labeled datasets. To address this, we introduce GlobalGeoTree, a comprehensive global dataset for tree species classification. GlobalGeoTree comprises 6.3 million geolocated tree occurrences, spanning 275 families, 2734 genera, and 21 001 species across the hierarchical taxonomic levels. Each sample is paired with Sentinel-2 image time series and 27 auxiliary environmental variables, encompassing bioclimatic, geographic, and soil data. The dataset is partitioned into GlobalGeoTree-6M, a large subset for model pretraining, and curated evaluation subsets, primarily GlobalGeoTree-10kEval, a benchmark for zero-shot and few-shot classification. To demonstrate the utility of the dataset, we introduce a baseline model, GeoTreeCLIP, which leverages paired remote sensing data and taxonomic text labels within a vision-language framework pretrained on GlobalGeoTree-6M. Experimental results show that GeoTreeCLIP achieves substantial improvements in zero- and few-shot classification on GlobalGeoTree-10kEval over existing advanced models. By making the dataset, models, and code publicly available, we aim to establish a benchmark to advance tree species classification and foster innovation in biodiversity research and ecological applications. The code is publicly available at https://github.com/MUYang99/GlobalGeoTree (last access: 10 February 2026), and the GlobalGeoTree dataset is available at https://doi.org/10.15468/dd.9qxqyy (Mu et al., 2025b).
Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
Schlagwörter
Konferenz
Rezension
Zitieren
ISO 690
MU, Yang, Zhitong XIONG, Yi WANG, Muhammad SHAHZAD, Franz ESSL, Holger KREFT, Mark VAN KLEUNEN, Xiao Xiang ZHU, 2026. GlobalGeoTree : a multi-granular vision-language dataset for global tree species classification. In: Earth System Science Data. Copernicus. 2026, 18(2), S. 1379-1403. ISSN 1866-3508. eISSN 1866-3516. Verfügbar unter: doi: 10.5194/essd-18-1379-2026BibTex
@article{Mu2026-02-24Globa-76418,
title={GlobalGeoTree : a multi-granular vision-language dataset for global tree species classification},
year={2026},
doi={10.5194/essd-18-1379-2026},
number={2},
volume={18},
issn={1866-3508},
journal={Earth System Science Data},
pages={1379--1403},
author={Mu, Yang and Xiong, Zhitong and Wang, Yi and Shahzad, Muhammad and Essl, Franz and Kreft, Holger and van Kleunen, Mark and Zhu, Xiao Xiang}
}RDF
<rdf:RDF
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:bibo="http://purl.org/ontology/bibo/"
xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:void="http://rdfs.org/ns/void#"
xmlns:xsd="http://www.w3.org/2001/XMLSchema#" >
<rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/76418">
<dc:language>eng</dc:language>
<dc:contributor>Mu, Yang</dc:contributor>
<dc:creator>Shahzad, Muhammad</dc:creator>
<dc:contributor>Shahzad, Muhammad</dc:contributor>
<dc:contributor>Kreft, Holger</dc:contributor>
<dcterms:rights rdf:resource="http://creativecommons.org/licenses/by/4.0/"/>
<dc:contributor>Xiong, Zhitong</dc:contributor>
<dc:creator>Essl, Franz</dc:creator>
<dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/76418/1/Mu_2-1u5gad9ftyjht6.pdf"/>
<dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2026-03-02T14:16:06Z</dc:date>
<dcterms:title>GlobalGeoTree : a multi-granular vision-language dataset for global tree species classification</dcterms:title>
<dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/76418/1/Mu_2-1u5gad9ftyjht6.pdf"/>
<dc:contributor>Essl, Franz</dc:contributor>
<void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
<dc:creator>Wang, Yi</dc:creator>
<dc:contributor>Zhu, Xiao Xiang</dc:contributor>
<dc:contributor>van Kleunen, Mark</dc:contributor>
<dc:creator>Mu, Yang</dc:creator>
<dc:creator>Xiong, Zhitong</dc:creator>
<dcterms:abstract>Global tree species mapping using remote sensing data is vital for biodiversity monitoring, forest management, and ecological research. However, progress in this field has been constrained by the scarcity of large-scale, labeled datasets. To address this, we introduce GlobalGeoTree, a comprehensive global dataset for tree species classification. GlobalGeoTree comprises 6.3 million geolocated tree occurrences, spanning 275 families, 2734 genera, and 21 001 species across the hierarchical taxonomic levels. Each sample is paired with Sentinel-2 image time series and 27 auxiliary environmental variables, encompassing bioclimatic, geographic, and soil data. The dataset is partitioned into GlobalGeoTree-6M, a large subset for model pretraining, and curated evaluation subsets, primarily GlobalGeoTree-10kEval, a benchmark for zero-shot and few-shot classification. To demonstrate the utility of the dataset, we introduce a baseline model, GeoTreeCLIP, which leverages paired remote sensing data and taxonomic text labels within a vision-language framework pretrained on GlobalGeoTree-6M. Experimental results show that GeoTreeCLIP achieves substantial improvements in zero- and few-shot classification on GlobalGeoTree-10kEval over existing advanced models. By making the dataset, models, and code publicly available, we aim to establish a benchmark to advance tree species classification and foster innovation in biodiversity research and ecological applications. The code is publicly available at https://github.com/MUYang99/GlobalGeoTree (last access: 10 February 2026), and the GlobalGeoTree dataset is available at https://doi.org/10.15468/dd.9qxqyy (Mu et al., 2025b).</dcterms:abstract>
<dc:creator>Kreft, Holger</dc:creator>
<dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2026-03-02T14:16:06Z</dcterms:available>
<dc:rights>Attribution 4.0 International</dc:rights>
<dc:creator>Zhu, Xiao Xiang</dc:creator>
<dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/>
<foaf:homepage rdf:resource="http://localhost:8080/"/>
<dc:contributor>Wang, Yi</dc:contributor>
<bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/76418"/>
<dc:creator>van Kleunen, Mark</dc:creator>
<dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/>
<dcterms:issued>2026-02-24</dcterms:issued>
</rdf:Description>
</rdf:RDF>