SelvaBox: A high‑resolution dataset for tropical tree crown detection
Overview
Overall Novelty Assessment
The paper introduces SelvaBox, a large-scale annotated dataset containing over 83,000 manually labeled tropical tree crowns across three countries, positioned within the Training Data and Benchmark Datasets leaf of the taxonomy. This leaf contains only two papers, indicating a relatively sparse research direction focused on curating standardized resources for model development. The dataset contribution addresses a recognized bottleneck in tropical forest remote sensing, where annotated imagery remains scarce despite growing interest in automated crown detection methods.
The taxonomy reveals that SelvaBox sits within the broader Deep Learning-Based Detection Methods branch, which encompasses specialized architectures (one-stage detectors, instance segmentation networks), foundation model adaptations (SAM-based approaches, parameter-efficient fine-tuning), and active learning strategies. Neighboring branches include LiDAR-Based Detection and Photogrammetric Structure-from-Motion Approaches, which leverage three-dimensional data rather than RGB imagery alone. The dataset's multi-country scope and high-resolution focus distinguish it from species-specific detection efforts (e.g., palm identification) and ecological monitoring applications that track temporal dynamics rather than static crown delineation.
Among the 30 candidates examined, none clearly refute the three core contributions: the SelvaBox dataset itself (10 candidates examined, 0 refutable), the benchmark evaluation framework (10 examined, 0 refutable), and the multi-resolution training pipeline (10 examined, 0 refutable). The sibling paper ReforesTree addresses restoration contexts with potentially different annotation strategies, but the limited search scope prevents definitive claims about overlap. The statistics suggest that within the examined literature, no prior work provides a tropical crown dataset of comparable scale or geographic diversity, though the analysis does not cover exhaustive domain-specific repositories or unpublished institutional datasets.
Based on the top-30 semantic matches and taxonomy structure, the work appears to occupy a relatively underserved niche in tropical forest benchmarking. The sparse population of the Training Data leaf and absence of refuting candidates among examined papers suggest novelty in dataset scale and multi-country coverage. However, the limited search scope means the analysis cannot rule out overlapping efforts in specialized forestry journals, regional datasets, or concurrent preprints not captured by the semantic search strategy.
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors present SelvaBox, a large-scale dataset comprising over 83,000 manually annotated tree crown bounding boxes from high-resolution drone imagery across Brazil, Ecuador, and Panama. This dataset is an order of magnitude larger than all previous tropical forest datasets combined and addresses the critical scarcity of annotated tropical forest data.
The authors provide extensive benchmarks comparing CNN-based and transformer-based detection methods across multiple resolutions and input sizes. They introduce a raster-level evaluation metric (RF175) that addresses limitations of tile-level metrics and enables proper assessment of model performance on entire rasters rather than individual tiles.
The authors develop a multi-resolution training approach that enables models to generalize across different spatial resolutions and ground sampling distances. Their trained models achieve state-of-the-art performance on both tropical and non-tropical datasets, with competitive zero-shot detection on unseen datasets, and they release pre-trained weights and two open-source Python libraries for preprocessing and benchmarking.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
[9] ReforesTree: A dataset for estimating tropical forest carbon stock with deep learning and aerial imagery PDF
Contribution Analysis
Detailed comparisons for each claimed contribution
SelvaBox dataset for tropical tree crown detection
The authors present SelvaBox, a large-scale dataset comprising over 83,000 manually annotated tree crown bounding boxes from high-resolution drone imagery across Brazil, Ecuador, and Panama. This dataset is an order of magnitude larger than all previous tropical forest datasets combined and addresses the critical scarcity of annotated tropical forest data.
[5] ⦠and accurate monitoring of flowering across multiple tropical tree species over two years with a time series of high-resolution drone imagery and deep learning PDF
[8] Individual tree detection and species classification of Amazonian palms using UAV images and deep learning PDF
[28] Accurate tropical forest individual tree crown delineation from aerial RGB imagery using Mask R-CNN PDF
[51] Att-Mask R-CNN: an individual tree crown instance segmentation method based on fused attention mechanism PDF
[52] UC-HSI: UAV Based Crop Hyperspectral Imaging Datasets and Machine Learning Benchmark Results PDF
[53] Automated tree-crown and height detection in a young forest plantation using mask region-based convolutional neural network (Mask R-CNN) PDF
[54] A comparative assessment of the performance of individual tree crowns delineation algorithms from ALS data in tropical forests PDF
[55] Accurate delineation of individual tree crowns in tropical forests from aerial RGB imagery using Mask RâCNN PDF
[56] Individual Tree Crown Detection and Classification of Live and Dead Trees Using a Mask Region-Based Convolutional Neural Network (Mask R-CNN) PDF
[57] Deep learning based oil palm tree detection and counting for high-resolution remote sensing images PDF
Comprehensive benchmark and evaluation framework
The authors provide extensive benchmarks comparing CNN-based and transformer-based detection methods across multiple resolutions and input sizes. They introduce a raster-level evaluation metric (RF175) that addresses limitations of tile-level metrics and enables proper assessment of model performance on entire rasters rather than individual tiles.
[58] ⦠Detection of Pedestrian and Bicycle Lanes from High-Resolution Aerial Images by Integrating Image Processing and Artificial Intelligence (AI) Techniques PDF
[59] Leveraging SAM 2 and LiDAR for Automated Individual Tree Crown Delineation: A Comparative Evaluation of Prompting Methods PDF
[60] Comparison of object detection and patch-based classification deep learning models on mid-to late-season weed detection in UAV imagery PDF
[61] ⦠of Individual Tree Segmentation Methods in Mediterranean Forest Based on Point Clouds from Unmanned Aerial Vehicle Imagery and Low-Density Airborne ⦠PDF
[62] Deep Learning for Archaeological Object Detection on LiDAR: New Evaluation Measures and Insights PDF
[63] Building Detection and Outlining in Multi-Modal Remote Sensor Data: A Stratified Approach PDF
[64] TAF-YOLO: A Small-Object Detection Network for UAV Aerial Imagery via Visible and Infrared Adaptive Fusion PDF
[65] Improving the energy efficiency of real-time DNN object detection via compression, transfer learning, and scale prediction PDF
[66] Tracking Moose using Aerial Object Detection PDF
[67] Evaluating the Potential of Digital Aerial Photogrammetry (DAP) versus Airborne Laser Scanning (ALS) for Individual Tree Detection and Segmentation in Subtropical ⦠PDF
Multi-resolution training pipeline and state-of-the-art models
The authors develop a multi-resolution training approach that enables models to generalize across different spatial resolutions and ground sampling distances. Their trained models achieve state-of-the-art performance on both tropical and non-tropical datasets, with competitive zero-shot detection on unseen datasets, and they release pre-trained weights and two open-source Python libraries for preprocessing and benchmarking.