This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

DualStack: Multi-Resolution Spectrogram Fusion Improves Bird Sound Classification for Ecological Monitoring
Downloads
Authors
Abstract
Automated bird sound classification plays a critical role in biodiversity assessment, ecological monitoring, and conservation research. Many current approaches use single-resolution spectrograms, which fail to fully capture the multi-scale acoustic features of avian vocalizations. We present DualStack, a new method that vertically stacks high-resolution and low-resolution Mel spectrograms into a single image, allowing convolutional neural networks to jointly learn fine temporal and broad spectral patterns. Using a dataset of 967 recordings from 22 species sourced from Xeno-Canto, DualStack achieved 86.63% classification accuracy, outperforming both single-resolution baselines and a BiParallel ResNet18 multi-branch architecture. This method improves species identification accuracy while remaining applicable to real-time monitoring, supporting more effective conservation efforts and large-scale ecological studies.
DOI
https://doi.org/10.32942/X2TD2Z
Subjects
Engineering
Keywords
bird sound classification, bioacoustics, Biodiversity monitoring, ecological informatics, species identification, conservation technology, acoustic monitoring, spectrogram analysis, machine learning
Dates
Published: 2025-08-10 23:53
Last Updated: 2025-08-10 23:53
License
CC BY Attribution 4.0 International
Additional Metadata
Language:
English
There are no comments or no comments have been made public for this article.