DualStack: Multi-Resolution Spectrogram Fusion Improves Bird Sound Classification for Ecological Monitoring

Chitrang Jayeshkumar Patel; Tanishka Gupta; Sapan Mankad

DualStack: Multi-Resolution Spectrogram Fusion Improves Bird Sound Classification for Ecological Monitoring

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.

Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Chitrang Jayeshkumar Patel, Tanishka Gupta, Sapan Mankad

Abstract

Automated bird sound classification plays a critical role in biodiversity assessment, ecological monitoring, and conservation research. Many current approaches use single-resolution spectrograms, which fail to fully capture the multi-scale acoustic features of avian vocalizations. We present DualStack, a new method that vertically stacks high-resolution and low-resolution Mel spectrograms into a single image, allowing convolutional neural networks to jointly learn fine temporal and broad spectral patterns. Using a dataset of 967 recordings from 22 species sourced from Xeno-Canto, DualStack achieved 86.63% classification accuracy, outperforming both single-resolution baselines and a BiParallel ResNet18 multi-branch architecture. This method improves species identification accuracy while remaining applicable to real-time monitoring, supporting more effective conservation efforts and large-scale ecological studies.

DOI

https://doi.org/10.32942/X2TD2Z

Subjects

Engineering

Keywords

bird sound classification, bioacoustics, Biodiversity monitoring, ecological informatics, species identification, conservation technology, acoustic monitoring, spectrogram analysis, machine learning

Dates

Published: 2025-08-11 02:53

Last Updated: 2025-08-11 02:53

License

CC BY Attribution 4.0 International

Additional Metadata

Language:
English