This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

DrawerDissect: Whole-drawer insect imaging, segmentation, and transcription using AI
Downloads
Supplementary Files
Authors
Abstract
1. Many museums curate vast collections of insect specimens. These collections represent invaluable records of biodiversity information, ecological patterns and phenotypic variation. However, traditional methods of imaging specimens and digitizing their metadata are labor-intensive and risk damaging delicate specimens. High-throughput imaging of entire specimen drawers integrated with computer vision artificial intelligence (AI) models can provide a potential solution.
2. We present DrawerDissect, a python-based pipeline for processing high-resolution drawer photographs, and a workflow to use it in entomological collections. By using custom vision models trained in the platform Roboflow and LLM-based transcription with Claude, DrawerDissect can crop and segment specimens from images and extract metadata from specimen labels. DrawerDissect is flexible, tuneable and modular, allowing seamless integration with downstream analyses of phenotypic features (e.g. color, pattern, and size).
3. We validated Drawerdissect by digitizing the Field Museum of Natural History's (FMNH’s) entire tiger beetle (family Cicindelidae) collection, processing 13,484 specimens to generate high-resolution dorsal photographs, backgroundless specimen images, and basic body measurements. Geographic data were successfully extracted for 3,648 specimens. To demonstrate the utility of the masked images, we provide an example integration of DrawerDissect with existing image analysis methods in R and ImageJ. Finally, to show the research potential of high-quality specimen images, we trained a species identification model, Cicindel-ID, using ~7,000 images of specimens in the genus Cicindela.
4. DrawerDissect’s novel multi-model AI workflow provides an efficient and reproducible framework that meets the demands of high-throughput digitization of natural history museum collections, unlocking the potential of vast specimen collections for future analyses.
DOI
https://doi.org/10.32942/X2QW84
Subjects
Artificial Intelligence and Robotics, Biology, Ecology and Evolutionary Biology, Entomology, Life Sciences
Keywords
Artificial Intelligence, computer vision, Digitization, high-throughput imaging, insects, Image Segmentation, machine learning, museum specimens
Dates
Published: 2025-07-15 04:16
Last Updated: 2025-07-15 04:16
License
CC BY Attribution 4.0 International
Additional Metadata
Conflict of interest statement:
The authors report no conflicts of interest.
Data and Code Availability Statement:
The source code for DrawerDissect and the identification model Cicindel-ID are available at github.com/EGPostema/DrawerDissect and github.com/de-Medeiros-insect-lab/Cicindelinae_ID, respectively. All images and annotations used to train FMNH roboflow models can be found at universe.roboflow.com/field-museum. Training weights for Cicindel-ID are available at huggingface.co/brunoasm/eva02_large_patch14_448.Cicindela_ID_FMNH.
Language:
English
There are no comments or no comments have been made public for this article.