Skip to main content
DrawerDissect: Whole-drawer insect imaging, segmentation, and transcription using AI

DrawerDissect: Whole-drawer insect imaging, segmentation, and transcription using AI

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Supplementary Files

Authors

Elizabeth G Postema, Leah Briscoe, Chloe Harder, George R. A. Hancock, Lucy D. Guarnieri, Tony Eisel, Kelton Welch, Nicole Fischer, Christine Johnson, Diego Souza, Dexter Phillip, Rebekah Baquiran, Tatiana Sepulveda, Bruno A S de Medeiros 

Abstract

1. Many museums curate vast collections of insect specimens. These collections represent invaluable records of biodiversity information, ecological patterns and phenotypic variation. However, traditional methods of imaging specimens and digitizing their metadata are labor-intensive and risk damaging delicate specimens. High-throughput imaging of entire specimen drawers integrated with computer vision artificial intelligence (AI) models can provide a potential solution.

2. We present DrawerDissect, a python-based pipeline for processing high-resolution drawer photographs, and a workflow to use it in entomological collections. By using custom vision models trained in the platform Roboflow and LLM-based transcription with Claude, DrawerDissect can crop and segment specimens from images and extract metadata from specimen labels. DrawerDissect is flexible, tuneable and modular, allowing seamless integration with downstream analyses of phenotypic features (e.g. color, pattern, and size).

3. We validated Drawerdissect by digitizing the Field Museum of Natural History's (FMNH’s) entire tiger beetle (family Cicindelidae) collection, processing 13,484 specimens to generate high-resolution dorsal photographs, backgroundless specimen images, and basic body measurements. Geographic data were successfully extracted for 3,648 specimens. To demonstrate the utility of the masked images, we provide an example integration of DrawerDissect with existing image analysis methods in R and ImageJ. Finally, to show the research potential of high-quality specimen images, we trained a species identification model, Cicindel-ID, using ~7,000 images of specimens in the genus Cicindela.

4. DrawerDissect’s novel multi-model AI workflow provides an efficient and reproducible framework that meets the demands of high-throughput digitization of natural history museum collections, unlocking the potential of vast specimen collections for future analyses.

DOI

https://doi.org/10.32942/X2QW84

Subjects

Artificial Intelligence and Robotics, Biology, Ecology and Evolutionary Biology, Entomology, Life Sciences

Keywords

Artificial Intelligence, computer vision, Digitization, high-throughput imaging, insects, Image Segmentation, machine learning, museum specimens

Dates

Published: 2025-07-15 04:16

Last Updated: 2025-07-15 04:16

License

CC BY Attribution 4.0 International

Additional Metadata

Conflict of interest statement:
The authors report no conflicts of interest.

Data and Code Availability Statement:
The source code for DrawerDissect and the identification model Cicindel-ID are available at github.com/EGPostema/DrawerDissect and github.com/de-Medeiros-insect-lab/Cicindelinae_ID, respectively. All images and annotations used to train FMNH roboflow models can be found at universe.roboflow.com/field-museum. Training weights for Cicindel-ID are available at huggingface.co/brunoasm/eva02_large_patch14_448.Cicindela_ID_FMNH.

Language:
English