Proceedings of the 10th Convention of the
European Acoustics Association
Forum Acusticum 2023


Politecnico di Torino
Torino, Italy
September 11 - 15, 2023





Session: A02-02: Open-source software and cutting-edge applications in bio-acoustics
Date: Thursday 14 September 2023
Time: 11:20 - 11:40
Title: Deep Learning on Small Datasets to Classify Mammalian Vocalizations
Author(s): R. Manriquez, Vrije Universiteit Brussel, Artificial Intelligence Lab, 1050 Brussel, Belgium
S. Kotz, Maastricht University, Department of Neuropsychology and Psychopharmacology, 6229 ER Maastricht, Netherlands
A. Ravignani, Max Planck Institute for Psycholinguistics, Comparative Bioacoustics Group, Wundtlaan 1, 6525 XD Nijmegen, Netherlands
B. De Boer, Vrije Universiteit Brussel, Artificial Intelligence Lab, 1050 Brussel, Belgium
Pages: 4687-4690
DOI: https://www.doi.org/10.61782/fa.2023.1052
PDF: https://dael.euracoustics.org/confs/fa2023/data/articles/001052.pdf
Conference proceedings
Abstract

Deep learning algorithms are increasingly used in many fields outside of artificial intelligence, including bioacoustics. Among many possible applications of deep learning to bioacoustics, typical ones include call identification, species recognition, and acoustic features classification. However, the implementation of deep learning algorithms is limited as bioacoustic databases are often rather small and thus lack sufficient data to properly train neural networks. Improper training leads to problems like overfitting and lack of generalization which, in turn, affect performance. Here, we address the most common challenges that bioacousticians face when training a deep neural network in a classification task. We present and explain useful techniques such as pre-training and data augmentation, and emphasize applying them in an efficient and meaningful way to not alter distinctive features or specific stimulus features such as fundamental frequency. We present an example application of these techniques in a classification task, where we perform species identification in a database of phylogenetically distant mammals, each with a limited number of calls. We aim at developing a general framework on how to apply deep learning algorithms to small- and larger-scale bioacoustic datasets.