Proceedings of the 10th Convention of the
European Acoustics Association
Forum Acusticum 2023


Politecnico di Torino
Torino, Italy
September 11 - 15, 2023





Session: A11-04: Personalisation in Spatial Audio Technologies - Part I
Date: Tuesday 12 September 2023
Time: 17:20 - 17:40
Title: Exploring the impact of transfer learning on GAN-based HRTF upsampling
Author(s): A. Hogg, Imperial College London, Dyson School of Design Engineering, Imperial College Rd, South Kensington, SW7 2DB London, UK
H. Liu, Imperial College London, Dyson School of Design Engineering, Imperial College Rd, South Kensington, SW7 2DB London, UK
M. Jenkins, Imperial College London, Dyson School of Design Engineering, Imperial College Rd, South Kensington, SW7 2DB London, UK
L. Picinali, Imperial College London, Dyson School of Design Engineering, Imperial College Rd, South Kensington, SW7 2DB London, UK
Pages: 2323-2328
DOI: https://www.doi.org/10.61782/fa.2023.0266
PDF: https://dael.euracoustics.org/confs/fa2023/data/articles/000266.pdf
Conference proceedings
Abstract

Individualised head-related transfer functions (HRTFs) are essential for creating realistic virtual reality (VR) and augmented reality (AR) environments and interactions. Performing acoustic measurements is the most accurate way to capture these individualised HRTFs. However, one of the main challenges is acoustically capturing high-quality HRTFs without the need for expensive equipment and a lab- controlled setting. To make these measurements more feasible on a large scale, HRTF upsampling has been exploited in the past, where a high-resolution HRTF is created from a low-resolution measurement. However, as the world shifts to more data-driven methods, upsampling HRTFs using machine learning (ML) has become more prevalent. The main limitation is the lack of HRTF data available for model training. This paper explores the use of transfer learning (TL) on a new synthetic HRTF dataset generated from three-dimensional (3D) meshes using a parametric pina model, and the performance improvements that can be achieved. The performance is also compared against using a small acoustically measured dataset for TL, with the aim to start answering the question: ‘Is it better to have more data, or is it better for the data to be of higher quality?’.