Proceedings of the 10th Convention of the
European Acoustics Association
Forum Acusticum 2023


Politecnico di Torino
Torino, Italy
September 11 - 15, 2023





Session: Poster 4 (ghost)
Date: Friday 15 September 2023
Time: 16:20 - 17:00
Title: Blind modelling of speech intelligibility and listening effort in binaural listening conditions
Author(s): T. Brand, Universität Oldenburg, Medizinische Physik, Cluster of Excellence Hearing4all, 26129 Oldenburg, Germany
S. Röttges, Universität Oldenburg, Medizinische Physik, Cluster of Excellence Hearing4all, 26129 Oldenburg, Germany
C.F. Hauth, Universität Oldenburg, Medizinische Physik, Cluster of Excellence Hearing4all, 26129 Oldenburg, Germany
J. Rossbach, Universität Oldenburg, Communication Acoustics, Cluster of Excellence Hearing4all, 26129 Oldenburg, Germany
R. Huber, Fraunhofer IDMT, Hearing, Speech and Audio Technology, Marie-Curie-Str. 2, Cluster of Excellence Hearing4all, 26129 Oldenburg, Germany
B. Meyer, Universität Oldenburg, Communication Acoustics, Cluster of Excellence Hearing4all, 26129 Oldenburg, Germany
J. Rennies, Universität Oldenburg, Communication Acoustics, Cluster of Excellence Hearing4all, 26129 Oldenburg, Germany
Pages: 6419-6420
DOI: https://www.doi.org/10.61782/fa.2023.0470
PDF: https://dael.euracoustics.org/confs/fa2023/data/articles/000470.pdf
Conference proceedings
Abstract

This study evaluates a blind Binaural Speech Intelligibility Model (bBSIM) that predicts speech intelligibility (SI) and listening effort (LE) for spatially separated target speech and noise interferers. The model applies a blind binaural front-end, which assumes that the interfering noise has different modulation characteristics than speech and which predicts the effects of better-ear-listening and binaural unmasking. This binaural front-end is combined with two different blind back-ends that estimate SI and/or LE based on the output signal of the binaural front-end without any further auxiliary information. These blind back-ends are the non-intrusive Short Time Objective Intelligibility measure (NI-STOI) and an automatic triphone recognizer with an a posteriori estimate of the certainty of the recognized triphones. Even a short-term version is available which can be applied to arbitrary unknown speech and which updates SI and LE estimates in close to real time. This offers new fields of application, e.g., in hearing instruments. Therefore, the blind real-time BSIM has been implemented on a research hearing aid, where it continuously estimates SI and LE. These estimates can be used, for instance, for informing the selection of hearing aid processing strategies.