HEAR set: A ligHtwEight acoustic paRameters set to assess mental health from voice analysis

HEAR set: A ligHtwEight acoustic paRameters set to assess mental health from voice analysis

Journal Verde, Laura and Marulli, Fiammetta and De Fazio, Roberta and Campanile, Lelio and Marrone, Stefano — 2024 · Computers in Biology and Medicine

Venue & metadata

  • Journal/Proceedings: Computers in Biology and Medicine
  • Volume: 182
  • Note: Cited by: 1
  • Author keywords: Acoustic features set; HEAR set; Mental disorders; Signal processing; Voice analysis

Abstract

Background: Voice analysis has significant potential in aiding healthcare professionals with detecting, diagnosing, and personalising treatment. It represents an objective and non-intrusive tool for supporting the detection and monitoring of specific pathologies. By calculating various acoustic features, voice analysis extracts valuable information to assess voice quality. The choice of these parameters is crucial for an accurate assessment. Method: In this paper, we propose a lightweight acoustic parameter set, named HEAR, able to evaluate voice quality to assess mental health. In detail, this consists of jitter, spectral centroid, Mel-frequency cepstral coefficients, and their derivates. The choice of parameters for the proposed set was influenced by the explainable significance of each acoustic parameter in the voice production process. Results: The reliability of the proposed acoustic set to detect the early symptoms of mental disorders was evaluated in an experimental phase. Voices of subjects suffering from different mental pathologies, selected from available databases, were analysed. The performance obtained from the HEAR features was compared with that obtained by analysing features selected from toolkits widely used in the literature, as with those obtained using learned procedures. The best performance in terms of MAE and RMSE was achieved for the detection of depression (5.32 and 6.24 respectively). For the detection of psychogenic dysphonia and anxiety, the highest accuracy rates were about 75 % and 97 %, respectively. Conclusions: The comparative evaluation was carried out to assess the performance of the proposed approach, demonstrating a reliable capability to highlight affective physiological alterations of voice quality due to the considered mental disorders. © 2024 The Author(s)

Keywords

Acoustics GS Adult GS Female GS Humans GS Male GS Mental Disorders GS Mental Health GS Middle Aged GS Speech Acoustics GS Voice GS Voice Quality GS Acoustic variables measurement GS mHealth GS Acoustic feature set GS Acoustic features GS Acoustic parameters GS Features sets GS HEAR set GS Mental disorders GS Performance GS Signal-processing GS Voice analysis GS Voice quality GS acoustics GS anxiety GS Article GS climate change GS comparative study GS controlled study GS convolutional neural network GS electric potential GS emotional stress GS feature extraction GS health care personnel GS human GS major clinical study GS mental disease GS mental health GS mood change GS prosody GS reliability GS root mean squared error GS signal processing GS spectral centroid GS vocal cord GS voice GS voice analysis GS voice change GS acoustics GS adult GS diagnosis GS female GS male GS mental disease GS mental health GS middle aged GS pathophysiology GS physiology GS speech GS Personalized medicine GS

Links & artifacts

DOI Publisher

Suggested citation

Verde, L., Marulli, F., De Fazio, R., Campanile, L., & Marrone, S. (2024). HEAR set: A ligHtwEight acoustic paRameters set to assess mental health from voice analysis [Article]. Computers in Biology and Medicine, 182. https://doi.org/10.1016/j.compbiomed.2024.109021

← Back to Publications