HEAR set: A ligHtwEight acoustic paRameters set to assess mental health from voice analysis

Journal Laura Verde, Fiammetta Marulli, Roberta De Fazio, Lelio Campanile, Stefano Marrone — 2024 · Computers in Biology and Medicine

Venue & metadata

  • Journal/Proceedings: Computers in Biology and Medicine
  • Volume: 182
  • Note: Cited by: 1
  • Author keywords: Acoustic features set; HEAR set; Mental disorders; Signal processing; Voice analysis

Abstract

Background: Voice analysis has significant potential in aiding healthcare professionals with detecting, diagnosing, and personalising treatment. It represents an objective and non-intrusive tool for supporting the detection and monitoring of specific pathologies. By calculating various acoustic features, voice analysis extracts valuable information to assess voice quality. The choice of these parameters is crucial for an accurate assessment.

Method: In this paper, we propose a lightweight acoustic parameter set, named HEAR, able to evaluate voice quality to assess mental health. In detail, it consists of jitter, spectral centroid, Mel-frequency cepstral coefficients, and their derivatives. The choice of parameters for the proposed set was influenced by the explainable significance of each acoustic parameter in the voice production process.

Results: The reliability of the proposed acoustic set in detecting the early symptoms of mental disorders was evaluated in an experimental phase. Voices of subjects suffering from different mental pathologies, selected from available databases, were analysed. The performance obtained from the HEAR features was compared with that obtained by analysing features selected from toolkits widely used in the literature, as well as with that obtained using learned procedures. The best performance in terms of MAE and RMSE was achieved for the detection of depression (5.32 and 6.24, respectively). For the detection of psychogenic dysphonia and anxiety, the highest accuracy rates were about 75% and 97%, respectively.

Conclusions: The comparative evaluation was carried out to assess the performance of the proposed approach, demonstrating a reliable capability to highlight affective physiological alterations of voice quality due to the considered mental disorders. © 2024 The Author(s)
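To make the quantities named in the abstract concrete, here is a minimal NumPy sketch of three of them (spectral centroid, local jitter, and a simple frame-to-frame derivative) together with the MAE and RMSE metrics. It is an illustration under stated assumptions, not the authors' implementation: the function names, the Hann window, the "local jitter" definition (mean absolute difference of consecutive pitch periods divided by the mean period), and the use of `np.gradient` as the derivative are all choices made here. Mel-frequency cepstral coefficients are omitted for brevity, and all input values are synthetic, not data from the paper.

```python
import numpy as np


def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency (Hz) of one audio frame."""
    frame = np.asarray(frame, dtype=float)
    windowed = frame * np.hanning(len(frame))  # Hann window: an assumption
    magnitude = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    total = magnitude.sum()
    if total == 0.0:
        return 0.0  # silent frame: centroid is undefined, return 0 by convention
    return float((freqs * magnitude).sum() / total)


def local_jitter(periods):
    """Mean absolute difference of consecutive pitch periods over the mean period."""
    periods = np.asarray(periods, dtype=float)
    if periods.size < 2:
        raise ValueError("need at least two pitch periods")
    return float(np.mean(np.abs(np.diff(periods))) / np.mean(periods))


def delta(features):
    """Simple derivative along the frame axis (central differences).

    Toolkits often use a regression over several frames instead; this is
    only a stand-in for the 'derivatives' mentioned in the abstract.
    """
    return np.gradient(np.asarray(features, dtype=float), axis=0)


def mae(y_true, y_pred):
    """Mean absolute error."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return float(np.mean(np.abs(y_true - y_pred)))


def rmse(y_true, y_pred):
    """Root mean squared error."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


# Synthetic demo: a 1 kHz tone sampled at 16 kHz, and made-up pitch periods (s).
sr = 16000
t = np.arange(1024) / sr
tone = np.sin(2 * np.pi * 1000.0 * t)
print("centroid (Hz):", spectral_centroid(tone, sr))
print("local jitter:", local_jitter([0.010, 0.011, 0.010, 0.011]))
print("MAE, RMSE:", mae([10, 12, 8], [12, 9, 8]), rmse([10, 12, 8], [12, 9, 8]))
```

In practice the pitch periods passed to `local_jitter` would come from a pitch tracker applied to sustained voiced speech, which this sketch does not cover.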

Keywords

Acoustics; Adult; Female; Humans; Male; Mental Disorders; Mental Health; Middle Aged; Speech Acoustics; Voice; Voice Quality; Acoustic variables measurement; mHealth; Acoustic feature set; Acoustic features; Acoustic parameters; Features sets; HEAR set; Performance; Signal-processing; Voice analysis; anxiety; Article; climate change; comparative study; controlled study; convolutional neural network; electric potential; emotional stress; feature extraction; health care personnel; human; major clinical study; mental disease; mood change; prosody; reliability; root mean squared error; signal processing; spectral centroid; vocal cord; voice change; diagnosis; pathophysiology; physiology; speech; Personalized medicine

Suggested citation

Verde, L., Marulli, F., De Fazio, R., Campanile, L., & Marrone, S. (2024). HEAR set: A ligHtwEight acoustic paRameters set to assess mental health from voice analysis [Article]. Computers in Biology and Medicine, 182. https://doi.org/10.1016/j.compbiomed.2024.109021
