A Comprehensive Psychoacoustic Approach to Voice Quality Perception

NIH RePORTER · NIH · R01 · $174,294 · view on reporter.nih.gov ↗

Abstract

PROJECT SUMMARY/ABSTRACT Voice disorders often lead to changes in voice quality noticed by patients, clinicians, and conversation partners. Assessment of voice quality is essential for diagnosis, and improvement in voice quality is a critical outcome of treatment. However, our knowledge of how voice quality is perceived is limited and the availability of robust measures of clinical outcome is even more limited. This has restricted our ability to accurately quantify or describe changes in quality, such as declines due to a disease or improvements resulting from treatment. This continuation project combines concepts and techniques from voice science, speech science, hearing science, and engineering to address these problems. The comprehensive approach to the proposed research simultaneously envelops the three primary VQ dimensions, embraces their covariance, improves measurement methods, and expands ecological validity through connected speech evaluation. The research proceeds by first obtaining high-precision measures of voice quality perception in the laboratory. These data are then used to develop mathematical models of voice quality perception that accurately reflect listeners’ data. To obtain a close match between human judgments of voice quality and model output, models of auditory processing are used to obtain an internal representation of the voice acoustic signal. Specific measures are then captured from this internal auditory representation and used to model the perception of voice quality. In the proposed work, these methods will be used to establish a comprehensive framework for understanding and measuring voice quality perception and to enable translation to clinical practice. The specific goals of this project are to: (1) develop a more complete understanding of the nature of VQ covariance (among breathy, rough, and strain) using natural dysphonic voices; (2) assess VQ in connected speech at micro (segmental) and macro (whole utterance) levels to better capture the impacts of rapid and complex transitions, co- articulatory and prosodic variations, and unique VQ signatures related to specific pathologies; (3) incorporate all of these components into a novel, three-dimensional VQ scaling procedure that is simple, intuitive, efficient enough to be used clinically, and has ratio-level measurement properties; (4) use highly-predictive computational models to overcome many of the limitations of acoustic analyses and perceptual evaluation; and (5) evaluate full range of psychometric properties considered in test design and evaluation including reliability, validity, sensitivity, and specificity. Perhaps most importantly, the goal is to develop an assessment that accurately indicates responsiveness to change (i.e., disorder progression, treatment). These parameters are considered essential for longitudinal assessment and outcome measurement. The feasibility of using these models and metrics in regular clinical assessment will be evaluated in ...

Key facts

NIH application ID
10577810
Project number
5R01DC009029-15
Recipient
UNIVERSITY OF SOUTH FLORIDA
Principal Investigator
David A. Eddins
Activity code
R01
Funding institute
NIH
Fiscal year
2023
Award amount
$174,294
Award type
5
Project period
2007-07-01 → 2023-06-30