posted on 2021-04-22, 22:25authored byBoquan Liu, Hayley Raj, Logan Klein, Jack J. Jiang
Purpose: The excised canine larynx provides an advantageous experimental framework in the study of voice physiology. In recent years, signal processing methods have been applied to analyze phonations in excised canine larynx experiments. However, phonations have a highly complex and nonstationary nature corresponding to different proportions of regular and chaotic signal elements. Current nonlinear dynamic methods that are used to assess the degree of irregularity in the voice fail to recognize the distribution of voice type components (VTCs).
Method: Based on measures of intrinsic dimension, this article presents a method to analyze the VTC distribution of phonations in excised canine larynx experiments. Thirty-nine phonation samples from 13 excised canine larynges at three different subglottal pressures were analyzed.
Results: Phonation produced with subglottal pressures above phonation instability pressure (PIP) and below phonation threshold pressure (PTP) resulted in high proportions of Voice Types 3 and 4, characterized by chaotic and noisy signals. Phonation produced with pressure between PTP and PIP contained mostly Type 1 voice, characterized by a regular and nearly periodic signal. Mean proportions of all VTCs varied significantly in comparisons of phonations produced with Sub-PTP and PTP as well as in comparisons of phonations produced with PTP and PIP.
Conclusions: Across all VTCs, the VTC profiles of normal and abnormal phonation differ significantly. Normal phonation is strongly associated with VTC1 (Voice Type Component 1), whereas abnormal phonation exhibits increased VTC4 (Voice Type Component 4). The study further demonstrates the ability of intrinsic dimension to successfully detect multiple voice types in an acoustic signal and highlights the need for expanded use of intrinsic dimension in human voice.
Supplemental Material S1.
Voice type component profile for phonation under phonation threshold pressure (PTP; see Figure 5).
Voice type component profile for normal phonation (120% PTP). PTP = phonation threshold pressure (see Figure 6).
Voice type component profile for phonation over PIP (150% PIP). PIP = phonation instability pressure (see Figure 7).
Liu, B., Raj, H., Klein, L., & Jiang, J. J. (2021). Evaluating the voice type component distributions of excised larynx phonations at three subglottal pressures. Journal of Speech, Language, and Hearing Research. Advance online publication. https://doi.org/10.1044/2021_JSLHR-20-00429