Acessibilidade / Reportar erro

Spectral and cepstral measurements in women with behavioral dysphonia

ABSTRACT

Purpose

To investigate whether there are differences in cepstral and spectral acoustic measures between women with behavioral dysphonia with and without laryngeal lesions and verify whether there is a correlation between such measures and the auditory-perceptual evaluation of voice quality.

Methods

The sample comprised 78 women with behavioral dysphonia without laryngeal lesions (BDWOL) and 68 with behavioral dysphonia with laryngeal lesions (vocal nodules) (BDWL). Cepstral peak prominence (CPP), cepstral peak prominence-smoothed (CPPS), spectral decrease, and H1-H2 (difference between the amplitude of the first and second harmonics) were extracted. They were submitted to the auditory-perceptual evaluation (APE) of the grade of hoarseness (GH), roughness (RO), breathiness (BR), and strain (ST).

Results

BDWL women had higher H1-H2 values and lower CPP and CPPS values than BDWOL women. More deviant voices had lower CPP and CPPS values. Breathy voices had lower CPP and CPPS values and higher H1-H2 values than rough ones. There was a weak negative correlation between CPP and RO, a moderate negative correlation with GH, and a strong negative correlation with BR. CPPS had a moderate negative correlation with GH, RO, and BR. H1-H2 had a weak positive correlation with BR. There was a weak positive correlation between spectral decrease and ST.

Conclusion

H1-H2, CPP, and CPPS were different between BDWOL and BDWL women. Furthermore, cepstral and spectral measures were correlated with the different APE parameters.

Voice; Voice Disorders; Voice Quality; Acoustics

RESUMO

Objetivo

Investigar se existem diferenças nas medidas acústicas cepstrais e espectrais entre mulheres com disfonia comportamental com e sem lesão laríngea, bem como verificar se existe correlação entre tais medidas e o julgamento perceptivo-auditivo da qualidade vocal.

Método

Participaram 78 mulheres com disfonia comportamental sem lesão laríngea (DCSL) e 68 com disfonia comportamental com lesão laríngea (nódulos vocais) (DCCL). Foram extraídas as medidas CPP (cepstral peak prominence), CPPS (cepstral peak prominence smoothed), declínio espectral e H1-H2 (diferença entre a amplitude do primeiro e do segundo harmônico), assim como o julgamento perceptivo-auditivo (JPA) do grau geral de desvio vocal (GG), graus de rugosidade (GR), de soprosidade (GS) e de tensão (GT).

Resultados

Mulheres com DCCL apresentaram maiores valores de H1-H2 e menores valores no CPP e CPPS, em relação às mulheres com DCSL. As vozes mais desviadas apresentaram menores valores do CPP e CPPS. As vozes soprosas apresentaram menores valores de CPP e CPPS, assim como maior valor de H1-H2 em relação às vozes rugosas. Houve correlação negativa fraca entre o CPP e o GR, negativa moderada com o GG e negativa forte com o GS. O CPPS apresentou correlação negativa moderada com o GG, GR e GS. A medida H1-H2 apresentou correlação positiva fraca com o GS. Houve correlação positiva fraca entre o declínio espectral e o GT.

Conclusão

As medidas acústicas H1-H2, CPP e CPPS apresentam diferenças entre mulheres com DCSL e DCCL. Além disso, há correlação entre as medidas cepstrais e espectrais e os diferentes parâmetros do JPA.

Voz; Qualidade Vocal; Acústica

INTRODUCTION

Acoustic analysis is a tool capable of characterizing the vocal signal both quantitatively and qualitatively, thus providing estimates of physiological aspects underlying voice production(11 Lopes L, Vieira V, Behlau M. Performance of different acoustic measures to discriminate individuals with and without voice disorders. J Voice. 2022;36(4):487-98. http://dx.doi.org/10.1016/j.jvoice.2020.07.008. PMid:32798120.
http://dx.doi.org/10.1016/j.jvoice.2020....
). From the clinical perspective, each acoustic measure is expected to be sensitive to variations in voice production aerodynamics and biomechanics and auditory-perceptual evaluation (APE) parameters.

There is a greater amount of research and applications using acoustic measures based on statistics and short-term perturbations in oscillation frequency (f0), short-term perturbations in amplitude, and perturbations in waveform(22 Brockmann-Bauser M, Drinnan MJ. Routine acoustic voice analysis: time to think again? Curr Opin Otolaryngol Head Neck Surg. 2011;19(3):165-70. http://dx.doi.org/10.1097/MOO.0b013e32834575fe. PMid:21483265.
http://dx.doi.org/10.1097/MOO.0b013e3283...
). However, there are important restrictions to these measures’ reliability and reproducibility, especially in the assessment of more intensely deviant voices(22 Brockmann-Bauser M, Drinnan MJ. Routine acoustic voice analysis: time to think again? Curr Opin Otolaryngol Head Neck Surg. 2011;19(3):165-70. http://dx.doi.org/10.1097/MOO.0b013e32834575fe. PMid:21483265.
http://dx.doi.org/10.1097/MOO.0b013e3283...
). In this regard, recent research(33 Murton O, Hillman R, Mehta D. Cepstral peak prominence values for clinical voice evaluation. Am J Speech Lang Pathol. 2020;29(3):1596-607. http://dx.doi.org/10.1044/2020_AJSLP-20-00001. PMid:32658592.
http://dx.doi.org/10.1044/2020_AJSLP-20-...
,44 Ben BB, Maryn Y, Gerrits E, de Bodt M. A meta-analysis: acoustic measurement of roughness and breathiness. J Speech Lang Hear Res. 2018;61(2):298-323. http://dx.doi.org/10.1044/2017_JSLHR-S-16-0188. PMid:29392295.
http://dx.doi.org/10.1044/2017_JSLHR-S-1...
) and consensus(55 Patel RR, Awan SN, Barkmeier-Kraemer J, Courey M, Deliyski D, Eadie T, et al. Recommended protocols for instrumental assessment of voice: american speech-language-hearing association expert panel to develop a protocol for instrumental assessment of vocal function. Am J Speech Lang Pathol. 2018;27(3):887-905. http://dx.doi.org/10.1044/2018_AJSLP-17-0009. PMid:29955816.
http://dx.doi.org/10.1044/2018_AJSLP-17-...
) support the use of spectral measures based on either Fourier transform/LPC spectrum (linear predictive coding) or the cepstrum to acoustically assess dysphonic voices. Among these, the cepstral peak prominence (CPP) and cepstral peak prominence-smoothed (CPPS) have been indicated as reliable measures to assess dysphonic voices, regardless of the intensity of vocal deviation(11 Lopes L, Vieira V, Behlau M. Performance of different acoustic measures to discriminate individuals with and without voice disorders. J Voice. 2022;36(4):487-98. http://dx.doi.org/10.1016/j.jvoice.2020.07.008. PMid:32798120.
http://dx.doi.org/10.1016/j.jvoice.2020....
,33 Murton O, Hillman R, Mehta D. Cepstral peak prominence values for clinical voice evaluation. Am J Speech Lang Pathol. 2020;29(3):1596-607. http://dx.doi.org/10.1044/2020_AJSLP-20-00001. PMid:32658592.
http://dx.doi.org/10.1044/2020_AJSLP-20-...

4 Ben BB, Maryn Y, Gerrits E, de Bodt M. A meta-analysis: acoustic measurement of roughness and breathiness. J Speech Lang Hear Res. 2018;61(2):298-323. http://dx.doi.org/10.1044/2017_JSLHR-S-16-0188. PMid:29392295.
http://dx.doi.org/10.1044/2017_JSLHR-S-1...

5 Patel RR, Awan SN, Barkmeier-Kraemer J, Courey M, Deliyski D, Eadie T, et al. Recommended protocols for instrumental assessment of voice: american speech-language-hearing association expert panel to develop a protocol for instrumental assessment of vocal function. Am J Speech Lang Pathol. 2018;27(3):887-905. http://dx.doi.org/10.1044/2018_AJSLP-17-0009. PMid:29955816.
http://dx.doi.org/10.1044/2018_AJSLP-17-...
-66 Lopes LW, Sousa ESS, Silva ACF, Silva IM, Paiva MAA, Vieira VJD, et al. Cepstral measures in the assessment of severity of voice disorders. CoDAS. 2019;31(4):e20180175. http://dx.doi.org/10.1590/2317-1782/20182018175. PMid:31433040.
http://dx.doi.org/10.1590/2317-1782/2018...
).

CPP and CPPS demonstrate to what extent f0 harmonics are individualized and stand out from the noise level in the signal(33 Murton O, Hillman R, Mehta D. Cepstral peak prominence values for clinical voice evaluation. Am J Speech Lang Pathol. 2020;29(3):1596-607. http://dx.doi.org/10.1044/2020_AJSLP-20-00001. PMid:32658592.
http://dx.doi.org/10.1044/2020_AJSLP-20-...
). Signals with greater regularity and less noise have greater definition and amplitude in the dominant cepstral peak. CPP and CPPS have two main advantages in relation to traditional perturbation and noise measures: they do not depend on f0 estimates to be obtained and can be extracted from both sustained vowels and linked speech. Moreover, CPPS values have been able to discriminate Brazilian Portuguese speakers with and without vocal deviation; distinguish predominantly rough, breathy, or strained voices; and detect a strong negative correlation with the grade of hoarseness (GH) and breathiness (BR), a moderate negative correlation with roughness (RO, and a weak negative correlation with strain (ST)(66 Lopes LW, Sousa ESS, Silva ACF, Silva IM, Paiva MAA, Vieira VJD, et al. Cepstral measures in the assessment of severity of voice disorders. CoDAS. 2019;31(4):e20180175. http://dx.doi.org/10.1590/2317-1782/20182018175. PMid:31433040.
http://dx.doi.org/10.1590/2317-1782/2018...
).

In the field of spectral measures, those related to spectral decrease (which include parameters that reflect the harmonic energy spectrum, comparing differences in decibels [dB] between relative amplitudes of two harmonics) are among the main predictors of vocal hyperfunctioning or the breathiness component related to voice production(77 Hejná M, Šturm P, Tylečková L, Bořil T. Normophonic breathiness in czech and danish: are females breathier than males? J Voice. 2021;35(3):498.e1-22. http://dx.doi.org/10.1016/j.jvoice.2019.10.019. PMid:31902679.
http://dx.doi.org/10.1016/j.jvoice.2019....
,88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
). The difference in amplitude between the first and second harmonics (H1-H2) and the characterization of the spectral decrease in dBs are among the main measures recommended for use in clinical practice(77 Hejná M, Šturm P, Tylečková L, Bořil T. Normophonic breathiness in czech and danish: are females breathier than males? J Voice. 2021;35(3):498.e1-22. http://dx.doi.org/10.1016/j.jvoice.2019.10.019. PMid:31902679.
http://dx.doi.org/10.1016/j.jvoice.2019....
). A systematic review with meta-analysis reported that H1-H2 was among the main measures to predict the presence and intensity of breathiness in voice emission(44 Ben BB, Maryn Y, Gerrits E, de Bodt M. A meta-analysis: acoustic measurement of roughness and breathiness. J Speech Lang Hear Res. 2018;61(2):298-323. http://dx.doi.org/10.1044/2017_JSLHR-S-16-0188. PMid:29392295.
http://dx.doi.org/10.1044/2017_JSLHR-S-1...
).

H1-H2 is a measure related to the spectral slope, glottal airflow pulse slope, vocal fold opening quotient, the thickness of the free edges of the vocal folds, and vibration asymmetry between vocal folds(99 Mehta DD, Espinoza VM, van Stan JH, Zañartu M, Hillman RE. The difference between first and second harmonic amplitudes correlates between glottal airflow and neck-surface accelerometer signals during phonation. J Acoust Soc Am. 2019;145(5):EL386-92. http://dx.doi.org/10.1121/1.5100909. PMid:31153299.
http://dx.doi.org/10.1121/1.5100909...
,1010 Kreiman J, Gerratt BR, Garellek M, Samlan R, Zhang Z. Toward a unified theory of voice production and perception. Loquens. 2014;1(1):e009. http://dx.doi.org/10.3989/loquens.2014.009. PMid:27135054.
http://dx.doi.org/10.3989/loquens.2014.0...
). It is also associated with breathiness and strain components. H1-H2 can furnish insights into physiological mechanisms underlying voice production and is potentially useful in the clinical assessment of dysphonic individuals.

Patients with phonotraumatic vocal fold lesions generally have higher H1-H2 values than vocally healthy individuals(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
). Higher H1-H2 values are associated with less contact between vocal folds, vibration pattern with less abrupt closure between vocal folds, longer open phase of the glottal cycles, greater constriction/narrowing of the vocal tract, and breathy voice quality(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
,99 Mehta DD, Espinoza VM, van Stan JH, Zañartu M, Hillman RE. The difference between first and second harmonic amplitudes correlates between glottal airflow and neck-surface accelerometer signals during phonation. J Acoust Soc Am. 2019;145(5):EL386-92. http://dx.doi.org/10.1121/1.5100909. PMid:31153299.
http://dx.doi.org/10.1121/1.5100909...
,1111 Zhang Z. Cause-effect relationship between vocal fold physiology and voice production in a three-dimensional phonation model. J Acoust Soc Am. 2016;139(4):1493-507. http://dx.doi.org/10.1121/1.4944754. PMid:27106298.
http://dx.doi.org/10.1121/1.4944754...
,1212 Bickley CA, Stevens KN. Effects of a vocal-tract constriction on the glottal source: experimental and modelling studies. J Phonetics. 1986;14(3-4):373-82. http://dx.doi.org/10.1016/S0095-4470(19)30711-9.
http://dx.doi.org/10.1016/S0095-4470(19)...
). On the other hand, lower H1-H2 values are compatible with greater glottal closure or abrupt glottal closure of the vocal folds, with greater phonatory strain in the voice quality(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
), greater longitudinal (anteroposterior) strain of the vocal folds(1313 Zhang Z. Effect of vocal fold stiffness on voice production in a three-dimensional body-cover phonation model. J Acoust Soc Am. 2017;142(4):2311-21. http://dx.doi.org/10.1121/1.5008497. PMid:29092586.
http://dx.doi.org/10.1121/1.5008497...
), and greater vertical thickness of the free edge of the vocal folds(1313 Zhang Z. Effect of vocal fold stiffness on voice production in a three-dimensional body-cover phonation model. J Acoust Soc Am. 2017;142(4):2311-21. http://dx.doi.org/10.1121/1.5008497. PMid:29092586.
http://dx.doi.org/10.1121/1.5008497...
).

H1-H2 varies greatly in intersubject comparison(1010 Kreiman J, Gerratt BR, Garellek M, Samlan R, Zhang Z. Toward a unified theory of voice production and perception. Loquens. 2014;1(1):e009. http://dx.doi.org/10.3989/loquens.2014.009. PMid:27135054.
http://dx.doi.org/10.3989/loquens.2014.0...
). Hence, it must be investigated in different laryngeal conditions and voice quality deviations to understand its usefulness in a clinical context. Moreover, CPP/CPPS, H1-H2, and spectral decrease have been considered complementary measures to assess vocal deviation and, therefore, estimate structural and kinematic changes in the vocal folds underlying the voice production process(1414 Samlan RA, Story BH. Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling. J Speech Lang Hear Res. 2011;54(5):1267-83. http://dx.doi.org/10.1044/1092-4388(2011/10-0195). PMid:21498582.
http://dx.doi.org/10.1044/1092-4388(2011...
,1515 Garellek M, Samlan R, Gerratt BR, Kreiman J. Modeling the voice source in terms of spectral slopes. J Acoust Soc Am. 2016;139(3):1404-10. http://dx.doi.org/10.1121/1.4944474. PMid:27036277.
http://dx.doi.org/10.1121/1.4944474...
).

Behavioral dysphonia corresponds to 65% of the cases found in voice clinical practice(1616 van Houtte E, van Lierde K, Claeys S. Pathophysiology and treatment of muscle tension dysphonia: a review of the current knowledge. J Voice. 2011;25(2):202-7. http://dx.doi.org/10.1016/j.jvoice.2009.10.009. PMid:20400263.
http://dx.doi.org/10.1016/j.jvoice.2009....
), and 10 to 40% of these cases can occur without structural changes in the larynx(1717 Roy N, Merrill R, Thibeault S, Parsa R, Gray S, Smith EM. Prevalence of voice disorders in teachers and the general population. J Speech Lang Hear Res. 2004;47(2):281-93. http://dx.doi.org/10.1044/1092-4388(2004/023). PMid:15157130.
http://dx.doi.org/10.1044/1092-4388(2004...
). Patients with behavioral dysphonia, regardless of having phonotraumatic lesions, have greater muscle activity and excessive effort or strain in the intrinsic and/or extrinsic musculature of the larynx during voice production(1818 Hillman RE, Stepp CE, van Stan JH, Zañartu M, Mehta DD. An updated theoretical framework for vocal hyperfunction. Am J Speech Lang Pathol. 2020;29(4):2254-60. http://dx.doi.org/10.1044/2020_AJSLP-20-00104. PMid:33007164.
http://dx.doi.org/10.1044/2020_AJSLP-20-...
). In their turn, tissue lesions associated with phonotrauma, such as nodules, polyps, and edema in the vocal folds, have the potential to change the vibration and kinematic characteristics of voice production(1818 Hillman RE, Stepp CE, van Stan JH, Zañartu M, Mehta DD. An updated theoretical framework for vocal hyperfunction. Am J Speech Lang Pathol. 2020;29(4):2254-60. http://dx.doi.org/10.1044/2020_AJSLP-20-00104. PMid:33007164.
http://dx.doi.org/10.1044/2020_AJSLP-20-...
). Given the prevalence of these conditions, it is important to identify measures capable of characterizing and monitoring these patients’ voice production with a potential application in clinical practice.

Thus, considering the relevance of these measures in clinical voice assessment and the scarcity of studies in Brazilian Portuguese speakers, this research aimed to investigate whether there are differences in cepstral (CPP and CPPS) and spectral (H1-H2 and spectral decrease) acoustic measures between women with behavioral dysphonia with (BDWL) and without laryngeal lesions (BDWOL) and whether there is a correlation between these measures and APE of voice quality.

METHODS

Study design

This cross-sectional descriptive study was appraised and approved by the originating institution’s Research Ethics Committee under evaluation report no. 2.677.777/2018. All participants signed an informed consent form, agreeing to participate in the research.

Sample

The research sample comprised 146 dysphonic women with a mean age of 33.10±11.26 years, assessed at the originating institution’s voice laboratory before voice therapy. Among them, 78 were diagnosed with behavioral dysphonia with no laryngeal lesion, and 68 were diagnosed with behavioral dysphonia with laryngeal lesions (vocal nodules).

The eligibility criteria to select participants were as follows: being females, as the acoustic measures investigated in this study are influenced by sex(1919 Pépiot E, Arnold A. Cross-gender differences in English/French bilingual speakers: a multiparametric study. Percept Mot Skills. 2021;128(1):153-77. http://dx.doi.org/10.1177/0031512520973514. PMid:33202192.
http://dx.doi.org/10.1177/00315125209735...
); being 18 years or older to avoid the period of voice changes; having voice complaints, based on the affirmative answer to the question, “Do you currently perceive any problem in your voice?”; being diagnosed with behavioral dysphonia (with vocal nodules or without laryngeal lesions), confirmed with visual laryngeal examination by an otorhinolaryngologist.

The participant exclusion criteria were occupational voice users or women who had been previously submitted to head and neck surgery (including cases of laryngeal microsurgery) or voice therapy. Other exclusion criteria were related to the quality of the signals of the collected voices - those whose voice signals had a signal-to-noise ratio (SNR) below 30 dBSPL were excluded(55 Patel RR, Awan SN, Barkmeier-Kraemer J, Courey M, Deliyski D, Eadie T, et al. Recommended protocols for instrumental assessment of voice: american speech-language-hearing association expert panel to develop a protocol for instrumental assessment of vocal function. Am J Speech Lang Pathol. 2018;27(3):887-905. http://dx.doi.org/10.1044/2018_AJSLP-17-0009. PMid:29955816.
http://dx.doi.org/10.1044/2018_AJSLP-17-...
).

Figure 1 shows information on sample selection based on inclusion and exclusion criteria.

Figure 1
Flowchart of research participants, according to eligibility criteria

Participants were allocated to one of the following groups:

  • Behavioral dysphonia without laryngeal lesions (BDWOL) - including participants whose dysphonia etiology was associated with vocal behavior, with self-reported voice problems. None of this group’s participants had structural changes in the larynx; 25 of them had an otorhinolaryngological diagnosis of mid-posterior glottic chink, 11 participants were diagnosed with supraglottal constriction, and 42 participants had a normal larynx.

  • Behavioral dysphonia with laryngeal lesions (BDWL) - women with structural changes in the vocal folds, whose dysphonia etiology is associated with vocal behavior, with self-reported voice problems. All 68 participants in this group had an otorhinolaryngological diagnosis of vocal nodules and glottic chink. This research approached the diagnosis of vocal nodules because this is the main laryngeal lesion associated with behavioral dysphonia(1616 van Houtte E, van Lierde K, Claeys S. Pathophysiology and treatment of muscle tension dysphonia: a review of the current knowledge. J Voice. 2011;25(2):202-7. http://dx.doi.org/10.1016/j.jvoice.2009.10.009. PMid:20400263.
    http://dx.doi.org/10.1016/j.jvoice.2009....
    ).

Data collection procedure

Data in this research were collected at the voice laboratory of a higher education institution. Participants were initially assessed with a form approaching personal data and voice complaints. Then, their voice samples were recorded.

Their voices were collected with Fonoview software, version 4.5, by CTS Informática, using an all-in-one Dell desktop, a Sennheiser unidirectional cardioid microphone, manufactured, model E-835, placed on a pedestal and attached to a Behringer preamplifier, model U-Phoria UMC 204. The voices were collected in an acoustically treated recording booth, with noise under 50 dB SPL, 44000-Hz sampling rate, 16 bits per sample, and the microphone 10 cm away from the participant’s mouth.

Participants were standing to collect their voices, with the pedestal in front of them, keeping their mouths at the set distance from the microphone. They were instructed on the collection procedures before recording their voices. They were asked to emit the sustained vowel /a/ for at least 5 seconds and count from 1 to 10 in self-reported habitual frequency and intensity.

During voice recording, the signals were visually monitored in the Fonoview display, observing signal duration (lasting at least 5 s) and the presence of peak clipping. Hence, if the emission lasted less than 5 s or had peak clipping, the participant was asked to rerecord to obtain the established criteria.

After the session of data collection and voice sample recording, the participants were referred for otorhinolaryngological examinations, including visual laryngeal examination with telelaryngoscopy using a rigid endoscope. All examinations were performed at the same reference service, and participants received a written report with the laryngeal diagnosis. Examination results were used as criteria to select research participants.

Then, before proceeding with APE and taking acoustic measures, the voice signals’ SNR was obtained in the VoxMore script of Praat open-access software (Paul Boersma and David Weenink, University of Amsterdam, the Netherlands), version 5.3.84. All samples of the 146 selected participants met the pre-established signal quality criteria(55 Patel RR, Awan SN, Barkmeier-Kraemer J, Courey M, Deliyski D, Eadie T, et al. Recommended protocols for instrumental assessment of voice: american speech-language-hearing association expert panel to develop a protocol for instrumental assessment of vocal function. Am J Speech Lang Pathol. 2018;27(3):887-905. http://dx.doi.org/10.1044/2018_AJSLP-17-0009. PMid:29955816.
http://dx.doi.org/10.1044/2018_AJSLP-17-...
), with SNR at 45.37± 3.1 dB SPL.

The vowel and counting samples were presented to speech-language-hearing therapists who specialized in voice with more than 20 years of experience in assessing and treating voice disorders. The APE session took place in a silent room, with earphones attached to a laptop computer, at a self-reported comfortable volume to the evaluator. In this evaluation, the speech-language-hearing therapist listened to the voice samples (counting and sustained vowels) in sequence, with a 1-second interval in between them. Then, they judged GH, RO, BR, and ST with markings on a 0-to-100-mm visual analog scale (VAS). Markings closer to 0 indicated less voice quality deviation, and those closer to 100 indicated greater voice quality deviation(2020 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31(1):67-71. http://dx.doi.org/10.1016/j.jvoice.2016.01.004. PMid:26873420.
http://dx.doi.org/10.1016/j.jvoice.2016....
).

At the end of the APE session, 20% (n = 30) of the voice samples were randomly repeated to analyze intrarater reliability with Cohen’s kappa coefficient. The judge’s kappa coefficient was 0.83, indicating good agreement.

VAS cutoff scores(2020 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31(1):67-71. http://dx.doi.org/10.1016/j.jvoice.2016.01.004. PMid:26873420.
http://dx.doi.org/10.1016/j.jvoice.2016....
) were used to classify the voices regarding vocal deviation and GH. Signals that scored ≤ 35.5 mm were considered as having adapted voice quality or normal voice quality variability, whereas voices with values > 35.5 mm were classified as deviant. Then, VAS was corresponded with a numerical scale, as follows: degree 1 (0 - 35.5 mm) was related to normal voice quality variability; degree 2 (35.6 - 50.5 mm), with mild to moderate deviation; degree 3 (50.6 - 90.5 mm), with moderate deviation; and degree 4 (90.6 - 100 mm), with intense deviation(2020 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31(1):67-71. http://dx.doi.org/10.1016/j.jvoice.2016.01.004. PMid:26873420.
http://dx.doi.org/10.1016/j.jvoice.2016....
). After the APE result, 11 women were found to have adapted voice quality, 74 had mild to moderate vocal deviation, and 61 had moderate vocal deviation. None of the participants selected for this study had intense vocal deviation. Higher RO, BR, and ST values in VAS of the 74 women with vocal deviation were used to classify the predominant voice quality deviation as roughness, breathiness, or strain.

The 11 participants with adapted voice quality were maintained in the study, considering that the diagnostic confirmation of dysphonia is multidimensional and that behavioral conditions may have multiple interactions between physiological, acoustic, auditory-perceptual, and self-perceptive parameters of the voice problem(1818 Hillman RE, Stepp CE, van Stan JH, Zañartu M, Mehta DD. An updated theoretical framework for vocal hyperfunction. Am J Speech Lang Pathol. 2020;29(4):2254-60. http://dx.doi.org/10.1044/2020_AJSLP-20-00104. PMid:33007164.
http://dx.doi.org/10.1044/2020_AJSLP-20-...
). Thus, women with behavioral dysphonia may have adapted voice quality, despite reporting voice complaints/symptoms and structural changes in the larynx. Eight out of the 11 women in this sample with adapted voice quality were in the BDWOL group, while only three were allocated to the BDWL group.

Counting from 1 to 10 (linked speech) was the only speech task used in the reference study(2020 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31(1):67-71. http://dx.doi.org/10.1016/j.jvoice.2016.01.004. PMid:26873420.
http://dx.doi.org/10.1016/j.jvoice.2016....
) addressing the Brazilian reality to determine VAS cutoff scores. Even though this may be a limitation of this study, it used the cutoff scores proposed by Yamasaki et al.(2020 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31(1):67-71. http://dx.doi.org/10.1016/j.jvoice.2016.01.004. PMid:26873420.
http://dx.doi.org/10.1016/j.jvoice.2016....
) because it approaches only the four internationally considered degrees of deviation (adapted or normal voice quality variability, mild to moderate, moderate, and intense) and is the main Brazilian cutoff reference for this classification. Moreover, the task of counting from 1 to 10 was included (along with the sustained vowel) in the APE.

Then, the CPP, CPPS, spectral decrease, and H1-H2 acoustic measures were taken from the sustained vowel /a/ samples. The amplitude of the first and second harmonics is influenced by the type of vowel collected, especially regarding timbre (open vs. close)(2121 Henton CG, Bladon RAW. Breathiness in normal female speech: inefficiency versus desirability. Lang Commun. 1985;5(3):221-7. http://dx.doi.org/10.1016/0271-5309(85)90012-6.
http://dx.doi.org/10.1016/0271-5309(85)9...
). Hence, vowel /a/ is the most reliable one to take H1-H2 measures because the frequency of the first formant is high enough not to interfere with the harmonics at the lower frequency bands. Moreover, the first and second harmonics are more distinct in this vowel(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
,2121 Henton CG, Bladon RAW. Breathiness in normal female speech: inefficiency versus desirability. Lang Commun. 1985;5(3):221-7. http://dx.doi.org/10.1016/0271-5309(85)90012-6.
http://dx.doi.org/10.1016/0271-5309(85)9...
).

The measures were taken automatically with the VoxMore script(2222 Samuel A, Moraes R, Lopes L. VoxMore: artefato tecnológico para auxiliar a avaliação acústica da voz no processo ensino-aprendizagem e prática clínica. CoDAS. 2023. No prelo.) in Praat (v.5.3.84). This script takes measures from the 3 central seconds of the sustained vowel. Concerning H1-H2, it takes measures with the discrete Fourier transform, using 50-ms windows. H1-H2 for each window was defined as the difference in dB between the amplitudes of the first and second harmonics in the frequency spectrum(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
). CPP was extracted with 41-ms windows, and the final CPP value for each sample corresponded to the mean of all 3-second-interval analysis frames(2323 Hillenbrand J, Cleveland RA, Erickson RL. Acoustic correlates of breathy vocal quality. J Speech Hear Res. 1994;37(4):769-78. http://dx.doi.org/10.1044/jshr.3704.769. PMid:7967562.
http://dx.doi.org/10.1044/jshr.3704.769...
,2424 Hillenbrand J, Houde RA. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res. 1996;39(2):311-21. http://dx.doi.org/10.1044/jshr.3902.311. PMid:8729919.
http://dx.doi.org/10.1044/jshr.3902.311...
). CPPS was obtained in 2-ms windows, and its final value corresponded to the mean of all 3-second-interval analysis frames(33 Murton O, Hillman R, Mehta D. Cepstral peak prominence values for clinical voice evaluation. Am J Speech Lang Pathol. 2020;29(3):1596-607. http://dx.doi.org/10.1044/2020_AJSLP-20-00001. PMid:32658592.
http://dx.doi.org/10.1044/2020_AJSLP-20-...
,2525 Brockmann-Bauser M, van Stan JH, Carvalho Sampaio M, Bohlender JE, Hillman RE, Mehta DD. Effects of vocal intensity and fundamental frequency on cepstral peak prominence in patients with voice disorders and vocally healthy controls. J Voice. 2021;35(3):411-7. http://dx.doi.org/10.1016/j.jvoice.2019.11.015. PMid:31859213.
http://dx.doi.org/10.1016/j.jvoice.2019....
).

Data analysis

All independent variables addressed in the study were submitted to descriptive statistical analysis, including the means and standard deviations. The Shapiro-Wilk test was initially used to verify the normality distribution curve of the variables investigated (GH, RO, BR, ST, CPP, CPPS, spectral decrease, and H1-H2) in the study sample. Thus, given the normal distribution of all these variables, the independent samples t-test was used to compare the means of the acoustic measures and APE between the BDWOL and BDWL groups. The one-way ANOVA test was used to compare acoustic measures between the groups regarding GH and the predominant voice quality, followed by the Tukey post-test when there was a statistically significant difference between the groups in ANOVA. The Pearson correlation test was used to verify whether there was a correlation between values of acoustic measures and APE parameters (GH, RO, BR, and ST).

Correlation coefficients were used to assess and quantify the degree of linear relationship between acoustic (CPP, CPPS, spectral decrease, and H1-H2) and APE variables (GH, RO, BR, and ST), observing whether they changed together and to what degree. This research classified correlation coefficients as follows: 0.1 to 0.3 indicated a weak correlation; 0.4 to 0.6 indicated a moderate correlation; and above 0.6 indicated a strong correlation.

All analyses were performed in the Statistical Package for the Social Sciences (SPSS), version 2.0. The level of significance was set at 5%.

RESULTS

Differences were found in all H1-H2, CPP, and CPPS acoustic measures between the participating groups (Table 1). BDWL women had higher H1-H2 values (p = 0.024) and lower CPP (p = 0.017) and CPPS values (p = 0.004) than BDWOL women. There was no difference in spectral decrease values between BDWL and BDWOL (p = 0.504).

Table 1
Comparison of mean acoustic measures and auditory-perceptual evaluation parameters between women with behavioral dysphonia, with and without laryngeal lesions

There were differences in CPP (p < 0.001) and CPPS values (p < 0.001) in relation to GH (Table 2). The Tukey post-test compared the groups pair by pair and found differences in CPP values in the comparison between adapted x mild to moderate (p = 0.004), adapted x moderate (p < 0.001), and mild to moderate x moderate (p < 0.001). Likewise, CPPS values were different in the comparison between adapted x mild to moderate (p = 0.005), adapted x moderate (p < 0.001), and mild to moderate x moderate (p < 0.001). In general, more deviant voices had lower CPP and CPPS values. There were no differences in spectral decrease (p = 0.220) and H1-H2 values (p = 0.504) in relation to GH (p = 0.504).

Table 2
Comparison of mean acoustic measures and auditory-perceptual evaluation parameters between women with behavioral dysphonia with and without laryngeal lesions, regarding the grade of hoarseness in their voices

There were differences in CPP (p < 0.001), CPPS (p < 0.001), and H1-H2 measures (p = 0.022) in relation to the predominant voice quality (Table 3). The Tukey post-test compared the groups pair by pair and found differences in CPP (p < 0.001), CPPS (p < 0.001), and H1-H2 values (p = 0.018) only in the comparison between rough and breathy voices. The latter had lower CPP and CPPS values and higher H1-H2 values than rough voices. There were no differences in spectral decrease values (p = 0.671) in relation to the predominant voice quality.

Table 3
Comparison of mean acoustic measures and auditory-perceptual evaluation parameters between women with behavioral dysphonia with and without laryngeal lesions, regarding their predominant voice quality.

Lastly, CPP had a weak negative correlation with RO (< 0.001), a moderate negative correlation with GH (< 0.001), and a strong negative correlation with BR (< 0.001) (Table 4). CPPS had a moderate negative correlation with GH (< 0.001), RO (< 0.001), and BR (< 0.001). H1-H2 had only a weak positive correlation with BR (p = 0.013). There was a weak positive correlation between spectral decrease and ST (p = 0.003).

Table 4
Correlation between the intensity of vocal deviation in the various auditory-perceptual parameters and the acoustic measures

DISCUSSION

Spectral/cepstral acoustic measures have stood out as the most promising ones to assess and monitor dysphonic voices. They have proved to be sensitive to different APE components (such as GH, BR, and ST)(66 Lopes LW, Sousa ESS, Silva ACF, Silva IM, Paiva MAA, Vieira VJD, et al. Cepstral measures in the assessment of severity of voice disorders. CoDAS. 2019;31(4):e20180175. http://dx.doi.org/10.1590/2317-1782/20182018175. PMid:31433040.
http://dx.doi.org/10.1590/2317-1782/2018...
) and kinematic and vibration characteristics associated with normal and dysphonic voice production(1414 Samlan RA, Story BH. Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling. J Speech Lang Hear Res. 2011;54(5):1267-83. http://dx.doi.org/10.1044/1092-4388(2011/10-0195). PMid:21498582.
http://dx.doi.org/10.1044/1092-4388(2011...
).

BDWL women in this research had higher H1-H2 values and lower CPP and CPPS values than BDWOL women. There is no H1-H2 normative data for Brazilian Portuguese-speaking women, whereas for English-speaking ones there they range from 3.3 to 8.4 dB(2121 Henton CG, Bladon RAW. Breathiness in normal female speech: inefficiency versus desirability. Lang Commun. 1985;5(3):221-7. http://dx.doi.org/10.1016/0271-5309(85)90012-6.
http://dx.doi.org/10.1016/0271-5309(85)9...
). BDWOL and BDWL women in this study respectively obtained 1.65±5.31 and 3.64±5.16. These values may suggest greater glottal closure or abrupt glottal closure of the vocal folds, with greater phonatory strain in the voice quality(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
), increased longitudinal (anteroposterior) strain of the vocal folds(1313 Zhang Z. Effect of vocal fold stiffness on voice production in a three-dimensional body-cover phonation model. J Acoust Soc Am. 2017;142(4):2311-21. http://dx.doi.org/10.1121/1.5008497. PMid:29092586.
http://dx.doi.org/10.1121/1.5008497...
), and increased vertical thickness of the free edge of the vocal folds(1313 Zhang Z. Effect of vocal fold stiffness on voice production in a three-dimensional body-cover phonation model. J Acoust Soc Am. 2017;142(4):2311-21. http://dx.doi.org/10.1121/1.5008497. PMid:29092586.
http://dx.doi.org/10.1121/1.5008497...
) in dysphonic women.

As for H1-H2, the higher values in BDWL than in BDWOL women may indicate less contact between vocal folds, a vibration pattern with less abrupt closure between vocal folds, longer open phase of the glottal cycles, greater constriction/narrowing of the vocal tract, and greater breathiness(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
,99 Mehta DD, Espinoza VM, van Stan JH, Zañartu M, Hillman RE. The difference between first and second harmonic amplitudes correlates between glottal airflow and neck-surface accelerometer signals during phonation. J Acoust Soc Am. 2019;145(5):EL386-92. http://dx.doi.org/10.1121/1.5100909. PMid:31153299.
http://dx.doi.org/10.1121/1.5100909...
,1111 Zhang Z. Cause-effect relationship between vocal fold physiology and voice production in a three-dimensional phonation model. J Acoust Soc Am. 2016;139(4):1493-507. http://dx.doi.org/10.1121/1.4944754. PMid:27106298.
http://dx.doi.org/10.1121/1.4944754...
,1212 Bickley CA, Stevens KN. Effects of a vocal-tract constriction on the glottal source: experimental and modelling studies. J Phonetics. 1986;14(3-4):373-82. http://dx.doi.org/10.1016/S0095-4470(19)30711-9.
http://dx.doi.org/10.1016/S0095-4470(19)...
).

BDWL patients tend to have a glottic chink, which may explain the greater breathiness and higher H1-H2 values(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
,2626 Radish Kumar B, Bhat J, Mukhi P. Vowel harmonic amplitude differences in persons with vocal nodules. J Voice. 2011;25(5):559-61. http://dx.doi.org/10.1016/j.jvoice.2010.06.009. PMid:20926251.
http://dx.doi.org/10.1016/j.jvoice.2010....
,2727 Narasimhan SV, Vishal K. Spectral measures of hoarseness in persons with hyperfunctional voice disorder. J Voice. 2017;31(1):57-61. http://dx.doi.org/10.1016/j.jvoice.2016.03.005. PMid:27080591.
http://dx.doi.org/10.1016/j.jvoice.2016....
). The BDWOL group probably had a hyperfunctional component and greater glottal closure or abrupt glottal closure, justifying the lower H1-H2 values.

Previous studies(88 van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065. PMid:31995428.
http://dx.doi.org/10.1044/2019_JSLHR-19-...
,99 Mehta DD, Espinoza VM, van Stan JH, Zañartu M, Hillman RE. The difference between first and second harmonic amplitudes correlates between glottal airflow and neck-surface accelerometer signals during phonation. J Acoust Soc Am. 2019;145(5):EL386-92. http://dx.doi.org/10.1121/1.5100909. PMid:31153299.
http://dx.doi.org/10.1121/1.5100909...
,2626 Radish Kumar B, Bhat J, Mukhi P. Vowel harmonic amplitude differences in persons with vocal nodules. J Voice. 2011;25(5):559-61. http://dx.doi.org/10.1016/j.jvoice.2010.06.009. PMid:20926251.
http://dx.doi.org/10.1016/j.jvoice.2010....
) compared patients with phonotraumatic lesions and vocally healthy individuals, observing that H1-H2 values were lower in the group with lesions. However, data in the present research indicated higher H1-H2 values in BDWL than in BDWOL women. Thus, there seems to be a continuum in H1-H2 measures, with higher values in vocally healthy people, decreased H1-H2 in cases of BDWL, and lower values in BDWOL. This finding may be compatible with more intense glottal closure in people with behavioral dysphonia than in vocally healthy ones, reflected in lower H1-H2 values.

There is also a moderate positive correlation between vibration asymmetry in the vocal folds and H1-H2 values(2828 Mehta DD, Zaéartu M, Quatieri TF, Deliyski DD, Hillman RE. Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy. J Acoust Soc Am. 2011;130(6):3999-4009. http://dx.doi.org/10.1121/1.3658441. PMid:22225054.
http://dx.doi.org/10.1121/1.3658441...
). Hence, the vocal nodules in the BDWL group may cause greater vibration asymmetry between vocal folds and explain the higher H1-H2 values in this group.

As for the comparison between acoustic measures and GH, only CPP and CPPS had differences, with lower values in patients with more deviant voices. CPP and CPPS have been recommended as useful measures to monitor voice quality deviation during the patient’s treatment(33 Murton O, Hillman R, Mehta D. Cepstral peak prominence values for clinical voice evaluation. Am J Speech Lang Pathol. 2020;29(3):1596-607. http://dx.doi.org/10.1044/2020_AJSLP-20-00001. PMid:32658592.
http://dx.doi.org/10.1044/2020_AJSLP-20-...
,66 Lopes LW, Sousa ESS, Silva ACF, Silva IM, Paiva MAA, Vieira VJD, et al. Cepstral measures in the assessment of severity of voice disorders. CoDAS. 2019;31(4):e20180175. http://dx.doi.org/10.1590/2317-1782/20182018175. PMid:31433040.
http://dx.doi.org/10.1590/2317-1782/2018...
). These measures are sensitive to the periodicity of the signal and amplitude of the harmonic energy and are, therefore, recommended as the main measure to monitor GH(55 Patel RR, Awan SN, Barkmeier-Kraemer J, Courey M, Deliyski D, Eadie T, et al. Recommended protocols for instrumental assessment of voice: american speech-language-hearing association expert panel to develop a protocol for instrumental assessment of vocal function. Am J Speech Lang Pathol. 2018;27(3):887-905. http://dx.doi.org/10.1044/2018_AJSLP-17-0009. PMid:29955816.
http://dx.doi.org/10.1044/2018_AJSLP-17-...
).

The literature presents conflicting H1-H2 results in relation to GH. No consistent H1-H2 changes were found in the comparison between voices with greater or less GH(2929 Cannito MP, Buder EH, Chorna LB. Spectral amplitude measures of adductor spasmodic dysphonic speech. J Voice. 2005;19(3):391-410. http://dx.doi.org/10.1016/j.jvoice.2004.07.001. PMid:16102666.
http://dx.doi.org/10.1016/j.jvoice.2004....
). Voices with intense deviations have values similar to those of voices with no deviations(2929 Cannito MP, Buder EH, Chorna LB. Spectral amplitude measures of adductor spasmodic dysphonic speech. J Voice. 2005;19(3):391-410. http://dx.doi.org/10.1016/j.jvoice.2004.07.001. PMid:16102666.
http://dx.doi.org/10.1016/j.jvoice.2004....
). Furthermore, H1-H2 values did not decrease after voice therapy in dysphonic patients, despite the decreased GH and improved videostroboscopy laryngeal parameters(3030 Holmberg EB, Doyle P, Perkell JS, Hammarberg B, Hillman RE. Aerodynamic and acoustic voice measurements of patients with vocal nodules: variation in baseline and changes across voice therapy. J Voice. 2003;17(3):269-82. http://dx.doi.org/10.1067/S0892-1997(03)00076-6. PMid:14513951.
http://dx.doi.org/10.1067/S0892-1997(03)...
). Hence, H1-H2 seems to complement other measures like CPP and CPPS, being useful in understanding the predominant voice quality and the physiology underlying the voice production process.

Dysphonic women with predominantly breathy voices had lower CPP and CPPS values and higher H1-H2 values than dysphonic women with predominantly rough voices. It is challenging to establish the difference between dysphonic rough and breathy voices based on acoustic measures because these two components overlap in the same deviant vocal signal.

There are three acoustic indicators of the presence of breathiness in voice production: signal periodicity and first harmonic spectral decrease and amplitude(2424 Hillenbrand J, Houde RA. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res. 1996;39(2):311-21. http://dx.doi.org/10.1044/jshr.3902.311. PMid:8729919.
http://dx.doi.org/10.1044/jshr.3902.311...
). Signal periodicity can explain 80% of the variable perception of breathiness degree in a voice(2424 Hillenbrand J, Houde RA. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res. 1996;39(2):311-21. http://dx.doi.org/10.1044/jshr.3902.311. PMid:8729919.
http://dx.doi.org/10.1044/jshr.3902.311...
). CPP and CPPS are related to signal periodicity, which justifies the differences found in these measures between rough and breathy voices in this research, with lower values in breathy voices.

Breathy voices tend to have greater amplitude in the first harmonic, with relatively weak higher harmonics, which justifies the higher H1-H2 values in these than in rough voices(2424 Hillenbrand J, Houde RA. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res. 1996;39(2):311-21. http://dx.doi.org/10.1044/jshr.3902.311. PMid:8729919.
http://dx.doi.org/10.1044/jshr.3902.311...
). Since these three measures are considered sensitive to the presence of breathiness in voice production(3131 Aaen M, McGlashan J, Thu KT, Sadolin C. Assessing and quantifying air added to the voice by means of laryngostroboscopic imaging, EGG, and acoustics in vocally trained subjects. J Voice. 2021;35(2):326.e1-11. http://dx.doi.org/10.1016/j.jvoice.2019.09.001. PMid:31628046.
http://dx.doi.org/10.1016/j.jvoice.2019....
), they help understand the difference in values between predominantly rough and breathy voices.

In this research, GH had a moderate negative correlation with CPP and CPPS. This finding supports the current indication of CPP and CPPS as the main measures to identify the presence and intensity of vocal deviation in dysphonic voices(11 Lopes L, Vieira V, Behlau M. Performance of different acoustic measures to discriminate individuals with and without voice disorders. J Voice. 2022;36(4):487-98. http://dx.doi.org/10.1016/j.jvoice.2020.07.008. PMid:32798120.
http://dx.doi.org/10.1016/j.jvoice.2020....
,33 Murton O, Hillman R, Mehta D. Cepstral peak prominence values for clinical voice evaluation. Am J Speech Lang Pathol. 2020;29(3):1596-607. http://dx.doi.org/10.1044/2020_AJSLP-20-00001. PMid:32658592.
http://dx.doi.org/10.1044/2020_AJSLP-20-...
,55 Patel RR, Awan SN, Barkmeier-Kraemer J, Courey M, Deliyski D, Eadie T, et al. Recommended protocols for instrumental assessment of voice: american speech-language-hearing association expert panel to develop a protocol for instrumental assessment of vocal function. Am J Speech Lang Pathol. 2018;27(3):887-905. http://dx.doi.org/10.1044/2018_AJSLP-17-0009. PMid:29955816.
http://dx.doi.org/10.1044/2018_AJSLP-17-...
).

RO had a strong negative correlation with CPP, a moderate negative correlation with CPPS, and a weak positive correlation with H1-H2. CPP and CPPS are considered sensitive measures to the presence of breathiness in voice emission. Computer models(1414 Samlan RA, Story BH. Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling. J Speech Lang Hear Res. 2011;54(5):1267-83. http://dx.doi.org/10.1044/1092-4388(2011/10-0195). PMid:21498582.
http://dx.doi.org/10.1044/1092-4388(2011...
) have demonstrated that CPP is sensitive to the various factors underlying the hearing perception of breathiness, such as distancing in vocal processes, changes on the surface of the free edges of the vocal folds, and decreased constriction in the epilaryngeal area.

On the other hand, although H1-H2 is indicated as a measure sensitive to the presence of breathiness, experimental conditions verified that the H1-H2 measure was only sensitive to the presence of breathiness originating in the distancing of vocal processes when all other structural and kinematic variables of the vocal folds remained constant(1414 Samlan RA, Story BH. Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling. J Speech Lang Hear Res. 2011;54(5):1267-83. http://dx.doi.org/10.1044/1092-4388(2011/10-0195). PMid:21498582.
http://dx.doi.org/10.1044/1092-4388(2011...
). Nonetheless, changes in the free edge of the vocal folds and constrictions implemented in the vocal tract may have a greater influence on H1-H2 values and the association between this measure and the perception of breathiness(1414 Samlan RA, Story BH. Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling. J Speech Lang Hear Res. 2011;54(5):1267-83. http://dx.doi.org/10.1044/1092-4388(2011/10-0195). PMid:21498582.
http://dx.doi.org/10.1044/1092-4388(2011...
,1515 Garellek M, Samlan R, Gerratt BR, Kreiman J. Modeling the voice source in terms of spectral slopes. J Acoust Soc Am. 2016;139(3):1404-10. http://dx.doi.org/10.1121/1.4944474. PMid:27036277.
http://dx.doi.org/10.1121/1.4944474...
).

RO demonstrated a moderate negative correlation with CPPS and a weak negative correlation with CPP. Roughness and breathiness obviously overlap in dysphonic voices, as previously mentioned. Such overlapping increases in more intense vocal deviations, which explains the moderate to strong negative correlation between CPP and CPPS and BR and the weak to moderate correlation between these measures and RO.

Lastly, there was a weak positive correlation between ST and spectral decrease. Even though the spectral decrease is influenced by structural and kinematic characteristics of the vocal folds, subglottal pressure seems to be the main determinant of such values(3232 Zhang Z. Mechanics of human voice production and control. J Acoust Soc Am. 2016;140(4):2614-35. http://dx.doi.org/10.1121/1.4964509. PMid:27794319.
http://dx.doi.org/10.1121/1.4964509...
). Increased subglottal pressure, in turn, is related to the hearing perception of phonatory strain(66 Lopes LW, Sousa ESS, Silva ACF, Silva IM, Paiva MAA, Vieira VJD, et al. Cepstral measures in the assessment of severity of voice disorders. CoDAS. 2019;31(4):e20180175. http://dx.doi.org/10.1590/2317-1782/20182018175. PMid:31433040.
http://dx.doi.org/10.1590/2317-1782/2018...
), which may explain the correlation between spectral decrease and ST found in this research.

Although H1-H2 measures and spectral decrease are considered potential indicators of the presence of strain in the voice quality(2626 Radish Kumar B, Bhat J, Mukhi P. Vowel harmonic amplitude differences in persons with vocal nodules. J Voice. 2011;25(5):559-61. http://dx.doi.org/10.1016/j.jvoice.2010.06.009. PMid:20926251.
http://dx.doi.org/10.1016/j.jvoice.2010....
,2727 Narasimhan SV, Vishal K. Spectral measures of hoarseness in persons with hyperfunctional voice disorder. J Voice. 2017;31(1):57-61. http://dx.doi.org/10.1016/j.jvoice.2016.03.005. PMid:27080591.
http://dx.doi.org/10.1016/j.jvoice.2016....
), they were not found in this study. The etiology of strain in the voice quality is evidently multifactorial, and the various factors influence differently the magnitude of changes in cepstral and spectral measures. For instance, the vertical thickness of the vocal folds has a greater effect on determining the spectral contour and the energy of the upper harmonics than longitudinal (anteroposterior) strain in the vocal folds. In this sense, the absence of correlation between ST and CPP, CPPS, and H1-H2 does not indicate that such measures are not useful in clinical assessment but restates that acoustic parameters must be cautiously interpreted and associated with other clinical information obtained from multidimensional voice assessment.

The findings in this research may reinforce that the voice production process occurs in the time domain while hearing perception of the voice quality is strongly influenced by spectral information. Thus, different glottal/supraglottal adjustments and laryngeal conditions may produce the same vocal output. Likewise, greater changes in these adjustments may not change the vocal output(1010 Kreiman J, Gerratt BR, Garellek M, Samlan R, Zhang Z. Toward a unified theory of voice production and perception. Loquens. 2014;1(1):e009. http://dx.doi.org/10.3989/loquens.2014.009. PMid:27135054.
http://dx.doi.org/10.3989/loquens.2014.0...
,3333 Kreiman J, Gerratt BR. Perceptual sensitivity to first harmonic amplitude in the voice source. J Acoust Soc Am. 2010;128(4):2085-9. http://dx.doi.org/10.1121/1.3478784. PMid:20968379.
http://dx.doi.org/10.1121/1.3478784...
). The various adjustments interact nonlinearly to make an auditory impression related to voice quality. Moreover, the data found in this research seem to reinforce the importance of acoustic analysis as an additional tool to estimate structural and kinematic aspects underlying voice production. They must be cautiously used to understand dysphonic patients’ voice problems.

This study categorized the groups in terms of the presence/absence of laryngeal lesions. This method was necessary to ensure the internal and external validity of the study concerning its objectives. Despite the differences found in the investigated measures between the groups with and without laryngeal lesions, the conscious and pragmatic use of acoustic measures is recommended in the clinical assessment process to confirm diagnoses or monitor patients with voice complaints. Acoustic measures mainly help speech-language-hearing therapists understand the patient’s voice quality and underlying physiological, aerodynamic, and biomechanical factors, which cannot be directly and routinely accessed in clinical practice without using high-cost technology (such as aerodynamic assessment systems).

The results of this research were limited to women with behavioral dysphonia, with or without laryngeal lesions (specifically, vocal nodules) in the vocal folds. This must be considered when interpreting and transferring these findings to everyday clinical conditions. This study did not recruit vocally healthy individuals, which is a limitation to be addressed in future investigations. Furthermore, normative H1-H2 data must be developed for vocally healthy Brazilian Portuguese speakers of both sexes.

One of the limitations of this study was that acoustic measures were taken only from the sustained vowel samples, not including linked speech. The discriminative power of cepstral and spectral measures to distinguish voice samples of dysphonic and non-dysphonic individuals may be influenced by speech tasks(2323 Hillenbrand J, Cleveland RA, Erickson RL. Acoustic correlates of breathy vocal quality. J Speech Hear Res. 1994;37(4):769-78. http://dx.doi.org/10.1044/jshr.3704.769. PMid:7967562.
http://dx.doi.org/10.1044/jshr.3704.769...
,2424 Hillenbrand J, Houde RA. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res. 1996;39(2):311-21. http://dx.doi.org/10.1044/jshr.3902.311. PMid:8729919.
http://dx.doi.org/10.1044/jshr.3902.311...
). Moreover, APE in this research approached samples combining sustained vowels and linked speech (counting numbers), whereas acoustic parameters were extracted only from the vowel. Hence, the correlation analysis between APE and acoustic measures may have been influenced by such methods. Thus, further studies may investigate whether the correlation strength changes when the investigation considers APE and acoustic measure data obtained from linked speech tasks.

CONCLUSION

BDWL women had higher H1-H2 values and lower CPP and CPPS values than BDWOL women. More deviant voices had lower CPP and CPPS values than less deviant voices. Breathy voices had lower CPP and CPPS values and higher H1-H2 values than rough ones.

GH had a moderate negative correlation with CPP and CPPS. RO had a weak negative correlation with CPP and a moderate negative correlation with CPPS. BR had a strong negative correlation with CPP, a moderate negative correlation with CPPS, and a weak positive correlation with H1-H2. ST had a weak positive correlation with spectral decrease.

  • Study conducted at Departamento de Fonoaudiologia, Universidade Federal da Paraíba - UFPB - João Pessoa (PB), Brasil.

REFERÊNCIAS

  • 1
    Lopes L, Vieira V, Behlau M. Performance of different acoustic measures to discriminate individuals with and without voice disorders. J Voice. 2022;36(4):487-98. http://dx.doi.org/10.1016/j.jvoice.2020.07.008 PMid:32798120.
    » http://dx.doi.org/10.1016/j.jvoice.2020.07.008
  • 2
    Brockmann-Bauser M, Drinnan MJ. Routine acoustic voice analysis: time to think again? Curr Opin Otolaryngol Head Neck Surg. 2011;19(3):165-70. http://dx.doi.org/10.1097/MOO.0b013e32834575fe PMid:21483265.
    » http://dx.doi.org/10.1097/MOO.0b013e32834575fe
  • 3
    Murton O, Hillman R, Mehta D. Cepstral peak prominence values for clinical voice evaluation. Am J Speech Lang Pathol. 2020;29(3):1596-607. http://dx.doi.org/10.1044/2020_AJSLP-20-00001 PMid:32658592.
    » http://dx.doi.org/10.1044/2020_AJSLP-20-00001
  • 4
    Ben BB, Maryn Y, Gerrits E, de Bodt M. A meta-analysis: acoustic measurement of roughness and breathiness. J Speech Lang Hear Res. 2018;61(2):298-323. http://dx.doi.org/10.1044/2017_JSLHR-S-16-0188 PMid:29392295.
    » http://dx.doi.org/10.1044/2017_JSLHR-S-16-0188
  • 5
    Patel RR, Awan SN, Barkmeier-Kraemer J, Courey M, Deliyski D, Eadie T, et al. Recommended protocols for instrumental assessment of voice: american speech-language-hearing association expert panel to develop a protocol for instrumental assessment of vocal function. Am J Speech Lang Pathol. 2018;27(3):887-905. http://dx.doi.org/10.1044/2018_AJSLP-17-0009 PMid:29955816.
    » http://dx.doi.org/10.1044/2018_AJSLP-17-0009
  • 6
    Lopes LW, Sousa ESS, Silva ACF, Silva IM, Paiva MAA, Vieira VJD, et al. Cepstral measures in the assessment of severity of voice disorders. CoDAS. 2019;31(4):e20180175. http://dx.doi.org/10.1590/2317-1782/20182018175 PMid:31433040.
    » http://dx.doi.org/10.1590/2317-1782/20182018175
  • 7
    Hejná M, Šturm P, Tylečková L, Bořil T. Normophonic breathiness in czech and danish: are females breathier than males? J Voice. 2021;35(3):498.e1-22. http://dx.doi.org/10.1016/j.jvoice.2019.10.019 PMid:31902679.
    » http://dx.doi.org/10.1016/j.jvoice.2019.10.019
  • 8
    van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, et al. Differences in weeklong ambulatory vocal behavior between female patients with phonotraumatic lesions and matched controls. J Speech Lang Hear Res. 2020;63(2):372-84. http://dx.doi.org/10.1044/2019_JSLHR-19-00065 PMid:31995428.
    » http://dx.doi.org/10.1044/2019_JSLHR-19-00065
  • 9
    Mehta DD, Espinoza VM, van Stan JH, Zañartu M, Hillman RE. The difference between first and second harmonic amplitudes correlates between glottal airflow and neck-surface accelerometer signals during phonation. J Acoust Soc Am. 2019;145(5):EL386-92. http://dx.doi.org/10.1121/1.5100909 PMid:31153299.
    » http://dx.doi.org/10.1121/1.5100909
  • 10
    Kreiman J, Gerratt BR, Garellek M, Samlan R, Zhang Z. Toward a unified theory of voice production and perception. Loquens. 2014;1(1):e009. http://dx.doi.org/10.3989/loquens.2014.009 PMid:27135054.
    » http://dx.doi.org/10.3989/loquens.2014.009
  • 11
    Zhang Z. Cause-effect relationship between vocal fold physiology and voice production in a three-dimensional phonation model. J Acoust Soc Am. 2016;139(4):1493-507. http://dx.doi.org/10.1121/1.4944754 PMid:27106298.
    » http://dx.doi.org/10.1121/1.4944754
  • 12
    Bickley CA, Stevens KN. Effects of a vocal-tract constriction on the glottal source: experimental and modelling studies. J Phonetics. 1986;14(3-4):373-82. http://dx.doi.org/10.1016/S0095-4470(19)30711-9
    » http://dx.doi.org/10.1016/S0095-4470(19)30711-9
  • 13
    Zhang Z. Effect of vocal fold stiffness on voice production in a three-dimensional body-cover phonation model. J Acoust Soc Am. 2017;142(4):2311-21. http://dx.doi.org/10.1121/1.5008497 PMid:29092586.
    » http://dx.doi.org/10.1121/1.5008497
  • 14
    Samlan RA, Story BH. Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling. J Speech Lang Hear Res. 2011;54(5):1267-83. http://dx.doi.org/10.1044/1092-4388(2011/10-0195) PMid:21498582.
    » http://dx.doi.org/10.1044/1092-4388(2011/10-0195)
  • 15
    Garellek M, Samlan R, Gerratt BR, Kreiman J. Modeling the voice source in terms of spectral slopes. J Acoust Soc Am. 2016;139(3):1404-10. http://dx.doi.org/10.1121/1.4944474 PMid:27036277.
    » http://dx.doi.org/10.1121/1.4944474
  • 16
    van Houtte E, van Lierde K, Claeys S. Pathophysiology and treatment of muscle tension dysphonia: a review of the current knowledge. J Voice. 2011;25(2):202-7. http://dx.doi.org/10.1016/j.jvoice.2009.10.009 PMid:20400263.
    » http://dx.doi.org/10.1016/j.jvoice.2009.10.009
  • 17
    Roy N, Merrill R, Thibeault S, Parsa R, Gray S, Smith EM. Prevalence of voice disorders in teachers and the general population. J Speech Lang Hear Res. 2004;47(2):281-93. http://dx.doi.org/10.1044/1092-4388(2004/023) PMid:15157130.
    » http://dx.doi.org/10.1044/1092-4388(2004/023)
  • 18
    Hillman RE, Stepp CE, van Stan JH, Zañartu M, Mehta DD. An updated theoretical framework for vocal hyperfunction. Am J Speech Lang Pathol. 2020;29(4):2254-60. http://dx.doi.org/10.1044/2020_AJSLP-20-00104 PMid:33007164.
    » http://dx.doi.org/10.1044/2020_AJSLP-20-00104
  • 19
    Pépiot E, Arnold A. Cross-gender differences in English/French bilingual speakers: a multiparametric study. Percept Mot Skills. 2021;128(1):153-77. http://dx.doi.org/10.1177/0031512520973514 PMid:33202192.
    » http://dx.doi.org/10.1177/0031512520973514
  • 20
    Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31(1):67-71. http://dx.doi.org/10.1016/j.jvoice.2016.01.004 PMid:26873420.
    » http://dx.doi.org/10.1016/j.jvoice.2016.01.004
  • 21
    Henton CG, Bladon RAW. Breathiness in normal female speech: inefficiency versus desirability. Lang Commun. 1985;5(3):221-7. http://dx.doi.org/10.1016/0271-5309(85)90012-6
    » http://dx.doi.org/10.1016/0271-5309(85)90012-6
  • 22
    Samuel A, Moraes R, Lopes L. VoxMore: artefato tecnológico para auxiliar a avaliação acústica da voz no processo ensino-aprendizagem e prática clínica. CoDAS. 2023. No prelo.
  • 23
    Hillenbrand J, Cleveland RA, Erickson RL. Acoustic correlates of breathy vocal quality. J Speech Hear Res. 1994;37(4):769-78. http://dx.doi.org/10.1044/jshr.3704.769 PMid:7967562.
    » http://dx.doi.org/10.1044/jshr.3704.769
  • 24
    Hillenbrand J, Houde RA. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res. 1996;39(2):311-21. http://dx.doi.org/10.1044/jshr.3902.311 PMid:8729919.
    » http://dx.doi.org/10.1044/jshr.3902.311
  • 25
    Brockmann-Bauser M, van Stan JH, Carvalho Sampaio M, Bohlender JE, Hillman RE, Mehta DD. Effects of vocal intensity and fundamental frequency on cepstral peak prominence in patients with voice disorders and vocally healthy controls. J Voice. 2021;35(3):411-7. http://dx.doi.org/10.1016/j.jvoice.2019.11.015 PMid:31859213.
    » http://dx.doi.org/10.1016/j.jvoice.2019.11.015
  • 26
    Radish Kumar B, Bhat J, Mukhi P. Vowel harmonic amplitude differences in persons with vocal nodules. J Voice. 2011;25(5):559-61. http://dx.doi.org/10.1016/j.jvoice.2010.06.009 PMid:20926251.
    » http://dx.doi.org/10.1016/j.jvoice.2010.06.009
  • 27
    Narasimhan SV, Vishal K. Spectral measures of hoarseness in persons with hyperfunctional voice disorder. J Voice. 2017;31(1):57-61. http://dx.doi.org/10.1016/j.jvoice.2016.03.005 PMid:27080591.
    » http://dx.doi.org/10.1016/j.jvoice.2016.03.005
  • 28
    Mehta DD, Zaéartu M, Quatieri TF, Deliyski DD, Hillman RE. Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy. J Acoust Soc Am. 2011;130(6):3999-4009. http://dx.doi.org/10.1121/1.3658441 PMid:22225054.
    » http://dx.doi.org/10.1121/1.3658441
  • 29
    Cannito MP, Buder EH, Chorna LB. Spectral amplitude measures of adductor spasmodic dysphonic speech. J Voice. 2005;19(3):391-410. http://dx.doi.org/10.1016/j.jvoice.2004.07.001 PMid:16102666.
    » http://dx.doi.org/10.1016/j.jvoice.2004.07.001
  • 30
    Holmberg EB, Doyle P, Perkell JS, Hammarberg B, Hillman RE. Aerodynamic and acoustic voice measurements of patients with vocal nodules: variation in baseline and changes across voice therapy. J Voice. 2003;17(3):269-82. http://dx.doi.org/10.1067/S0892-1997(03)00076-6 PMid:14513951.
    » http://dx.doi.org/10.1067/S0892-1997(03)00076-6
  • 31
    Aaen M, McGlashan J, Thu KT, Sadolin C. Assessing and quantifying air added to the voice by means of laryngostroboscopic imaging, EGG, and acoustics in vocally trained subjects. J Voice. 2021;35(2):326.e1-11. http://dx.doi.org/10.1016/j.jvoice.2019.09.001 PMid:31628046.
    » http://dx.doi.org/10.1016/j.jvoice.2019.09.001
  • 32
    Zhang Z. Mechanics of human voice production and control. J Acoust Soc Am. 2016;140(4):2614-35. http://dx.doi.org/10.1121/1.4964509 PMid:27794319.
    » http://dx.doi.org/10.1121/1.4964509
  • 33
    Kreiman J, Gerratt BR. Perceptual sensitivity to first harmonic amplitude in the voice source. J Acoust Soc Am. 2010;128(4):2085-9. http://dx.doi.org/10.1121/1.3478784 PMid:20968379.
    » http://dx.doi.org/10.1121/1.3478784

Publication Dates

  • Publication in this collection
    10 Nov 2023
  • Date of issue
    2024

History

  • Received
    16 Dec 2022
  • Accepted
    20 Mar 2023
Sociedade Brasileira de Fonoaudiologia Al. Jaú, 684, 7º andar, 01420-002 São Paulo - SP Brasil, Tel./Fax 55 11 - 3873-4211 - São Paulo - SP - Brazil
E-mail: revista@codas.org.br