Performance of the phonatory deviation diagram in the evaluation of rough and breathy synthesized voices

Lopes, Leonardo Wanderley; Freitas, Jonas Almeida de; Almeida, Anna Alice; Silva, Priscila Oliveira Costa; Alves, Giorvan Ânderson dos Santos

doi:10.1016/j.bjorl.2017.05.012

Abstract

Introduction

Voice disorders alter the sound signal in several ways, combining several types of vocal emission disturbances and noise. The phonatory deviation diagram is a two-dimensional chart that allows the evaluation of the vocal signal based on the combination of periodicity (jitter, shimmer, and correlation coefficient) and noise (Glottal to Noise Excitation) measurements. The use of synthesized signals, where one has a greater control and knowledge of the production conditions, may allow a better understanding of the physiological and acoustic mechanisms underlying the vocal emission and its main perceptual-auditory correlates regarding the intensity of the deviation and types of vocal quality.

Objective

To analyze the performance of the phonatory deviation diagram in the discrimination of the presence and degree of roughness and breathiness in synthesized voices.

Methods

871 synthesized vocal signals were used corresponding to the vowel /ɛ/. The perceptual-auditory analysis of the degree of roughness and breathiness of the synthesized signals was performed using visual analogue scale. Subsequently, the signals were categorized regarding the presence/absence of these parameters based on the visual analogue scale cutoff values. Acoustic analysis was performed by assessing the distribution of vocal signals according to the phonatory deviation diagram area, quadrant, shape, and density. The equality of proportions and the chi-square tests were performed to compare the variables.

Results

Rough and breathy vocal signals were located predominantly outside the normal range and in the lower right quadrant of the phonatory deviation diagram. Voices with higher degrees of roughness and breathiness were located outside the area of normality in the lower right quadrant and had concentrated density.

Conclusion

The normality area and the phonatory deviation diagram quadrant can discriminate healthy voices from rough and breathy ones. Voices with higher degrees of roughness and breathiness are proportionally located outside the area of normality, in the lower right quadrant and with concentrated density.

KEYWORDS
Voice quality; Acoustics; Phonation; Dysphonia; Voice disorders

Resumo

Introdução

Os distúrbios de voz alteram o sinal sonoro de diversas formas, combinam variados tipos de perturbações e ruídos na emissão vocal. O diagrama de desvio fonatório é um gráfico bidimensional que possibilita a avaliação do sinal vocal a partir da combinação de medidas de periodicidade (jitter, shimmer e coeficiente de correlação) e de ruído (Glottal to Noise Excitation). O uso de sinais sintetizados, no qual se tem um maior controle e conhecimento das condições de produção, pode possibilitar uma maior compreensão dos mecanismos fisiológicos e acústicos subjacentes à emissão vocal e seus principais correlatos perceptivo-auditivos quanto à intensidade do desvio e aos tipos de qualidade vocal.

Objetivo

Analisar o desempenho do diagrama de desvio fonatório na discriminação da presença e do grau de rugosidade e soprosidade em vozes sintetizadas.

Método

Foram utilizados 871 sinais vocais sintetizados correspondentes à vogal/ɛ/. Realizou-se a análise perceptivo-auditiva do grau de rugosidade e soprosidade dos sinais sintetizados, com uma escala visual analógica. Posteriormente, os sinais foram categorizados quanto à presença/ausência desses parâmetros a partir dos valores de corte da escala visual analógica. A análise acústica foi realizada por meio da avaliação da distribuição dos sinais vocais de acordo com a área, quadrante, forma e densidade do diagrama de desvio fonatório. Executou-se o teste de igualdade de proporções e o teste qui-quadrado para comparar as variáveis.

Resultados

Sinais vocais rugosos e soprosos localizaram-se predominantemente fora da área de normalidade e no quadrante inferior direito do diagrama de desvio fonatório. Vozes com maiores graus de rugosidade e soprosidade localizaram-se fora da área de normalidade, no quadrante inferior direito e apresentaram densidade concentrada.

Conclusão

A área de normalidade e o quadrante do diagrama de desvio fonatório são capazes de discriminar vozes saudáveis de rugosas e soprosas. Vozes com maior grau de rugosidade e soprosidade localizam-se proporcionalmente fora da área de normalidade, no quadrante inferior-direito e com densidade concentrada.

PALAVRAS-CHAVE
Qualidade da voz; Acústica; Fonação; Disfonia; Distúrbios da voz

Introduction

Traditionally, vocal assessment includes the investigation and integration of perceptual-auditory, laryngeal, aerodynamic, acoustic, and self-assessment data.¹1 Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G, et al. A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Eur Arch Otorhinolaryngol. 2001;258:77-82.^,²2 Ma EP, Yiu EM. Multiparametric evaluation of dysphonic severity. J Voice. 2006;20:380-90. Specifically, perceptual-auditory evaluation and acoustic analysis are the main tools used by the speech therapist/audiologist to characterize the vocal quality deviation observed in voice disorders.³3 Roy N, Barkmeir-Kraemer J, Eadie T, Sivasankar MP, Metha D, Paul D, et al. Evidence-based clinical voice assessment: a systematic review. Am J Speech Lang Pathol. 2013;22:212-26.

Studies in the area of voice disorder evaluation and diagnosis aim to investigate three essential clinical issues³3 Roy N, Barkmeir-Kraemer J, Eadie T, Sivasankar MP, Metha D, Paul D, et al. Evidence-based clinical voice assessment: a systematic review. Am J Speech Lang Pathol. 2013;22:212-26.: the ability of the measure to determine the presence/absence of a voice disorder (diagnosis); the evidence that the test used can determine the origin (etiology) of a voice disorder; and the ability of a measure to determine the extent (intensity) of a voice disorder.

The perceptual-auditory voice assessment includes from the definition of the present deviation intensity to the emission and predominant vocal quality, in case of deviated emissions. The descriptors "roughness", "breathiness" and "tension" are universally used⁴4 Kempster GB, Gerratt BR, Verdolini Abbott K, Barkmeier-Kraemer J, Hillman RE. Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol. Am J Speech Lang Pathol. 2009;18:124-32.^,⁵5 Oates J. Auditory-perceptual evaluation of disordered voice quality: pros, cons and future directions. Folia Phoniatr Logop. 2009;61:49-56. to characterize dysphonic voices, showing a correlation in the physiological and acoustic planes.⁶6 Parsa V, Jamieson DG. Identification of pathological voices using glottal noise measures. J Speech Lang Hear Res. 2000;43:469-85.

7 Bhuta T, Patrick L, Garnett J. Perceptual evaluation of voice quality and its correlation with acoustic measurements. J Voice. 2004;18:299-304.^-⁸8 Lopes LW, Cavalcante DP, Costa PO. Intensidade do desvio vocal: integração de dados perceptivo-auditivos e acústicos em pacientes disfônicos. CoDAS. 2014;26:382-8. However, the roughness and breathiness parameters are considered more robust, whereas tension is a less reliable quality with great inter-rater variability, which justifies its omission in some perceptual-auditory evaluation protocols.⁹9 Dejonckere P, Schoentgen J, Giordano A, Fraj S, Bocchi L, Manfredi C. Validity of jitter measures in non-quasi-periodic voices. Part I: Perceptual and computer performances in cycle pattern recognition. Logoped Phoniatr Vocol. 2011;36:70-7.^,¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7.

The acoustic analysis corresponds to the sound signal recording, which is the complex product of the non-linear interaction of the biomechanical and aerodynamic properties of the vocal production system.⁸8 Lopes LW, Cavalcante DP, Costa PO. Intensidade do desvio vocal: integração de dados perceptivo-auditivos e acústicos em pacientes disfônicos. CoDAS. 2014;26:382-8. It provides an indirect estimate of the vibratory patterns of the vocal folds, the vocal tract, and its different adjustments, contributing to the task of vocal quality analysis and classification.¹¹11 Nayak J, Bhat PS, Acharya R, Aithal UV. Classification and analysis of speech abnormalities. ITBM-RBM. 2005;26:319-27.

12 Awan S, Lawson L. The effect of anchor modality on the reliability of vocal severity ratings. J Voice. 2009;23:341-52.

13 Dejonckere PH, Giordano A, Schoentgen J, Frag S, Bocchi L, Manfredi C. To what degree of voice perturbation are jitter measurements valid? A novel approach with synthesized vowels and visuo-perceptual pattern recognition. Comput Methods Programs Biomed J. 2012;7:37-42.^-¹⁴14 Lowell SY. The acoustic assessment of voice in continuous speech. SIG 3 Perspect Voice Voice Dis. 2012;22:57-63.

Jitter and shimmer are among the main acoustic measures based on linear models of vocal production and used in the clinical context.¹⁵15 Brockmann-Bausser M, Drinnan MJ. Routine acoustic voice analysis: time to think again?. Curr Opin Otolaryngol Head Neck Surg. 2011;19:165-70. These are measures that analyze the fundamental frequency disturbance index, that is, the control of vocal fold vibrations (jitter), and the amplitude disturbance index, which is related to glottic resistance (shimmer).¹⁶16 Brockmann M, Drinnan MJ, Storck C, Carding PN. Reliable jitter and shimmer measurements in voice clinics: the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task. J Voice. 2011;25:44-53.^,¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.

In addition to disturbance measures, noise measurements such as Glottal to Noise Excitation (GNE) and Harmonic-Noise Ratio (HNR) are also widely used in the clinical context,⁸8 Lopes LW, Cavalcante DP, Costa PO. Intensidade do desvio vocal: integração de dados perceptivo-auditivos e acústicos em pacientes disfônicos. CoDAS. 2014;26:382-8.^,¹⁸18 Awan SN, Frenkel ML. Improvements in estimating the harmonic-to-noise ratio of the voice. J Voice. 1994;8:255-62.^,¹⁹19 Godino LJL, Osma RV, Sáenz LN, Vilda GP, Blanco VM, Cruz RF. The effectiveness of the glottal to noise excitation ratio for the screening of voice disorders. J Voice. 2010;24:47-56. as they demonstrate whether the vocal signal originates from vocal fold vibrations or the presented air current (GNE), as well as of the regular signal of the vocal folds in relation to the irregular signal of the vocal folds and the vocal tract, correlating the harmonic noise versus the wave noise component (HNR).¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.^,¹⁹19 Godino LJL, Osma RV, Sáenz LN, Vilda GP, Blanco VM, Cruz RF. The effectiveness of the glottal to noise excitation ratio for the screening of voice disorders. J Voice. 2010;24:47-56.^,²⁰20 Michaelis D, Gramss T, Strube HW. Glottal-to-noise excitation ratio - a new measure for describing pathological voices. Acta Acust. 1997;83:700-6.

In general, a deviant emission tends to combine different components of noise and disturbance, so that studies using combined measures may better represent the auditorily perceived vocal quality deviation.⁸8 Lopes LW, Cavalcante DP, Costa PO. Intensidade do desvio vocal: integração de dados perceptivo-auditivos e acústicos em pacientes disfônicos. CoDAS. 2014;26:382-8.^,¹⁶16 Brockmann M, Drinnan MJ, Storck C, Carding PN. Reliable jitter and shimmer measurements in voice clinics: the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task. J Voice. 2011;25:44-53.^,^20-2320 Michaelis D, Gramss T, Strube HW. Glottal-to-noise excitation ratio - a new measure for describing pathological voices. Acta Acust. 1997;83:700-6.

In this context, the Phonatory Deviation Diagram (PDD) or hoarseness diagram (in its original version)²⁴24 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice quality description: case studies for different regions of the hoarseness diagram. In: Wittenberg T, Mergell P, Tigges M, Eysholdt U, editors. Advances in quantitative laryngoscopy. 2nd ed. Erlangen: Abteilung Phoniatrie, Universitäts-HNO-Klinik Göttin-gen; 1997. p. 143-50.

25 Michaelis D, Frohlich M, Stube HW. Selection and combination of acoustic features for the description of pathologic voices. J Acoust Soc Am. 1998;103:1628-40.^-²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20. offers the possibility of the combined analysis of disturbance measurements (jitter, shimmer, and correlation) and noise (GNE), making it an important tool for the evaluation and monitoring of voice disorders.¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.^,^27-3027 Behlau M, Oliveira G, Pontes P. Vocal fold self-disruption after phonotrauma on a lead actor: a case presentation. J Voice. 2009;23:726-32.

One of the great challenges of vocal assessment is the integrated analysis of data, which includes the acoustic and perceptual-auditory information.³¹31 Kreiman J, Gerrat B, Ito M. When and why listeners disagree in voice quality assessment tasks. J Acoust Soc Am. 2007;122:2354-64. One of the possible solutions suggested for a better understanding of the associations between the acoustic and perceptual phenomena related to the vocal signal is the development of researches with voices generated by synthesizers.³²32 Englert M, Madazio G, Gielow I, Lucero J, Behlau M. Perceptual error identification of human and synthesized voices. J Voice. 2016;30:e17-23.

Synthesized voices have highly controlled and known acoustic properties and production conditions, which contributes to the understanding of the mechanisms underlying the auditorily perceived vocal quality deviation. Synthesizers simulate vocal production deviations such as roughness, breathiness, and tension, from the manipulation of disturbance parameters, noise, and tension/symmetry differences between the vocal folds, respectively.³³33 Lucero JC, Schoentgen J, Behlau M. Physics-based synthesis of disordered voices. In: Proceedings of Interspeech 14th Annual Conference of the International Speech Communication Association. 2013. p. 25-9.

Therefore, considering that the identification of the presence and degree of roughness and breathiness are part of the clinical vocal evaluation routine, that PDD is an important tool in the evaluation and monitoring of voice disorders, and that the use of synthesized signals allows greater control of the stimulus and can elucidate conditions underlying the perceived deviation, the aim of this research is to analyze the performance of PDD in the discrimination of the presence and degree of roughness and breathiness in synthesized voices.

For this purpose, two hypotheses were raised: (1) there are differences in the PDD parameters regarding the identification of voices with and without roughness and breathiness; (2) there are differences in the PDD parameters regarding the identification of signals with different degrees of roughness and breathiness.

Methods

Study design

This is a documented descriptive, and cross-sectional study carried out at the Voice Laboratory of the Speech Therapy and Audiology Department of a university. It was evaluated and approved by the Research Ethics Committee of the institution, under Opinion n. 508200/2013.

Sample

This study used a set of synthesized voices developed by the VoiceSim synthesizer.³³33 Lucero JC, Schoentgen J, Behlau M. Physics-based synthesis of disordered voices. In: Proceedings of Interspeech 14th Annual Conference of the International Speech Communication Association. 2013. p. 25-9. The synthesizer consists of a computer system containing a vocal fold model and a representation of the vocal tract in the format of concatenated tubes, through which an acoustic wave propagates.³²32 Englert M, Madazio G, Gielow I, Lucero J, Behlau M. Perceptual error identification of human and synthesized voices. J Voice. 2016;30:e17-23.

Vocal deviations of roughness and breathiness were produced from the manipulation of acoustic parameters of fundamental frequency disturbance (flutter, tremor, and wow), additive noise and tension asymmetry between the vocal folds.³³33 Lucero JC, Schoentgen J, Behlau M. Physics-based synthesis of disordered voices. In: Proceedings of Interspeech 14th Annual Conference of the International Speech Communication Association. 2013. p. 25-9.

Roughness was generated by manipulating the duration of the cycle of glottic excitation and jitter, with the introduction of a stochastic disturbance in the vocal fold tissue tension, using the formula: ΔK = αεK; where /α/ is a scale parameter, /ɛ/ is a random variable, and /K/ is a coefficient of vocal fold stiffness.

Breathiness was generated with the insertion of additive noise, according to the formula: Δµ = bεµ where /µ/ is the glottal airflow rate, /b/ is a scale parameter, and /ɛ/ is a random variable, similar to jitter.

The tension asymmetry parameters between the vocal folds, subglottic pressure and vocal fold separation were also controlled during the production of these synthesized signals. For more details on the synthesizer, please refer to the available literature.³³33 Lucero JC, Schoentgen J, Behlau M. Physics-based synthesis of disordered voices. In: Proceedings of Interspeech 14th Annual Conference of the International Speech Communication Association. 2013. p. 25-9.

The speech material of the synthesized stimuli was the vowel /ɛ/ sustained for 3 s. This vowel was chosen because it is commonly used in vocal and laryngeal evaluation procedures in Brazil,³⁴34 Gonçalves MIR, Pontes PAL, Vieira VP, Pontes AAL, Curcio D, De Biase NG. Transfer function of Brazilian Portuguese oral vowels: a comparative acoustic analysis. Braz J Ororhinolaryngol. 2009;75:680-4. also considering that it is an oral, medium, open, and unrounded vowel, considered the most medium vowel of Brazilian Portuguese,³⁴34 Gonçalves MIR, Pontes PAL, Vieira VP, Pontes AAL, Curcio D, De Biase NG. Transfer function of Brazilian Portuguese oral vowels: a comparative acoustic analysis. Braz J Ororhinolaryngol. 2009;75:680-4. which allows a more neutral and intermediate position of the vocal tract.

Therefore, 871 synthesized vocal signals were used, of which 426 (48.8%) were female and 446 (51.2%) were male signals, with different combinations of the previously mentioned acoustic parameters.

Procedures

The acoustic analysis was performed using the VoxMetria software, version 4.5 h, by CTS Informática (Pato Branco, Paraná, Brazil), in the vocal quality module. The PDD was used for this evaluation, in order to analyze the distribution of vocal signals according to area, quadrant, shape, and density.

Regarding the area, the software itself indicates whether the vocal signal is inside or outside the normal range (Fig. 1).

Figure 1
Vocal signals inside (dots in blue color) and outside (dots in green color) the PDD normal area.

As for the quadrants, the PDD was divided into four equal quadrants¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.: lower left (1), lower right (2), upper right (3) and upper left (4) (Fig. 2).

Figure 2
Division of PDD in quadrants.

Regarding the distribution of the points in relation to density (Figs. 3 and 4), the points concerning the distribution of the vocal signals were classified as concentrated, when the points were distributed inside a space corresponding to one square, or amplified, when the points were distributed throughout the space corresponding to more than one square of the PDD.

Figure 3
Vocal sample with density concentrated on PDD.

Figure 4
Vocal sample with density amplified in PDD.

The shape classification was performed using a simple 10-cm ruler on the printed sheet of each PDD generated by the software, corresponding to the image of each analyzed vocal signal, with no previous knowledge of the vocal deviation intensity and the predominant voice type.

The points concerning the distribution of vocal signals were categorized as vertical, when the distance between the points along the abscissa was lower than along the ordinate (X < Y); horizontal, when the distance between the points along the abscissa was higher along the ordinate (X > Y); and circular when the distance between the points along the ordinate and the abscissa was approximately the same (X ≅ Y).¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.

The perceptual-auditory evaluation session took place in a quiet environment and was performed by a speech therapist/audiologist who was also a voice specialist with more than 10 years of experience in this task.

The evaluator was instructed that voices should be considered normal when they were socially acceptable, naturally produced, without any irregularity, noise, or effort observable during the emission. The evaluator was also instructed that roughness would correspond to the presence of vibratory irregularity and breathiness would be associated with audible air escape during the emission. The evaluator was trained with anchor stimuli, containing normal emissions, and deviated ones at different degrees, as well as predominantly rough and breathy voices. Moreover, the evaluator was instructed about the cutoff values that would be used in this study,¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7. to categorize voices regarding the absence and presence of roughness and breathiness.

For the assessment, the evaluator used a Visual Analogue Scale (VAS), with a metric scale of 0-100 mm, evaluating the intensity of vocal deviation (GD, general degree) and the roughness degree (RD) and breathiness degree (BD). The evaluation closest to 0 represents less vocal deviation, and the closer to 100, the greater the deviations.

For the assessment, each emission of the sustained vowel was presented three times through a speaker, at a comfortable intensity self-reported by the evaluator. At the end of the perceptual assessment session, 10% of the samples (88 signals) were randomly repeated for the evaluator's reliability analysis, using Cohen's Kappa Coefficient. The Kappa value was 0.88, indicating excellent reliability of the evaluator.³⁵35 Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159-75.

In the current literature,¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7.^,³⁶36 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31:67-71. distinct cutoff values are found for GD,³⁶36 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31:67-71. RD¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7. and BD,¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7. used to categorize both the presence/absence of vocal deviation, and to classify the degree of the present deviation. Therefore, considering that the aim of this study is to investigate the performance of the PDD in the discrimination of the presence and degree of roughness and breathiness in synthesized voices, it was decided to use the cutoff values established for the classification of roughness and breathiness parameters.¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7.

For RD, the following cutoff points are considered¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7.: absence of roughness or Grade 0 (0-8.5 mm), mild roughness or Grade 1 (8.6-28.5 mm), moderate roughness or Grade 2 (28.6-59.5) and intense roughness or Grade 3 (≥59.6 mm). In relation to BD, the following cutoff points were recommended: no breathiness or Grade 0 (0-8.5 mm), mild breathiness or Grade 1 (8.6-33.5 mm), moderate breathiness or Grade 2 (33.6-52.0 mm) and intense breathiness or Grade 3 (≥ 52.1 mm).

Thus, a correspondence was made between the VAS used for RD and BD and the numerical scale,¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7. as described below:

Grade 0: RD and BD ≤ 8.4 mm;
Grade 1: 8.5 mm ≤ RD ≤ 28.4 mm and 8.5 ≤ BD ≤ 33.4 mm;
Grade 2: 28.5 mm ≤ RD ≤ 59.4 mm and 33.5 mm ≤ BD ≤ 52.4 mm;
Grade 3: RD ≥ 59.5 mm and BD ≥ 52.5 mm.

The 8.4 mm cutoff was also used to categorize the voices regarding the presence or absence of roughness and breathiness.¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7. Voices with values >8.4 mm in RD and BD were considered as having the presence of roughness and breathiness in vocal emissions, respectively.

We chose not to analyze the tension parameter, since other studies have already shown that such characteristic is not specifically identified in the PDD,¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.^,²⁹29 Lopes LW, Lima ILB, Azevedo EHM, Silva MFBL, Silva POC. Análise acústica de vozes infantis: contribuições do Diagrama de Desvio Fonatório. Rev CEFAC. 2015;17:1173-83. in addition to the lack of consensus regarding the inclusion of this parameter in the perceptual-auditory evaluation protocols.¹1 Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G, et al. A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Eur Arch Otorhinolaryngol. 2001;258:77-82.^,¹⁰10 Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7.

The GD evaluation³⁶36 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31:67-71. was not used for signal categorization, but only for the sample characterization in the present study.

Therefore, based on the results of the perceptual-auditory analysis of the RD and BD, the following classification was observed:

As for the presence of roughness: 128 (14.7%) signals without roughness (RD ≤ 8.4 mm) and 743 (85.3%) with roughness (RD ≥ 8.5 mm) (Table 1).

Thumbnail

Table 1
Distribution of vocal signals regarding the presence and degree of roughness and breathiness.
As for the presence of breathiness: 365 (41.9%) signals without breathiness (BD ≤ 8.4 mm) and 506 (58.1%) with breathiness (BD ≥ 8.5 mm) (Table 1).

It is worth mentioning that a categorical analysis of the vocal quality predominant in the emission was not performed, but a same vocal signal could show roughness and breathiness components, since the criterion for the allocation of signals regarding the presence/absence of these components was the result of the independent evaluation of each of them through the VAS (RD and BD) and of the cutoffs established for these parameters (Table 2).

Thumbnail

Table 2
Comparison of the distribution frequency of the synthesized voices with and without roughness depending on the PDD area, density, quadrant, and shape.

Data analysis

The statistical analysis was descriptive for all the assessed variables and Fisher's exact test and Chi-square test (x ²) were used to compare the analysis of variables related to perceptual-auditory (presence and degree of roughness and breathiness) and acoustic measures (area, density, shape, and quadrant of the PDD). The Kruskal-Wallis test was used to compare the acoustic measurements according to the degree of roughness and breathiness. The level of significance was set at 5% for all analyses. The software used was the Statistical Package for Social Sciences (SPSS, version 21.0).

Results

Initially, the distribution frequency of the synthesized voices with and without roughness was compared according to the area, density, quadrant, and shape of the PDD (Table 2). A difference was observed between the signals with and without roughness as a function of the PDD area and quadrant (Table 2). The vocal signals with roughness were found to be proportionally outside the area of normal PDD and in the lower right quadrant. There was no statistically significant difference regarding the distribution of the signals with and without roughness as a function of the density and shape of the PDD points.

Subsequently, the distribution of signals with and without breathiness was compared as a function of the PDD parameters. There was a difference in the proportion of these signals regarding the PDD area, density, and quadrant. The breathy voices were predominantly outside the normal range and in the lower right quadrant (Table 3).

Thumbnail

Table 3
Comparison of the distribution frequency of synthesized voices with and without breathiness as a function of PDD area, density, quadrant, and shape.

When comparing the distribution frequency of the voices with different degrees of roughness according to the PDD parameters, a difference in the distribution of the signals was observed in relation to all PDD parameters (Table 4). Voices with a higher degree of roughness were proportionally outside the area of normality, in the lower right quadrant and showed concentrated density in relation to voices with lower degrees of roughness. As for the shape, although a difference was found between the proportions of the groups, there was no distribution pattern of the signals with different degrees of roughness in a specific shape, since the signals predominantly showed the horizontal shape in all grades.

Thumbnail

Table 4
Comparison of the distribution frequency of voices with different degrees of roughness depending on the PDD area, density, quadrant, and shape.

Regarding the degree of breathiness, there was a difference in the distribution of the signals as a function of the PDD area, density, and quadrant parameters (Table 5). Voices with higher degrees of breathiness were proportionally more often outside the area of normality, showed more concentrated density and were in the lower right quadrant, in relation to the signals with lower degrees of breathiness.

Thumbnail

Table 5
Comparison of the frequency of voice distribution with different degrees of breathability depending on the PDD area, density, quadrant, and shape.

Discussion

This study analyzed the performance of the PDD in the discrimination of the presence and degree of roughness and breathiness in synthesized voices. This section was organized with the purpose of clarifying the conclusions of the study according to the raised hypotheses. Didactically, it was decided to analyze the components of roughness and breathiness in subsections.

PDD performance in the evaluation of the presence and degree of roughness

This study showed that the PDD area and quadrant were able to discriminate between normal signals and signals with roughness. Voices with roughness were predominantly located outside the area of normality and in the lower right quadrant.

Previous studies, carried out with adults'¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11. and children's voices,²⁹29 Lopes LW, Lima ILB, Azevedo EHM, Silva MFBL, Silva POC. Análise acústica de vozes infantis: contribuições do Diagrama de Desvio Fonatório. Rev CEFAC. 2015;17:1173-83. corroborate the findings obtained in the present study. Both the lower right quadrant and the PDD area were important to discriminate voices with presence and absence of roughness, showing these two parameters are robust and reliable to evaluate roughness in dysphonic and non-dysphonic voices.

The PDD evaluates signal irregularity in its horizontal position, being associated to the concept of roughness.²⁴24 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice quality description: case studies for different regions of the hoarseness diagram. In: Wittenberg T, Mergell P, Tigges M, Eysholdt U, editors. Advances in quantitative laryngoscopy. 2nd ed. Erlangen: Abteilung Phoniatrie, Universitäts-HNO-Klinik Göttin-gen; 1997. p. 143-50.^,²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20. The greater the irregularity of the vocal signal, the greater its displacement from left to right in the chart. This fact justifies the location of rough voices outside the area of normality and in the lower right quadrant, both in the present study and in previous ones.¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.^,²⁹29 Lopes LW, Lima ILB, Azevedo EHM, Silva MFBL, Silva POC. Análise acústica de vozes infantis: contribuições do Diagrama de Desvio Fonatório. Rev CEFAC. 2015;17:1173-83.

Additionally, it is emphasized that roughness is one of the universal parameters of the perceptual-auditory evaluation of vocal quality, representing an important characteristic in the identification of the presence of vocal or laryngeal alterations.³⁷37 Connor NP, Cohen SB, Theis SM, Thibeault SL, Heatley DG, Bless DM. Attitudes of children with dysphonia. J Voice. 2008;22:197-209.

Roughness is commonly related to the presence of structural and/or functional alterations in the larynx, such as is seen in cases of edema, vascular dysgenesis, nodular lesions, polyps, or any other component that generates a mass increase in the membranous portion of the vocal folds³⁸38 McAllister A, Sederholm E, Perceptual Sundberg J. acoustic analysis of vocal registers in 10-year-old children. Logoped Phoniatr Vocol. 2000;25:63-71. and, consequently, irregularities in the vocal fold vibratory pattern. In the acoustic plane, roughness is associated to the jitter and shimmer parameters.¹⁹19 Godino LJL, Osma RV, Sáenz LN, Vilda GP, Blanco VM, Cruz RF. The effectiveness of the glottal to noise excitation ratio for the screening of voice disorders. J Voice. 2010;24:47-56.

As for the distribution of voices with different degrees of roughness in the PDD, it was verified that vocal signals with a greater roughness component were proportionally outside the area of normality and in the lower right quadrant. Regarding density, signals with moderate and intense deviation predominantly showed concentrated density.

It is noteworthy that 35.93% (n = 46) of the synthesized voices without roughness were outside the area of normality, whereas 12.10% (n = 31) of the voices with mild-to-moderate degree of roughness were inside the area of normality, that is, the PDD showed a greater confounding factor in the identification of voices without roughness, with a slight deviation in relation to the signals with a higher degree of roughness (moderate and intense).

In traditional models, with the use of algorithms that extract isolated jitter and shimmer measurements, an inverse behavior is observed, as the use of these isolated measures is less reliable in the evaluation of more deviant voices.¹⁵15 Brockmann-Bausser M, Drinnan MJ. Routine acoustic voice analysis: time to think again?. Curr Opin Otolaryngol Head Neck Surg. 2011;19:165-70.^,¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.^,²⁰20 Michaelis D, Gramss T, Strube HW. Glottal-to-noise excitation ratio - a new measure for describing pathological voices. Acta Acust. 1997;83:700-6.^,²⁴24 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice quality description: case studies for different regions of the hoarseness diagram. In: Wittenberg T, Mergell P, Tigges M, Eysholdt U, editors. Advances in quantitative laryngoscopy. 2nd ed. Erlangen: Abteilung Phoniatrie, Universitäts-HNO-Klinik Göttin-gen; 1997. p. 143-50.^,²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20.^,^39-4139 Titze IR. Workshop on acoustic voice analysis: summary statement. Nat Cen Voice Speech; 1995.

Regarding density, few studies¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.^,²⁸28 Pifaia LR, Madazio G, Behlau M. Diagrama de desvio fonatório e análise perceptivo-auditiva pré e pós-terapia vocal. CoDAS. 2013;25:140-7.^,²⁹29 Lopes LW, Lima ILB, Azevedo EHM, Silva MFBL, Silva POC. Análise acústica de vozes infantis: contribuições do Diagrama de Desvio Fonatório. Rev CEFAC. 2015;17:1173-83. specifically included this parameter for PDD analysis and none of them investigated the distribution of voices with different degrees of roughness as a function of PDD density. Only one of these studies¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11. showed a difference in the distribution of signals with and without vocal deviation regarding density, with the deviated signals characterized as having amplified density.

In other studies where PDD was used,²⁰20 Michaelis D, Gramss T, Strube HW. Glottal-to-noise excitation ratio - a new measure for describing pathological voices. Acta Acust. 1997;83:700-6.^,²⁴24 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice quality description: case studies for different regions of the hoarseness diagram. In: Wittenberg T, Mergell P, Tigges M, Eysholdt U, editors. Advances in quantitative laryngoscopy. 2nd ed. Erlangen: Abteilung Phoniatrie, Universitäts-HNO-Klinik Göttin-gen; 1997. p. 143-50.^,²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20.^,^40-4240 Olthoff A, Mrugalla S, Lasjawi R, Frolich M, Stuermer I, Kruse E, et al. Assessment of irregular voices after total and laser surgical partial laryngectomy. Arch Otolaryngol Head Neck Surg. 2003;129:994-9. the density parameter can be inferred from the distance between the points only on the abscissa axis, being associated with signals with amplified or concentrated density, respectively. All these studies were longitudinal ones and produced a tendency for less dispersion of the points on the post-intervention abscissa axis, although there is great individual variability in this parameter throughout the treatment,²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20. with significant differences being observed only between pre- and post-treatment conditions.

This study showed greater variability in the distribution of the signals without a roughness component or with a mild-to-moderate degree of roughness between the concentrated and amplified densities. This fact confirms the good performance of the PDD in analyzing signals with a wide range of deviation and its reliability in the assessment of the most deviant signals. Additionally, it can be inferred that the PDD density parameter seems to be more robust to qualitatively analyze the patient's evolution regarding the roughness component in vocal emission.

Regarding the shape, although a statistical significance was verified, a distribution pattern of the signals with different degrees of roughness as a function of this PDD parameter was not observed. In all grades, the voices were predominantly horizontal, with differences being observed only between the proportions of the groups. This finding corroborates the literature, as there is a tendency for the signals to show a predominance of the dispersion of the points in the horizontal dimension, regardless of the presence and degree of vocal deviation.²⁰20 Michaelis D, Gramss T, Strube HW. Glottal-to-noise excitation ratio - a new measure for describing pathological voices. Acta Acust. 1997;83:700-6.^,²⁴24 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice quality description: case studies for different regions of the hoarseness diagram. In: Wittenberg T, Mergell P, Tigges M, Eysholdt U, editors. Advances in quantitative laryngoscopy. 2nd ed. Erlangen: Abteilung Phoniatrie, Universitäts-HNO-Klinik Göttin-gen; 1997. p. 143-50.^,²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20.^,^40-4240 Olthoff A, Mrugalla S, Lasjawi R, Frolich M, Stuermer I, Kruse E, et al. Assessment of irregular voices after total and laser surgical partial laryngectomy. Arch Otolaryngol Head Neck Surg. 2003;129:994-9.

Even in the original proposal for the classification of vocal signals as a function of the PDD shape, no significant difference was observed between healthy and deviant signals, as well as between different degrees of deviation and between rough, breathy, and tense voices.¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11. Therefore, the shape of the points distributed in the PDD does not seem to be a robust parameter for signal differentiation.

PDD performance in the evaluation of the presence and degree of breathiness

When comparing the distribution of vocal signals with and without breathiness as a function of the PDD parameters, it was observed that area and quadrant were able to discriminate normal vocal signals from breathy ones. Breathy vocal signals were outside the normal range and were predominantly located in the lower right quadrant.

Breathiness is among the universally accepted parameters for the perceptual-auditory evaluation of vocal quality and for the characterization of a dysphonic voice.⁴4 Kempster GB, Gerratt BR, Verdolini Abbott K, Barkmeier-Kraemer J, Hillman RE. Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol. Am J Speech Lang Pathol. 2009;18:124-32.^,⁸8 Lopes LW, Cavalcante DP, Costa PO. Intensidade do desvio vocal: integração de dados perceptivo-auditivos e acústicos em pacientes disfônicos. CoDAS. 2014;26:382-8.^,³⁷37 Connor NP, Cohen SB, Theis SM, Thibeault SL, Heatley DG, Bless DM. Attitudes of children with dysphonia. J Voice. 2008;22:197-209. Thus, the fact that the PDD correctly identifies the breathy signals outside the area of normality reinforces its usefulness in the clinical context of vocal assessment.

However, it was observed that the PDD area and quadrant parameters showed identical behavior, in both rough and breathy voices. The vocal signals with roughness and breathiness were found outside the area of normality and in the lower right quadrant. Therefore, one can discuss the interrelationships of these two parameters in physiological and perceptual terms.

The presence of breathiness is physiologically associated with a higher degree of separation between the vocal processes, lower convexity of the free edge of the vocal folds and the shorter time of the closed phase of the glottic cycles⁴³43 Samlam RA, Story BH, Bunton K. Relation of perceived breathiness to laryngeal kinematics and acoustic measures based on computacional modeling. J Speech Lang Hear Res. 2013;56:1209-23. In turn, vocal folds that are further away from the midline tend to vibrate with greater irregularity and less amplitude of the mucosal wave,⁴⁴44 Lucero JC, Koening LL. Simulations of temporal patterns of oral airflow in men and women using two-mass model of the vocal flods under dynamic control. J Acoust Soc Am. 2005;117:1362-72. which, consequently, generates the roughness component in the emission.³⁷37 Connor NP, Cohen SB, Theis SM, Thibeault SL, Heatley DG, Bless DM. Attitudes of children with dysphonia. J Voice. 2008;22:197-209.

Therefore, considering that the signals with roughness and breathiness showed, in general, moderate deviation, with GD of 62.19 ± 14.80 and 65.28 ± 14.75 points in the VAS,³⁶36 Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31:67-71. respectively, one understands the similar distribution of signals with roughness and breathiness in the PDD area and quadrant.

Although the synthesizer used to generate the signals in this study allows the creation of voices with isolated components of roughness (disturbance) and breathiness (additive noise), this separation was not used in the present study. We suggest further investigations with separation of the exclusively rough and breathy signals to assess the performance of the PDD in this classification.

In other studies,¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.^,²⁹29 Lopes LW, Lima ILB, Azevedo EHM, Silva MFBL, Silva POC. Análise acústica de vozes infantis: contribuições do Diagrama de Desvio Fonatório. Rev CEFAC. 2015;17:1173-83. the breathy voices were located outside the area of normality, but were distributed between the lower right and upper right quadrants. Some methodological issues need to be highlighted to evidence the similar distribution of the rough and breathy voices in the lower right quadrant in this study.

The two aforementioned studies¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.^,²⁹29 Lopes LW, Lima ILB, Azevedo EHM, Silva MFBL, Silva POC. Análise acústica de vozes infantis: contribuições do Diagrama de Desvio Fonatório. Rev CEFAC. 2015;17:1173-83. used as a criterion to classify the voices as rough, breathy, or tense, a forced choice task, in which the evaluator, if he/she considered the emission deviant, should determine the predominant vocal quality. This type of evaluation task allows only one possibility of choice for each emission and not necessarily a classification regarding the presence/absence of each deviated parameter in the emission.

In turn, the present study evaluated the degree of roughness and breathiness present in the emission through a VAS. Based on the cut-off values, the presence/absence of such components was established, with the possibility that the same signal would concomitantly show the presence of one or more of them, which is close to the usual conditions of deviant vocal production.

Another finding of this research is the high percentage of voices without breathiness (75.40%; n = 276) classified outside the normal range of the PDD. In a qualitative data analysis, it can be observed that the GD of deviation of these signals is 53.35 ± 16.49. Therefore, although these signals did not show auditory-perceived breathiness, they were probably evaluated as deviated in the VAS due to the presence of roughness in the emission.

When comparing the results regarding the proportion of voices with presence/absence of roughness and presence/absence of breathiness identified inside and outside the PDD normality area, it is observed that there is a greater identification of voices without roughness within the area of normality (64.07%, n = 82) and a greater identification of voices without the breathiness component outside the normality area (75.35%; n = 275).

Qualitatively, a difference of more than 20 points was found regarding the VAS GD between voices without roughness and without breathiness, with higher GD values in the latter group. This difference in itself would justify the results regarding the higher proportion of signals without the breathiness component identified outside the normal range.

These findings reinforce that, even in conditions where the perceptual-auditory evaluation criteria used to classify the signals were not intended to maximize the differences between them, but to evaluate them over a continuum, the PDD was also efficient for vocal evaluation, mainly in relation to the most deviant signals.

It is suggested that other studies be carried out using the same methodology and criteria of perceptual-auditory evaluation used in this study, adding to them the criterion that the signals selected for investigation have only one of the components deviated from the cutoff values of the VAS.

Regarding the degree of breathiness, there was a difference in the distribution of the signals as a function of the PDD area, density, and quadrants. It was observed that the higher the degree of breathiness, the greater the proportion of signals located outside the area of normality, in the lower right quadrant and with concentrated density. Therefore, it is verified that the greater the breathiness component in the vocal signal, the greater the capacity of the PDD to correctly identify the presence of the deviation.

As previously mentioned, such finding regarding the classification of signals with higher degree of deviation constitutes one of the greatest advantages of the PDD, as it fills an existing gap¹⁵15 Brockmann-Bausser M, Drinnan MJ. Routine acoustic voice analysis: time to think again?. Curr Opin Otolaryngol Head Neck Surg. 2011;19:165-70. regarding the use and reliability of traditional measures of disturbance and noise in the evaluation of voices with moderate and intense deviations.

Once again, a similar distribution of the voices with different degrees of roughness and breathiness was observed as a function of the area, quadrant, and density of the PDD. The only difference between the voices with different degrees of roughness and breathiness is the distribution of the signals with Grade 2, in which there was a higher level of correct identification of the group of voices without roughness within the PDD normality area. This fact has already been discussed in this section.

The vertical axis of the PDD evaluates the presence of additive noise in the vocal signal, compatible with the presence of the breathiness component.²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20. Therefore, it was expected that the higher the breathiness component in the emission, the greater the proportion of signals toward the upper left quadrant.

In the study¹⁷17 Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11. with voices of dysphonic adults, it was observed that breathy voices, although they were predominantly distributed in the upper left quadrant (52.6%; n = 30); 19.3% (n = 11) were also situated in the lower right quadrant. With the pediatric population,²⁹29 Lopes LW, Lima ILB, Azevedo EHM, Silva MFBL, Silva POC. Análise acústica de vozes infantis: contribuições do Diagrama de Desvio Fonatório. Rev CEFAC. 2015;17:1173-83. breathy voices were distributed in the lower right (35%, n = 7), lower left (30%, n = 6), upper right (30%, n = 6) and upper left (5%, n = 1) quadrants.

In studies²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20.^,⁴¹41 Harnisch W, Brosch S, Schmidt M, Hagen R. Breathing and voice quality after surgical treatment for bilateral vocal cord paralysis. Arch Otolaryngol Head Neck Surg. 2008;134:278-84. with patients presenting with unilateral vocal fold paralysis²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20. and individuals with bilateral vocal fold paralysis,²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20.^,⁴¹41 Harnisch W, Brosch S, Schmidt M, Hagen R. Breathing and voice quality after surgical treatment for bilateral vocal cord paralysis. Arch Otolaryngol Head Neck Surg. 2008;134:278-84. it was found that only the second group, whose patients showed intense breathiness, had their voices located in the upper right quadrant. In turn, individuals with unilateral paralysis had their voices distributed between the lower left and lower right quadrants.²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20.

In general, in high lesions of the vagus nerve, the vocal folds are more distant from the midline and the vocal emission does not originate from the glottic vibration mechanism, but comes primarily from the turbulent transglottic airflow and its propagation in the vocal tract,⁴⁵45 Sprecher A, Olszewski A, Jiang JJ, Zhang Y. Updating signal typing in voice: addition of type 4 signals. J Acoust Soc Am. 2010;127:3710-6.^,⁴⁶46 Little MA, Costello DAE, Harries ML. Objective dysphonia quantification in vocal fold paralysis: comparing nonlinear with classical measures. J Voice. 2011;25:21-31. which would justify the presence of these signals in the upper right quadrant.²⁶26 Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20.

In the present study, only nine signals were classified as having severe breathiness deviation, and of these, only one (11.11%) was in the upper right quadrant. In this way, two points can be highlighted: first, the sample size, since a different result could have been observed in this distribution with a larger sample of breathy voices with intense deviations; second, as already emphasized in the discussion, there is an overlap of the type of vocal deviation in the assessed signals, since the presence of only one type of deviation in each emission was not used as eligibility criterion.

Conclusion

The PDD area and quadrant can discriminate the presence and absence of roughness, as well as the presence and absence of breathiness in synthesized voices. Signals with higher degree of roughness and breathiness are proportionally outside the area of normality, in the lower right quadrant and with concentrated density.

Peer Review under the responsibility of Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial.
☆
Please cite this article as: Lopes LW, Freitas JA, Almeida AA, Silva PO, Alves GÂ. Performance of the phonatory deviation diagram in the evaluation of rough and breathy synthesized voices. Braz J Otorhinolaryngol. 2018;84:460-72.
☆☆
Study carried out at Universidade Federal da Paraíba (UFPB), Departamento de Fonoaudiologia, João Pessoa, PB, Brazil.

References

¹
Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G, et al. A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Eur Arch Otorhinolaryngol. 2001;258:77-82.
²
Ma EP, Yiu EM. Multiparametric evaluation of dysphonic severity. J Voice. 2006;20:380-90.
³
Roy N, Barkmeir-Kraemer J, Eadie T, Sivasankar MP, Metha D, Paul D, et al. Evidence-based clinical voice assessment: a systematic review. Am J Speech Lang Pathol. 2013;22:212-26.
⁴
Kempster GB, Gerratt BR, Verdolini Abbott K, Barkmeier-Kraemer J, Hillman RE. Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol. Am J Speech Lang Pathol. 2009;18:124-32.
⁵
Oates J. Auditory-perceptual evaluation of disordered voice quality: pros, cons and future directions. Folia Phoniatr Logop. 2009;61:49-56.
⁶
Parsa V, Jamieson DG. Identification of pathological voices using glottal noise measures. J Speech Lang Hear Res. 2000;43:469-85.
⁷
Bhuta T, Patrick L, Garnett J. Perceptual evaluation of voice quality and its correlation with acoustic measurements. J Voice. 2004;18:299-304.
⁸
Lopes LW, Cavalcante DP, Costa PO. Intensidade do desvio vocal: integração de dados perceptivo-auditivos e acústicos em pacientes disfônicos. CoDAS. 2014;26:382-8.
⁹
Dejonckere P, Schoentgen J, Giordano A, Fraj S, Bocchi L, Manfredi C. Validity of jitter measures in non-quasi-periodic voices. Part I: Perceptual and computer performances in cycle pattern recognition. Logoped Phoniatr Vocol. 2011;36:70-7.
¹⁰
Baravieira PB, Brasolotto AG, Montagnoli AN, Silvério KCA, Yamasaki R, Behlau M. Análise perceptivo-auditiva de vozes rugosas e soprosas: correspondência entre a escala visual analógica e a escala numérica. CoDAS. 2016;28:163-7.
¹¹
Nayak J, Bhat PS, Acharya R, Aithal UV. Classification and analysis of speech abnormalities. ITBM-RBM. 2005;26:319-27.
¹²
Awan S, Lawson L. The effect of anchor modality on the reliability of vocal severity ratings. J Voice. 2009;23:341-52.
¹³
Dejonckere PH, Giordano A, Schoentgen J, Frag S, Bocchi L, Manfredi C. To what degree of voice perturbation are jitter measurements valid? A novel approach with synthesized vowels and visuo-perceptual pattern recognition. Comput Methods Programs Biomed J. 2012;7:37-42.
¹⁴
Lowell SY. The acoustic assessment of voice in continuous speech. SIG 3 Perspect Voice Voice Dis. 2012;22:57-63.
¹⁵
Brockmann-Bausser M, Drinnan MJ. Routine acoustic voice analysis: time to think again?. Curr Opin Otolaryngol Head Neck Surg. 2011;19:165-70.
¹⁶
Brockmann M, Drinnan MJ, Storck C, Carding PN. Reliable jitter and shimmer measurements in voice clinics: the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task. J Voice. 2011;25:44-53.
¹⁷
Madazio G, Leão S, Behlau M. The phonatory deviation diagram: a novel objective measurement of vocal function. Folia Phoniatr Logop. 2011;63:305-11.
¹⁸
Awan SN, Frenkel ML. Improvements in estimating the harmonic-to-noise ratio of the voice. J Voice. 1994;8:255-62.
¹⁹
Godino LJL, Osma RV, Sáenz LN, Vilda GP, Blanco VM, Cruz RF. The effectiveness of the glottal to noise excitation ratio for the screening of voice disorders. J Voice. 2010;24:47-56.
²⁰
Michaelis D, Gramss T, Strube HW. Glottal-to-noise excitation ratio - a new measure for describing pathological voices. Acta Acust. 1997;83:700-6.
²¹
Carrillo L, Ortiz KZ. Análise vocal (auditiva e acústica) nas disartrias. Pró-Fono R Atual Cient. 2007;19:381-6.
²²
Lopes LW, Lima ILB, Almeida LNA, Cavalcante DP, Almeida AAF. Severity of voice disorders in children: correlations between perceptual and acoustic data. J Voice. 2012;26, e7-819.e12.
²³
Lopes LW, Simões LB, Silva JD, Silva ED, Ugulino ACDN, Silva POC, et al. Accuracy of acoustic analysis measurements in the evaluation of patients with different laryngeal diagnoses. J Voice. 2016;31, 382.e15-82.e26.
²⁴
Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice quality description: case studies for different regions of the hoarseness diagram. In: Wittenberg T, Mergell P, Tigges M, Eysholdt U, editors. Advances in quantitative laryngoscopy. 2nd ed. Erlangen: Abteilung Phoniatrie, Universitäts-HNO-Klinik Göttin-gen; 1997. p. 143-50.
²⁵
Michaelis D, Frohlich M, Stube HW. Selection and combination of acoustic features for the description of pathologic voices. J Acoust Soc Am. 1998;103:1628-40.
²⁶
Fröhlich M, Michaelis D, Strube HW, Kruse E. Acoustic voice analysis by means of the hoarseness diagram. J Speech Hear Res. 2000;43:706-20.
²⁷
Behlau M, Oliveira G, Pontes P. Vocal fold self-disruption after phonotrauma on a lead actor: a case presentation. J Voice. 2009;23:726-32.
²⁸
Pifaia LR, Madazio G, Behlau M. Diagrama de desvio fonatório e análise perceptivo-auditiva pré e pós-terapia vocal. CoDAS. 2013;25:140-7.
²⁹
Lopes LW, Lima ILB, Azevedo EHM, Silva MFBL, Silva POC. Análise acústica de vozes infantis: contribuições do Diagrama de Desvio Fonatório. Rev CEFAC. 2015;17:1173-83.
³⁰
Oliveira IBD, Fernandez EDS, Gargantini EP. Disfonias orgânicas por neoplasias: análise de diagramas de desvio fonatório. Rev CEFAC. 2015;17:364-73.
³¹
Kreiman J, Gerrat B, Ito M. When and why listeners disagree in voice quality assessment tasks. J Acoust Soc Am. 2007;122:2354-64.
³²
Englert M, Madazio G, Gielow I, Lucero J, Behlau M. Perceptual error identification of human and synthesized voices. J Voice. 2016;30:e17-23.
³³
Lucero JC, Schoentgen J, Behlau M. Physics-based synthesis of disordered voices. In: Proceedings of Interspeech 14th Annual Conference of the International Speech Communication Association. 2013. p. 25-9.
³⁴
Gonçalves MIR, Pontes PAL, Vieira VP, Pontes AAL, Curcio D, De Biase NG. Transfer function of Brazilian Portuguese oral vowels: a comparative acoustic analysis. Braz J Ororhinolaryngol. 2009;75:680-4.
³⁵
Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159-75.
³⁶
Yamasaki R, Madazio G, Leão SHS, Padovani M, Azevedo R, Behlau M. Auditory-perceptual evaluation of normal and dysphonic voices using the voice deviation scale. J Voice. 2017;31:67-71.
³⁷
Connor NP, Cohen SB, Theis SM, Thibeault SL, Heatley DG, Bless DM. Attitudes of children with dysphonia. J Voice. 2008;22:197-209.
³⁸
McAllister A, Sederholm E, Perceptual Sundberg J. acoustic analysis of vocal registers in 10-year-old children. Logoped Phoniatr Vocol. 2000;25:63-71.
³⁹
Titze IR. Workshop on acoustic voice analysis: summary statement. Nat Cen Voice Speech; 1995.
⁴⁰
Olthoff A, Mrugalla S, Lasjawi R, Frolich M, Stuermer I, Kruse E, et al. Assessment of irregular voices after total and laser surgical partial laryngectomy. Arch Otolaryngol Head Neck Surg. 2003;129:994-9.
⁴¹
Harnisch W, Brosch S, Schmidt M, Hagen R. Breathing and voice quality after surgical treatment for bilateral vocal cord paralysis. Arch Otolaryngol Head Neck Surg. 2008;134:278-84.
⁴²
Schneider B, Denk DM, Bigenzahn W. Acoustic assessment of the voice quality before and after medialization thyroplasty using the titanium vocal fold medialization implant (TVFMI). Arch Otolaryngol Head Neck Surg. 2003;12:815-22.
⁴³
Samlam RA, Story BH, Bunton K. Relation of perceived breathiness to laryngeal kinematics and acoustic measures based on computacional modeling. J Speech Lang Hear Res. 2013;56:1209-23.
⁴⁴
Lucero JC, Koening LL. Simulations of temporal patterns of oral airflow in men and women using two-mass model of the vocal flods under dynamic control. J Acoust Soc Am. 2005;117:1362-72.
⁴⁵
Sprecher A, Olszewski A, Jiang JJ, Zhang Y. Updating signal typing in voice: addition of type 4 signals. J Acoust Soc Am. 2010;127:3710-6.
⁴⁶
Little MA, Costello DAE, Harries ML. Objective dysphonia quantification in vocal fold paralysis: comparing nonlinear with classical measures. J Voice. 2011;25:21-31.

Publication Dates

Publication in this collection
Jul-Aug 2018

History

Received
14 Feb 2017
Accepted
24 May 2017

This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

[1] Peer Review under the responsibility of Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial.

[2] ☆
Please cite this article as: Lopes LW, Freitas JA, Almeida AA, Silva PO, Alves GÂ. Performance of the phonatory deviation diagram in the evaluation of rough and breathy synthesized voices. Braz J Otorhinolaryngol. 2018;84:460-72.

[3] ☆☆
Study carried out at Universidade Federal da Paraíba (UFPB), Departamento de Fonoaudiologia, João Pessoa, PB, Brazil.

Variable	n	%
Degree of roughness
Normal	128	14.70
Mild to moderate	256	29.40
Moderate	475	54.50
Intense	12	1.40
Total	871	100
Degree of breathiness
Normal	365	41.90
Mild to moderate	187	21.50
Moderate	310	35.60
Intense	9	1.00
Total	871	100

Configuration	Without roughness			With roughness			p-Value
	n	%	VAS-GD	n	%	VAS-GD
Area							<0.001^a a Significant values (p < 0.05) - Chi-square test and Fisher's exact test. VAS, Visual Analogue Scale; GD, general degree.
Inside	82	64.07	20.60 ± 6.66	35	4.71	41.47 ± 23.54
Outside	46	35.93	26.61 ± 16.05	708	95.28	60.05 ± 0.00
Density							0.060
Concentrated	79	61.71	22.57 ± 17.28	514	69.15	60.92 ± 19.38
Amplified	49	38.28	23.08 ± 8.66	229	30.82	55.25 ± 0.00
Quadrant							<0.001^a a Significant values (p < 0.05) - Chi-square test and Fisher's exact test. VAS, Visual Analogue Scale; GD, general degree.
Lower left	90	70.31	21.22 ± 8.66	49	6.59	41.15 ± 23.04
Lower right	38	29.68	26.42 ± 16.05	688	92.59	60.42 ± 0.00
Upper right	0	0		6	0.80	2.83 ± 0.91
Shape							0.488
Circular	3	2.34	21.00 ± 23.16	27	3.63	68.67 ± 22.14
Horizontal	125	97.65	22.81 ± 8.66	711	95.69	58.76 ± 0.00
Vertical	0	0		5	0.67	60.30 ± 24.02

Configuration	Without breathiness			With breathiness			p-Value
	n	%	VAS-GD	n	%	VAS-GD
Area							<0.001^a a Significant values (p < 0.05) - Chi-square test and Fisher's exact test. VAS, Visual Analogue Scale; GD, general degree; PDD, phonatory deviation diagram.
Inside	90	24.65	40.90 ± 16.47	27	5.33	44.07 ± 24.86
Outside	275	75.35	53.35 ± 16.49	479	94.67	58.10 ± 10.32
Density							0.031
Concentrated	236	64.65	8.01 ± 19.09	357	70.55	58.45 ± 10.32
Amplified	129	35.35	9.13 ± 16.88	149	29.45	54.68 ± 17.42
Quadrant							<0.001^a a Significant values (p < 0.05) - Chi-square test and Fisher's exact test. VAS, Visual Analogue Scale; GD, general degree; PDD, phonatory deviation diagram.
Lower left	103	28.21	5.21 ± 15.04	36	7.12	44.08 ± 25.00
Lower right	262	71.79	9.66 ± 16.88	464	91.69	58.07 ± 10.32
Upper right	0	0		6	1.19	80.91 ± 24.83
Upper left
Shape							0.861
Circular	10	2.74	5.85 ± 25.27	20	3.96	65.55 ± 24.32
Horizontal	355	97.26	8.84 ± 16.88	481	95.05	56.91 ± 10.32
Vertical	0	0		5	0.99	64.1 ± 25.87

Configuration	Normal (0)			Mild to moderate (1)			Moderate (2)			Intense (3)			p-Value
	n	%	VAS-GD	n	%	VAS-GD	n	%	VAS-GD	n	%	VAS-GD
Area													<0.001^a
Inside	82	64.06	20.60 ± 8.66	31	12.10	38.82 ± 23.54	4	0.84	62.00 ± 29.97	0	0
Outside	46	35.93	25.61 ± 16.65	225	87.89	43.38 ± 0.00	471	99.15	67.21 ± 0.00	12	100	91.37 ± 22.14
Density
Concentrated	79	61.71	22.57 ± 11.08	153	59.76	43.30 ± 23.54	350	73.68	67.66 ± 0.00	11	91.66	91.50 ± 22.14	<0.001^a
Amplified	49	29.68	23.08 ± 8.66	103	40.23	42.13 ± 0.00	125	26.31	65.78 ± 19.80	1	8.33	90.00 ± 19.90
Quadrant
Lower left	90	70.31	21.27 ± 6.66	45	17.57	39.30 ± 23.54	4	0.84	62.00 ± 24.74	0	0		<0.001^a
Lower right	38	38.28	26.42 ± 16.05	210	82.03	43.53 ± 0.00	466	98.10	67.22 ± 0.00	12	100	91.37 ± 18.04
Upper right	0	0		1	0.39	47.05 ± 20.93	5	1.05	66.10 ± 20.60	0	0
Upper left	0	0		0	0		0	0		0	0
Shape
Circular	3	2.34	21.00 ± 23.16	3	1.17	42.50 ± 18.86	21	4.42	69.40 ± 20.60	3	25.00	90.66 ± 18.04	0.019^a
Horizontal	125	97.65	22.81 ± 8.66	251	98.04	42.80 ± 0.00	451	94.94	66.98 ± 0.00	9	75.00	97.61 ± 19.90
Vertical	0	0		2	0.78	46.25 ± 16.63	3	0.63	79. 66 ± 20.08	0	0

Configuration	Normal (0)			Mild to moderate (1)			Moderate (2)			Intense (3)			p-Value
	n	%	VAS-GD	n	%	VAS-GD	n	%	VAS-GD	n	%	VAS-GD
Area
Inside	89	24.38	4.87 ± 11.55	24	12.83	40.43 ± 22.76	4	1.29	56.37 ± 21.12	0	0		<0.001^a
Outside	276	75.61	9.55 ± 12.08	163	87.16	42.90 ± 23.22	306	98.70	65.24 ± 7.55	9	100	90.61 ± 22.39
Density
Concentrated	235	64.38	8.02 ± 13.50	124	66.31	42.50 ± 23.22	225	72.58	65.73 ± 7.55	9	100	90.61 ± 22.39	0.008^a
Amplified	130	35.61	9.13 ± 12.08	63	33.68	42.75 ± 21.34	85	27.41	63.53 ± 12.96	0	0
Quadrant
Lower left	102	27.94	5.20 ± 11.55	32	17.11	40.73 ± 22.76	5	1.61	57.90 ± 21.12	0	0		<0.001^a
Lower right	263	72.05	9.66 ± 12.08	155	82.88	42.96 ± 23.22	300	96.77	65.01 ± 7.55	8	88.88	90.68 ± 22.39
Upper right	0	0		0	0		5	1.61	79.10 ± 22.20	1	11.11	90.00 ± 21.75
Upper left	0	0		0	0		0	0		0	0
Shape
Circular	10	2.73	5.85 ± 22.54	3	1.60	44.16 ± 24.74	17	5.84	69.32 ± 22.20	0	0		0.563
Horizontal	355	97.25	8.48 ± 12.08	184	98.39	42.55 ± 23.22	288	92.90	64.90 ± 7.55	9	100	90.61 ± 22.39
Vertical	0	0		0	0		5	1.61	64.10 ± 23.61	0	0

Brasil

Brasil

Abstract

Introduction

Objective

Methods

Results

Conclusion

Resumo

Introdução

Objetivo

Método

Resultados

Conclusão

Introduction

Methods

Study design

Sample

Procedures

Data analysis

Results

Discussion

PDD performance in the evaluation of the presence and degree of roughness

PDD performance in the evaluation of the presence and degree of breathiness

Conclusion

References

Publication Dates

History