Acessibilidade / Reportar erro

Comparison between the acoustic fundamental frequency of the voice and the vibration frequency of the vocal folds analyzed by digital kymography

ABSTRACT

Purpose

To compare the frequency of vocal fold opening variation, analyzed by digital kymography, with the fundamental voice frequency obtained by acoustic analysis, in individuals without laryngeal alteration.

Methods

Observational analytical cross-sectional study. The participants were forty-eight women and 38 men from 18 to 55 years of age. The evaluation was made by voice acoustic analysis, by the habitual emission of the vowel /a/ for 3 seconds, and days of the week, and digital kymography (DKG), by the habitual emission of the vowels /i/ and /ɛ/. The measurements analyzed were acoustic fundamental frequency (f0), extracted by the Computerized Speech Lab (CSL) program, and dominant frequency of the variation of right (R-freq) and left (L-freq) vocal fold opening, obtained through the KIPS image processing program. The mounting of the kymograms consisted in the manual demarcation of the region by vertical lines delimiting width and horizontal lines separating the posterior, middle and anterior thirds of the Rima glottidis. In the statistical analysis, the Anderson-Darling test was used to verify the normality of the sample. The ANOVA and Tukey tests were performed for the comparison of measurements between the groups. For the comparison of age between the groups, the Mann-Whitney test was used.

Results

There are no differences between the values of the frequency measurement analyzed by digital kymography, with the acoustic fundamental frequency, in individuals without laryngeal alteration.

Conclusion

The values of the dominant frequency of the vocal folds opening variation, as assessed by digital kymography, and the acoustic fundamental frequency of the voice are similar, allowing comparison between these measurements in the multidimensional evaluation of the voice, in individuals without laryngeal alteration.

Keywords:
Voice; Speech Acoustics; Kymography; Vocal Fold; Tonal Height Discrimination

RESUMO

Objetivo

Comparar a frequência da variação da abertura das pregas vocais, analisada pela videoquimografia digital, com a frequência fundamental da voz, obtida através da análise acústica, em indivíduos sem alteração laríngea.

Método

Trata-se de um estudo observacional analítico transversal. Participaram 48 mulheres e 38 homens, de 18 a 55 anos. A avaliação foi composta por análise acústica da voz, obtida pela emissão habitual da vogal /a/ durante 3 segundos, e os dias da semana, e pela videoquimografia digital (DKG), obtida pela emissão habitual das vogais /i/ e /ɛ/. As medidas analisadas foram a frequência fundamental acústica (f0), extraída pelo programa Computerized Speech Lab (CSL), e a frequência dominante da variação de abertura da prega vocal direita (D-freq) e esquerda (E-freq), obtidas através do programa de processamento de imagens KIPS. A montagem dos quimogramas constou na demarcação manual da região, compostas por linhas verticais que delimitaram largura da prega vocal e linhas horizontais que marcaram os terços posterior, médio e anterior da rima glótica. Na análise estatística, o teste Anderson-Darling foi utilizado para verificar a normalidade da amostra. Os testes ANOVA e Tukey foram realizados para a comparação das medidas entre os grupos. Para a comparação da idade entre os grupos, foi utilizado o teste Mann-Whitney.

Resultados

Não existem diferenças entre os valores da medida de frequência analisada pela videoquimografia digital, com a frequência fundamental acústica, em indivíduos sem alteração laríngea.

Conclusão

Os valores da frequência dominante da variação de abertura das pregas vocais, avaliada pela videoquimografia digital, e a frequência fundamental acústica da voz são similares, permitindo uma comparação entre estas medidas na avaliação multidimensional da voz, em indivíduos sem alteração laríngea.

Descritores:
Voz; Acústica da Fala; Quimografia; Prega Vocal; Discriminação da Altura Tonal

INTRODUCTION

Starting in the 15th century, with the appearance of the Renaissance movement, science has sought means of understanding and explaining the mechanisms of voice production(11 Behlau M. Voz: o livro do especialista. São Paulo: Revinter; 2001.). Numerous theories have been published that attempt such a feat, among them the muco-undulatory theory, the chaos theory and the aerodynamic myo-elastic theory(11 Behlau M. Voz: o livro do especialista. São Paulo: Revinter; 2001.). Each one of them proposes a different model of how voice production works, considering the various aspects involved in phonation, such as the anatomical structures of the vocal tract and larynx, and their different possibilities of biomechanical movement; the physical events involved in the aerodynamics of breathing; and the acoustic result of the voice(11 Behlau M. Voz: o livro do especialista. São Paulo: Revinter; 2001.,22 Zhang Z. Mechanics of human voice production and control. J Acoust Soc Am. 2016;140(4):2614-35. http://dx.doi.org/10.1121/1.4964509. PMid:27794319.
http://dx.doi.org/10.1121/1.4964509...
). Since then, technology has advanced in the instrumentalization of researchers and professionals in the area, in order to substantiate theoretical data based on scientific knowledge.

Currently, there are a variety of tests and assessment techniques that contribute to the understanding of the biomechanics and physiology of the sounds produced by the vibration of the vocal folds, contributing to the accuracy of diagnoses and effectiveness of the rehabilitation process of dysphonia(22 Zhang Z. Mechanics of human voice production and control. J Acoust Soc Am. 2016;140(4):2614-35. http://dx.doi.org/10.1121/1.4964509. PMid:27794319.
http://dx.doi.org/10.1121/1.4964509...
). These exams provide auditory, visual and acoustic data, and can be divided into three pillars: visual image of the larynx, acoustic voice analysis, and auditory-perceptual analysis of voice quality(33 Nemr K, Amar A, Abrahão M, Leite GCA, Köhle J, Santos AO, et al. Análise comparativa entre avaliação fonoaudiológica perceptivo-auditiva, análise acústica e laringoscopias indiretas para avaliação vocal em população com queixa vocal. Rev Bras Otorrinolaringol. 2005;71(1):13-7. http://dx.doi.org/10.1590/S0034-72992005000100003.
http://dx.doi.org/10.1590/S0034-72992005...
).

.Some imaging exams frequently described in the literature are: i) direct laryngoscopy, one of the pioneering techniques for visualizing the vocal tract, described for the first time in 1895, after obtaining an appropriate light source for its performance, and which until today, after numerous technological advances, is a widespread exam in all over the world(44 Alberti PW. The history of laryngology: a centennial celebration. Otolaryngol Head Neck Surg. 1996;114(3):345-54. http://dx.doi.org/10.1016/S0194-59989670202-4. PMid:8649866.
http://dx.doi.org/10.1016/S0194-59989670...
); ii) videolaryngostroboscopy, which has been widely used since electronic equipment for performing the technique emerged in 1960(44 Alberti PW. The history of laryngology: a centennial celebration. Otolaryngol Head Neck Surg. 1996;114(3):345-54. http://dx.doi.org/10.1016/S0194-59989670202-4. PMid:8649866.
http://dx.doi.org/10.1016/S0194-59989670...
). This exam creates the illusion of slow-motion vibration of the muco-undulatory movement of the vocal folds (VFs) by capturing about 30 images per second while a pulsed light is emitted(55 Krausert CR, Olszewski AE, Taylor LN, McMurray JS, Dailey SH, Jiang JJ. Mucosal wave measurement and visualization techniques. J Voice. 2011;25(4):395-405. http://dx.doi.org/10.1016/j.jvoice.2010.02.001. PMid:20471798.
http://dx.doi.org/10.1016/j.jvoice.2010....
). The possibility of analyzing several laryngeal parameters related to the vibration pattern of the structures made this technique popular in research and, consequently, in vocal clinic(44 Alberti PW. The history of laryngology: a centennial celebration. Otolaryngol Head Neck Surg. 1996;114(3):345-54. http://dx.doi.org/10.1016/S0194-59989670202-4. PMid:8649866.
http://dx.doi.org/10.1016/S0194-59989670...
,55 Krausert CR, Olszewski AE, Taylor LN, McMurray JS, Dailey SH, Jiang JJ. Mucosal wave measurement and visualization techniques. J Voice. 2011;25(4):395-405. http://dx.doi.org/10.1016/j.jvoice.2010.02.001. PMid:20471798.
http://dx.doi.org/10.1016/j.jvoice.2010....
); ii) electroglottography, a non-invasive exam that detects the electrical activity of vibrations in the glottic area and is able to describe parameters such as the duration and relative pattern of contact of the VF cycle by cycle (4); iii) digital kymography, a more recent high-tech exam, which combined with high-speed recording, generates kymograms that allow quantitative analysis of parameters such as amplitude and frequency of each vocal fold and cycle by cycle, measuring asymmetries between them(55 Krausert CR, Olszewski AE, Taylor LN, McMurray JS, Dailey SH, Jiang JJ. Mucosal wave measurement and visualization techniques. J Voice. 2011;25(4):395-405. http://dx.doi.org/10.1016/j.jvoice.2010.02.001. PMid:20471798.
http://dx.doi.org/10.1016/j.jvoice.2010....
,66 Svec JG, Sram F, Schutte HK. Videokymography: a new high-speed method for the examination of vocal-fold vibrations. Otorinolaryngol Foniatr. 1999;48:155-62.).

On the other hand, the voice acoustic analysis is responsible for measuring the characteristics of the sound, and it happens by means of specific programs, developed to determine measures such as: i) fundamental frequency (f0), defined by the quantity of cycles of vibration of the VFs per second; ii) jitter and shimmer, two perturbation measures that indicate the variability of f0 (jitter) and sound wave amplitude (shimmer), both in the short term; iii) harmonic-noise proportion, which quantifies the noise generated by the air turbulence through the glottic structures, among other measures(77 Araújo SA, Grellet M, Pereira JC, Rosa MO. Normatização de medidas acústicas da voz normal. Rev Bras Otorrinolaringol. 2002;68(4):540-4. http://dx.doi.org/10.1590/S0034-72992002000400014.
http://dx.doi.org/10.1590/S0034-72992002...
,88 Read C, Buder EH, Kent RD. Speech analysis systems: an evaluation. J Speech Hear Res. 1992;35(2):314-32. http://dx.doi.org/10.1044/jshr.3502.314. PMid:1573872.
http://dx.doi.org/10.1044/jshr.3502.314...
).

The auditory-perceptual analysis is a subjective technique of vocal evaluation, in which the evaluator auditorily detects the characteristics of the individual's vocal pattern and can classify it in different ways(33 Nemr K, Amar A, Abrahão M, Leite GCA, Köhle J, Santos AO, et al. Análise comparativa entre avaliação fonoaudiológica perceptivo-auditiva, análise acústica e laringoscopias indiretas para avaliação vocal em população com queixa vocal. Rev Bras Otorrinolaringol. 2005;71(1):13-7. http://dx.doi.org/10.1590/S0034-72992005000100003.
http://dx.doi.org/10.1590/S0034-72992005...
). This technique is considered the gold standard in vocal assessment and relating it to the acoustic analysis data and to the vibration pattern of VFs found in some imaging exams is what guarantees a multidimensional voice assessment(33 Nemr K, Amar A, Abrahão M, Leite GCA, Köhle J, Santos AO, et al. Análise comparativa entre avaliação fonoaudiológica perceptivo-auditiva, análise acústica e laringoscopias indiretas para avaliação vocal em população com queixa vocal. Rev Bras Otorrinolaringol. 2005;71(1):13-7. http://dx.doi.org/10.1590/S0034-72992005000100003.
http://dx.doi.org/10.1590/S0034-72992005...
,99 Pimenta RA. Uso da avaliação multidimensional da voz na caracterização vocal de pacientes com paralisia unilateral de pregas vocais [dissertação]. São Carlos: Universidade de São Paulo; 2016.).

Owing to the variety of tests and analysis techniques available for studies in the area of voice, and due to the fact that each one has its own advantages and limitations, it has become common in literature and in vocal clinic the integration and correlation of data from auditory, visual and acoustic analysis of voice production(55 Krausert CR, Olszewski AE, Taylor LN, McMurray JS, Dailey SH, Jiang JJ. Mucosal wave measurement and visualization techniques. J Voice. 2011;25(4):395-405. http://dx.doi.org/10.1016/j.jvoice.2010.02.001. PMid:20471798.
http://dx.doi.org/10.1016/j.jvoice.2010....
). Thus, the multidimensional evaluation of the voice becomes feasible, which is extremely important, because by relating different evaluative data, it enables a broader understanding of the vocal pattern of each individual(11 Behlau M. Voz: o livro do especialista. São Paulo: Revinter; 2001.,99 Pimenta RA. Uso da avaliação multidimensional da voz na caracterização vocal de pacientes com paralisia unilateral de pregas vocais [dissertação]. São Carlos: Universidade de São Paulo; 2016.).

One study that used high-speed kymography and acoustic voice analysis to measure the immediate effect of sonorated tongue vibration exercises and basal sound, identified that tongue vibration exercise in women resulted in a significant decrease in jitter and closed phase time of kymography, while the video kymographic parameters of time of the opening and open phases of VFs increased. This result indicates less effort and higher quality in vocal production(1010 Pimenta RA, Dájer ME, Hachiya A, Tsuji DH, Montagnoli AN. Parameters acoustic and high-speed kymography identified effects of voiced vibration and vocal fry exercises. CoDAS. 2013;25(6):577-83. http://dx.doi.org/10.1590/S2317-17822014000100010. PMid:24626983.
http://dx.doi.org/10.1590/S2317-17822014...
).

In another research, when observing the correlation between the parameters of videokymography and acoustic analysis of voice, the authors described a typical phenomenon in Mongolian singers, the “Kargyraa”, in which the vestibular folds vibrated together with the VFs during emission, with complete closure, reduced frequency and different phase of the VFs, and were responsible for modulating the sound of the singing voice. The authors observed a correlation between the videokymography parameters and the presence of subharmonics in acoustic voice analysis(1111 Lindestad PA, Sodersten M, Merker B, Granqvist S. Voice source characteristics in Mongolian “throat singing” studied with high-speed imaging technique, acoustic spectra, and inverse filtering. J Voice. 2001;15(1):78-85. http://dx.doi.org/10.1016/S0892-1997(01)00008-X. PMid:12269637.
http://dx.doi.org/10.1016/S0892-1997(01)...
).

Another research correlated the results of voice acoustic measurements, perceptual-auditory analysis, and parameters of videokymography to explain the impact of vibration of the vestibular folds on vocal quality in usual and whispered emissions, concluded that the vestibular folds can vibrate without causing damage to vocal quality and this depends on factors such as voice frequency and regularity of vibration of this structure(1212 Lindestad PA, Blixt V, Pahlberg-Olsson J, Hammarberg B. Ventricular fold vibration in voice production: a highspeed imaging study with kymographic, acoustic and perceptual analyses of a voice patient and a vocally healthy subject. Logoped Phoniatr Vocol. 2004;29(4):162-70. http://dx.doi.org/10.1080/14015430410020339. PMid:15764210.
http://dx.doi.org/10.1080/14015430410020...
).

A study compared the videokymographic open quotient and vocal intensity and concluded that the increase in the intensity is correlated with the reduction in the open quotient, in addition to the involuntary increase in f0 (1313 Koishi HU, Tsuji DH, Imamura R, Sennes LU. Vocal intensity variation: a study of vocal folds vibration in humans with videokymography. Rev Bras Otorrinolaringol. 2003;69(4):464-70. http://dx.doi.org/10.1590/S0034-72992003000400005.
http://dx.doi.org/10.1590/S0034-72992003...
). Another study, when determining the relationship between different open quotients and intensity, frequency and phonation modes, observed strong correlation between videokymography open quotients and the parameters analyzed, except for f0 (1414 Yokonishi H, Imagawa H, Sakakibara KI, Yamauchi A, Nito T, Yamasoba T, et al. Relationship of various open quotients with acoustic property, phonation types, fundamental frequency, and intensity. J Voice. 2016;30(2):145-57. http://dx.doi.org/10.1016/j.jvoice.2015.01.009. PMid:25953586.
http://dx.doi.org/10.1016/j.jvoice.2015....
). In another study, upon realizing that the size of the glottis area is closely related to the variations of f0, the authors arrived at results that suggested that the combined analysis of digital videokymography (DKG) parameters and acoustic data of the voice is promising and can help in the identification of different types of vocal qualities(1515 Larsson H, Hertegard S, Lindestad PA, Hammarberg B. Vocal fold vibrations: high-speed imaging, kymography, and acoustic analysis: a preliminary report. Laryngoscope. 2000;110(12):2117-22. http://dx.doi.org/10.1097/00005537-200012000-00028. PMid:11129033.
http://dx.doi.org/10.1097/00005537-20001...
).

All the studies mentioned above(1010 Pimenta RA, Dájer ME, Hachiya A, Tsuji DH, Montagnoli AN. Parameters acoustic and high-speed kymography identified effects of voiced vibration and vocal fry exercises. CoDAS. 2013;25(6):577-83. http://dx.doi.org/10.1590/S2317-17822014000100010. PMid:24626983.
http://dx.doi.org/10.1590/S2317-17822014...
,1111 Lindestad PA, Sodersten M, Merker B, Granqvist S. Voice source characteristics in Mongolian “throat singing” studied with high-speed imaging technique, acoustic spectra, and inverse filtering. J Voice. 2001;15(1):78-85. http://dx.doi.org/10.1016/S0892-1997(01)00008-X. PMid:12269637.
http://dx.doi.org/10.1016/S0892-1997(01)...
,1212 Lindestad PA, Blixt V, Pahlberg-Olsson J, Hammarberg B. Ventricular fold vibration in voice production: a highspeed imaging study with kymographic, acoustic and perceptual analyses of a voice patient and a vocally healthy subject. Logoped Phoniatr Vocol. 2004;29(4):162-70. http://dx.doi.org/10.1080/14015430410020339. PMid:15764210.
http://dx.doi.org/10.1080/14015430410020...
,1313 Koishi HU, Tsuji DH, Imamura R, Sennes LU. Vocal intensity variation: a study of vocal folds vibration in humans with videokymography. Rev Bras Otorrinolaringol. 2003;69(4):464-70. http://dx.doi.org/10.1590/S0034-72992003000400005.
http://dx.doi.org/10.1590/S0034-72992003...
,1414 Yokonishi H, Imagawa H, Sakakibara KI, Yamauchi A, Nito T, Yamasoba T, et al. Relationship of various open quotients with acoustic property, phonation types, fundamental frequency, and intensity. J Voice. 2016;30(2):145-57. http://dx.doi.org/10.1016/j.jvoice.2015.01.009. PMid:25953586.
http://dx.doi.org/10.1016/j.jvoice.2015....
,1515 Larsson H, Hertegard S, Lindestad PA, Hammarberg B. Vocal fold vibrations: high-speed imaging, kymography, and acoustic analysis: a preliminary report. Laryngoscope. 2000;110(12):2117-22. http://dx.doi.org/10.1097/00005537-200012000-00028. PMid:11129033.
http://dx.doi.org/10.1097/00005537-20001...
) show the importance of correlating data from different vocal and laryngeal evaluations for a better understanding of the aspects that involve vocal production.

It was not possible to identify any study that compared the measures related to the vibration frequency of VFs from digital kymography and acoustic analysis. Knowing that this data can contribute to the multidimensional evaluation of the voice, and consequently help in the process of vocal production analysis, this study aims to compare the frequency of the variation of vocal fold opening, analyzed by digital kymography, with the fundamental frequency of the voice, obtained through acoustic analysis, in individuals without laryngeal alteration.

The results of this research can contribute to a better understanding between the correlation of acoustic f0 and the number of glottic cycles obtained by the DKG, enabling the correlation between acoustic voice data and functional data of the VFs vibration velocity.

METHODS

The present cross-sectional analytical observational study was approved by the Research Ethics Committee of Universidade Federal de Minas Gerais (UFMG) under process number 1,126,016. The convenience sample was made up of men and women, normal upon laryngeal examination, assessed using high-speed videolaryngoscopy (HSV), and aged between 18 and 55 years. All participants signed the Informed Consent Form.

Normal laryngeal examination was those who showed no lesions in the VFs with symmetry and periodicity of mucosal wave, and complete glottic closure. The presence of posterior triangular glottic chink in women was considered physiological(1616 Cielo CA, Schwarz K, Finger LS, Lima JM, Christmann MK. Glottal closure in women with no voice complaints or laryngeal disorders. Int Arch Otorhinolaryngol. 2019;23(4):e384-8. http://dx.doi.org/10.1055/s-0038-1676108. PMid:31649756.
http://dx.doi.org/10.1055/s-0038-1676108...
).

Exclusion criteria were laryngeal signs of gastroesophageal reflux, pregnancy, menstrual or premenstrual period, smoking, cervical surgeries, hormonal diseases, laryngeal diseases, self-reported upper airway infections, and the presence of an exacerbated nauseous reflex that were impediments to perform the exam.

The evaluation process consisted of two exams, namely the HSV, later analyzed through the DKG and the acoustic analysis of the voice. All assessments were conducted at the Functional Health Observatory in Speech-Language Pathology of the School of Medicine at the Universidade Federal de Minas Gerais (OSF/UFMG).

The laryngeal exam consisted of a HSV evaluation performed by two otorhinolaryngologists. Each examination consisted of 2000 images per second, taken with a rigid 70° laryngoscope with 300W xenon light (KayPentax®, Lincoln Park, New Jersey) with a model 9710 high-speed color video-laryngoscopy system. The image resolution used was 512 x 512 pixels with 8-bit RGB color mode. The records obtained through the habitual emission of the vowels /i/ and /ɛ/, were evaluated by selecting the most appropriate sequence of images. Larynges without benign changes and with complete glottal closure were considered normal. The exams were analyzed by the two otorhinolaryngologists, by consensus. Both had more than five years of experience in laryngology.

Ninety-eight laryngeal images were analyzed, by one of the researchers, and 12 were excluded due to low sharpness or corrupted file that made DKG analysis impossible, resulting in a final number of 86 subjects with DKG data and acoustic measurements.

The data evaluated referred to 86 participants, one group consisting of 48 women, and one group consisting of 38 men, all without laryngeal changes or with complete glottal closure. The group of women had a mean age of 26.8 years (18 to 55 years; SD=6.83) and the group of men a mean age of 26.4 years (18 to 44 years; SD=5.82), with no age difference between the groups (p=0.906).

For the acoustic analysis we used the Kay Pentax® Computerized Speech Lab (CSL), model 6103, Multi-Dimensional Voice Program (MDVP)15 module, installed in a Dell® Optiplex GX260 computer, with a DirectSound® professional sound card and a Shure® unidirectional condenser microphone. The participants were taken to an acoustically treated room and asked to stand with their feet slightly apart and their mouths 10 cm away from the microphone, which was placed on a pedestal. The recording was captured with the emission of the vowel /a/ in the usual manner, for 3 seconds, followed by the days of the week.

The measure analyzed was the fundamental frequency (f0), extracted automatically in the aforementioned program, which quantifies the average number of glottal cycles that happen per second throughout the emission time, in Hertz. The divergence of the collection material requested in the two tests is due to the fact that the vowels /i/ and /ɛ/ provide a higher position of the larynx during emission, making it easier to capture images by HSV(1717 Mehta DD, Hillman RE. Current role of stroboscopy in laryngeal imaging. Curr Opin Otolaryngol Head Neck Surg. 2012;20(6):429-36. http://dx.doi.org/10.1097/MOO.0b013e3283585f04. PMid:22931908.
http://dx.doi.org/10.1097/MOO.0b013e3283...
,1818 Woo P. Stroboscopy and high-speed imaging of the vocal function. 2. ed. San Diego: Plural Publishing; 2022.).

Using the HSV laryngeal images we applied the image processing program called KIPS® (Kay's Image Processing Software), version 1.11, provided by KayPENTAX® for generation and analysis of digital kymography (DKG) parameters. The digital kymogram assembly process started with the manual demarcation of the region of the VFs to be analyzed in the images, consisting of two vertical lines, which delimit the width of the kymographic area, and 3 horizontal lines, representing the posterior thirds (line 1), demarcated below the vocal process of the arytenoids, middle (line 2) and anterior (line 3) in the Rima glottidis, and subsequently provided the frequency variation data (Figure 1).

Figure 1
Manual demarcation of vertical and horizontal lines

Subsequently, the beginning and end of the records were discarded and the program automatically performed the two-dimensional montage of the mucosal undulatory motion of the VFs (Figure 2).

Figure 2
2-dimensional assembly of the kymogram

Once the DKG graph was generated, the edge detection rectangle fitting tool and the edge detection tool were used to convert the images to grayscale, and the selection of the Rima glottidis for each line was finalized. Using this as a starting point, the program allows the analysis of the frequency variation of the lines, by means of the Fourier Transform (FFT), and can thus quantify the intensity changes of each pixel, image by image, and translate the information into graphs with frequency variation by time, so that the lines are overlapped (Figure 3) or separated (Figure 4). This tool provides in an objective way the cycle-by-cycle analysis of the vibration of the VFs.

Figure 3
FFT plot with line overlap
Figure 4
FFT plot with separate lines

KIPS® analyzed and quantified the data from the DKG graph and used in this research the parameters dominant frequency of the right vocal fold opening variation (R-freq) and dominant frequency of the left vocal fold opening variation (L-freq), both measured in pixels.

The demarcation of all laryngeal images was performed by one of the researchers, who showed an intra-rater agreement of 76% for the analysis of R-freq and 95% of L-freq. For analysis of agreement, 10 laryngeal images were duplicated.

The MINITAB program, version 17, was used for statistical analysis of the data. Initially, a descriptive analysis of the sample and the variables, with measures of central tendency and dispersion was performed. Afterwards, the Anderson-Darling test was applied to verify the normality of the variables. The ANOVA and Tukey tests were performed for multiple comparisons of the measurements between the groups. The Mann-Whitney non-parametric test was used to compare between age groups. To assess intra-rater agreement, the Intraclass Correlation Coefficient (ICC) was used in the PAST program (version 4.08). The evaluator showed excellent agreement(1919 Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159-74. http://dx.doi.org/10.2307/2529310. PMid:843571.
http://dx.doi.org/10.2307/2529310...
) for both parameters of the DKG.

RESULTS

The results are evidenced in two tables, which show the comparison between the acoustic fundamental frequency and DKG frequency values of the right and left VFs, in rows 1, 2 and 3, in women (Table 1) and men (Table 2).

Table 1
Comparison of acoustic fundamental frequency and DKG frequency values of the right and left vocal folds in women without alteration
Table 2
Comparison of acoustic fundamental frequency and DKG frequency values of the right and left vocal folds in men without alteration

There are no differences between the frequency values of the vocal fold opening variation, analyzed by DKG, with the f0 of the voice, obtained through acoustic analysis, in individuals without laryngeal alteration.

DISCUSSION

The results of this research allow us to conclude that the acoustic f0 values ​​are correlated with the number of glottic cycles of the VFs, suggesting a correlation between the acoustic data of the number of sound waves, with the functional parameters of the VFs vibration velocity.

The f0 is defined as the number of oscillations of a wave in the interval of 1 second, given preferably in Hertz(77 Araújo SA, Grellet M, Pereira JC, Rosa MO. Normatização de medidas acústicas da voz normal. Rev Bras Otorrinolaringol. 2002;68(4):540-4. http://dx.doi.org/10.1590/S0034-72992002000400014.
http://dx.doi.org/10.1590/S0034-72992002...
). When applied to voice studies, this measurement corresponds to the number of cycles of the waves produced by the vibration of the VFs, and is studied and analyzed from 3 parameters: functional, auditory and acoustic(33 Nemr K, Amar A, Abrahão M, Leite GCA, Köhle J, Santos AO, et al. Análise comparativa entre avaliação fonoaudiológica perceptivo-auditiva, análise acústica e laringoscopias indiretas para avaliação vocal em população com queixa vocal. Rev Bras Otorrinolaringol. 2005;71(1):13-7. http://dx.doi.org/10.1590/S0034-72992005000100003.
http://dx.doi.org/10.1590/S0034-72992005...
).

Regarding the functional field, the measure corresponds to the speed of vibration of the VFs, which combine the opening and closing movements in an almost cyclical manner (in normal voices) and result in voice production(2020 Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.). In auditory terms, frequency analysis corresponds to a subjective clinical practice, through the evaluator's impression of the patient's voice(2121 Nemr K, Simões-Zenari M, Cordeiro GF, Tsuji D, Ogawa AI, Ubrig MT, et al. GRBAS and Cape-V scales: high reliability and consensus when applied at different times. J Voice. 2012;26(6):812.E17-22. http://dx.doi.org/10.1016/j.jvoice.2012.03.005. PMid:23026732.
http://dx.doi.org/10.1016/j.jvoice.2012....
). The auditory perception of frequency, with all its changes made by the vocal tract, is called pitch, and depends for its evaluation on psychosocial factors such as age and gender(2121 Nemr K, Simões-Zenari M, Cordeiro GF, Tsuji D, Ogawa AI, Ubrig MT, et al. GRBAS and Cape-V scales: high reliability and consensus when applied at different times. J Voice. 2012;26(6):812.E17-22. http://dx.doi.org/10.1016/j.jvoice.2012.03.005. PMid:23026732.
http://dx.doi.org/10.1016/j.jvoice.2012....
,2222 Babacan O, Drugman T, d’Alessandro N, Henrich N, Dutoit T. A comparative study of pitch extraction algorithms on a large variety of singing sounds. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing; 2013 Mai 26-31; Vancouver. Anais. Nova York: IEEE. p. 7815. http://dx.doi.org/10.1109/ICASSP.2013.6639185.
http://dx.doi.org/10.1109/ICASSP.2013.66...
). In the acoustic sphere, f0 is the lowest frequency detected in the voice signal, analyzed by means of specialized extraction algorithms(2323 Lacerda EB. Detecção de frequência fundamental baseada em mecanismos laríngeos [dissertação]. Recife: Centro de Informática/Universidade Federal de Pernambuco; 2018.).

There are multiple ways of extracting or estimating the fundamental frequency, based on various mathematical models(2020 Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.,2323 Lacerda EB. Detecção de frequência fundamental baseada em mecanismos laríngeos [dissertação]. Recife: Centro de Informática/Universidade Federal de Pernambuco; 2018.). Each method is created based on a category of input domain, and the main ones are time domain and frequency domain(2020 Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.). In the time domain, there are event rate detection, autocorrelation, and phase space methods. Regarding the frequency domain there are the frequency component rate, filter-based, cepstral analysis, and multi-resolution methods(2020 Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.).

Peak rate is a time domain method that counts the number of wave peaks per second. In addition, the distance between the peaks reveals the wavelength, a measure inversely proportional to frequency(2020 Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.). Another widely used method is YIN, which consists of a combination of autocorrelation and cancellation techniques in the algorithm. Using the developed formulas, this method decreases the chance of subharmonic peak counting error(2020 Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.). An example of a frequency domain method is cepstral analysis, which takes into account regularly spaced partial frequencies and gives more linearity to the analysis by means of a logarithmic version of the Fourier Transform, which transforms the spectrum into a cepstrum(2020 Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.). Most algorithms have their own characteristics that can result in advantages or disadvantages in the estimation of the fundamental frequency of the voice, depending mainly on the type of sample(2020 Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.,2222 Babacan O, Drugman T, d’Alessandro N, Henrich N, Dutoit T. A comparative study of pitch extraction algorithms on a large variety of singing sounds. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing; 2013 Mai 26-31; Vancouver. Anais. Nova York: IEEE. p. 7815. http://dx.doi.org/10.1109/ICASSP.2013.6639185.
http://dx.doi.org/10.1109/ICASSP.2013.66...
,2323 Lacerda EB. Detecção de frequência fundamental baseada em mecanismos laríngeos [dissertação]. Recife: Centro de Informática/Universidade Federal de Pernambuco; 2018.).

The correlation among functional, acoustic, and auditory aspects are of key importance for the multidimensional evaluation of the voice, which consists of the integration and interpretation of data obtained through phonoaudiological and otorhinolaryngological evaluations and the patient's self-perception of the vocal phenomenon(11 Behlau M. Voz: o livro do especialista. São Paulo: Revinter; 2001.,99 Pimenta RA. Uso da avaliação multidimensional da voz na caracterização vocal de pacientes com paralisia unilateral de pregas vocais [dissertação]. São Carlos: Universidade de São Paulo; 2016.). Thus, by associating various methods of evaluation, this approach is able to detect losses not only physiological, as well as social and environmental(11 Behlau M. Voz: o livro do especialista. São Paulo: Revinter; 2001.,99 Pimenta RA. Uso da avaliação multidimensional da voz na caracterização vocal de pacientes com paralisia unilateral de pregas vocais [dissertação]. São Carlos: Universidade de São Paulo; 2016.).

In this way, by stating that the measures of acoustic fundamental frequency and the dominant frequency of the opening variation of the VFs of DKG are equivalent, it contributes to the understanding of the correlation of acoustic and functional data, and consequently, for a more efficient and integral vocal evaluation. Such correlation is essential for a better functional understanding of the biomechanical process underlying the evaluated voice quality, which can bring important functional understandings in the case of dysphonia, helping in the decision-making of vocal techniques necessary for the desired functional rebalance in the speech-language pathology treatment process.

The findings of this research show mean acoustic f0 values for women of 214.81 Hz, and for men 129.82 Hz (Tables 1 and 2). These results are similar to what has been found in the literature, with mean values ranging from 194.09 Hz to 219.6 Hz for women and 118 Hz to 142 Hz for men(66 Svec JG, Sram F, Schutte HK. Videokymography: a new high-speed method for the examination of vocal-fold vibrations. Otorinolaryngol Foniatr. 1999;48:155-62.,2424 Spazzapan EA, Cardoso VM, Fabron EMG, Berti LC, Brasolotto AG, Marino VC. Acoustic characteristics of healthy voices of adults: from young to middle age. CoDAS. 2018;30(5):e20170225. PMid:30365649.,2525 Arantes P, Linhares ME. Efeito da língua, estilo de elocução e sexo do falante sobre medidas globais da frequência fundamental. Let Hoje. 2017;52(1):26-39. http://dx.doi.org/10.15448/1984-7726.2017.1.25419.
http://dx.doi.org/10.15448/1984-7726.201...
).

Regarding the digital kymographic frequency measurements, the mean values in the women's group ranged from 211.46 Hz to 216.96 Hz for left and right VFs (Table 1). The values were slightly lower than those found in the literature(2626 Baravieira PB. Análise do padrão vibratório das pregas vocais em sujeitos com e sem nódulo vocal por meio da videolaringoscopia de alta velocidade [dissertação]. São Carlos: Universidade de São Paulo; 2012. http://dx.doi.org/10.11606/D.82.2012.tde-17072012-142118.
http://dx.doi.org/10.11606/D.82.2012.tde...
,2727 Nascimento UN, Santos MAR, Gama ACC. Digital videokymography: analysis of glottal closure in adults. J Voice. 2021. In press. PMid:34417083.) with women without laryngeal alteration, for lines 1 and 2, and similar in line 3.

In the men’s group, the mean values were 136.92 Hz to 138.24 Hz (Table 2) for the left and right VFs, respectively. These values are in agreement with the literature(2727 Nascimento UN, Santos MAR, Gama ACC. Digital videokymography: analysis of glottal closure in adults. J Voice. 2021. In press. PMid:34417083.,2828 Yamauchi A, Yokonishi H, Imagawa H, Sakakibara K, Nito T, Tayama N, et al. Quantitative analysis of digital videokymography: a preliminary study on age- and gender-related difference of vocal fold vibration in normal speakers. J Voice. 2015;29(1):109-19. http://dx.doi.org/10.1016/j.jvoice.2014.05.006. PMid:25228432.
http://dx.doi.org/10.1016/j.jvoice.2014....
).

It is important to point out that these tests are complementary, since the acoustic parameters are extracted from the vocal emission, and the DKG parameters from the laryngeal image. Despite being complementary, the results suggest that the aspects of vocal and laryngeal assessment are correlated, and that by acoustically analyzing the voice, the clinician can make inferences about the physiological aspects of the larynx. Considering the object of study of this research, the acoustic parameter of the voice related to f0 reflects the functional parameter of the number of glottal cycles.

It is worth mentioning as a limitation of the research the difference between the speech material used for the acoustic analysis (vowel /a/) and for the DKG (vowels /i/ and /ɛ/). In both acoustic f0 and DKG data collection, participants were monitored in relation to habitual emission, in frequency and intensity. However, the very nature of the HSV assessment, which requires tongue protrusion, prevents a habitual positioning of the vocal tract during the acquisition of images, and the emission of the vowels /i/ and /ɛ/ requires a slightly higher positioning of the larynx(1717 Mehta DD, Hillman RE. Current role of stroboscopy in laryngeal imaging. Curr Opin Otolaryngol Head Neck Surg. 2012;20(6):429-36. http://dx.doi.org/10.1097/MOO.0b013e3283585f04. PMid:22931908.
http://dx.doi.org/10.1097/MOO.0b013e3283...
). Both the protrusion of the tongue during the exam and the selected vowels are necessary for a better visualization of the VFs(1717 Mehta DD, Hillman RE. Current role of stroboscopy in laryngeal imaging. Curr Opin Otolaryngol Head Neck Surg. 2012;20(6):429-36. http://dx.doi.org/10.1097/MOO.0b013e3283585f04. PMid:22931908.
http://dx.doi.org/10.1097/MOO.0b013e3283...
). Such aspects may interfere in the measures analyzed; however, the research results allow us to conclude that these aspects did not interfere in the results.

There are few studies in the literature with data from digital kymography but comparing the dominant frequency data of the opening variation of the VFs with the acoustic fundamental frequency is a viable option, since these measures do not differ in men and women without vocal alteration.

CONCLUSION

The comparison of the values of the dominant frequency of the VFs opening variation as evaluated by digital kymography, with the acoustic fundamental frequency of the voice show that they are similar, which allows a comparison between these measures in the multidimensional evaluation of the voice.

  • Study conducted at Universidade Federal de Minas Gerais - UFMG - Belo Horizonte (MG), Brasil.
  • Financial support: PROBIC/FAPEMIG for granting a scientific initiation scholarship.

REFERÊNCIAS

  • 1
    Behlau M. Voz: o livro do especialista. São Paulo: Revinter; 2001.
  • 2
    Zhang Z. Mechanics of human voice production and control. J Acoust Soc Am. 2016;140(4):2614-35. http://dx.doi.org/10.1121/1.4964509 PMid:27794319.
    » http://dx.doi.org/10.1121/1.4964509
  • 3
    Nemr K, Amar A, Abrahão M, Leite GCA, Köhle J, Santos AO, et al. Análise comparativa entre avaliação fonoaudiológica perceptivo-auditiva, análise acústica e laringoscopias indiretas para avaliação vocal em população com queixa vocal. Rev Bras Otorrinolaringol. 2005;71(1):13-7. http://dx.doi.org/10.1590/S0034-72992005000100003
    » http://dx.doi.org/10.1590/S0034-72992005000100003
  • 4
    Alberti PW. The history of laryngology: a centennial celebration. Otolaryngol Head Neck Surg. 1996;114(3):345-54. http://dx.doi.org/10.1016/S0194-59989670202-4 PMid:8649866.
    » http://dx.doi.org/10.1016/S0194-59989670202-4
  • 5
    Krausert CR, Olszewski AE, Taylor LN, McMurray JS, Dailey SH, Jiang JJ. Mucosal wave measurement and visualization techniques. J Voice. 2011;25(4):395-405. http://dx.doi.org/10.1016/j.jvoice.2010.02.001 PMid:20471798.
    » http://dx.doi.org/10.1016/j.jvoice.2010.02.001
  • 6
    Svec JG, Sram F, Schutte HK. Videokymography: a new high-speed method for the examination of vocal-fold vibrations. Otorinolaryngol Foniatr. 1999;48:155-62.
  • 7
    Araújo SA, Grellet M, Pereira JC, Rosa MO. Normatização de medidas acústicas da voz normal. Rev Bras Otorrinolaringol. 2002;68(4):540-4. http://dx.doi.org/10.1590/S0034-72992002000400014
    » http://dx.doi.org/10.1590/S0034-72992002000400014
  • 8
    Read C, Buder EH, Kent RD. Speech analysis systems: an evaluation. J Speech Hear Res. 1992;35(2):314-32. http://dx.doi.org/10.1044/jshr.3502.314 PMid:1573872.
    » http://dx.doi.org/10.1044/jshr.3502.314
  • 9
    Pimenta RA. Uso da avaliação multidimensional da voz na caracterização vocal de pacientes com paralisia unilateral de pregas vocais [dissertação]. São Carlos: Universidade de São Paulo; 2016.
  • 10
    Pimenta RA, Dájer ME, Hachiya A, Tsuji DH, Montagnoli AN. Parameters acoustic and high-speed kymography identified effects of voiced vibration and vocal fry exercises. CoDAS. 2013;25(6):577-83. http://dx.doi.org/10.1590/S2317-17822014000100010 PMid:24626983.
    » http://dx.doi.org/10.1590/S2317-17822014000100010
  • 11
    Lindestad PA, Sodersten M, Merker B, Granqvist S. Voice source characteristics in Mongolian “throat singing” studied with high-speed imaging technique, acoustic spectra, and inverse filtering. J Voice. 2001;15(1):78-85. http://dx.doi.org/10.1016/S0892-1997(01)00008-X PMid:12269637.
    » http://dx.doi.org/10.1016/S0892-1997(01)00008-X
  • 12
    Lindestad PA, Blixt V, Pahlberg-Olsson J, Hammarberg B. Ventricular fold vibration in voice production: a highspeed imaging study with kymographic, acoustic and perceptual analyses of a voice patient and a vocally healthy subject. Logoped Phoniatr Vocol. 2004;29(4):162-70. http://dx.doi.org/10.1080/14015430410020339 PMid:15764210.
    » http://dx.doi.org/10.1080/14015430410020339
  • 13
    Koishi HU, Tsuji DH, Imamura R, Sennes LU. Vocal intensity variation: a study of vocal folds vibration in humans with videokymography. Rev Bras Otorrinolaringol. 2003;69(4):464-70. http://dx.doi.org/10.1590/S0034-72992003000400005
    » http://dx.doi.org/10.1590/S0034-72992003000400005
  • 14
    Yokonishi H, Imagawa H, Sakakibara KI, Yamauchi A, Nito T, Yamasoba T, et al. Relationship of various open quotients with acoustic property, phonation types, fundamental frequency, and intensity. J Voice. 2016;30(2):145-57. http://dx.doi.org/10.1016/j.jvoice.2015.01.009 PMid:25953586.
    » http://dx.doi.org/10.1016/j.jvoice.2015.01.009
  • 15
    Larsson H, Hertegard S, Lindestad PA, Hammarberg B. Vocal fold vibrations: high-speed imaging, kymography, and acoustic analysis: a preliminary report. Laryngoscope. 2000;110(12):2117-22. http://dx.doi.org/10.1097/00005537-200012000-00028 PMid:11129033.
    » http://dx.doi.org/10.1097/00005537-200012000-00028
  • 16
    Cielo CA, Schwarz K, Finger LS, Lima JM, Christmann MK. Glottal closure in women with no voice complaints or laryngeal disorders. Int Arch Otorhinolaryngol. 2019;23(4):e384-8. http://dx.doi.org/10.1055/s-0038-1676108 PMid:31649756.
    » http://dx.doi.org/10.1055/s-0038-1676108
  • 17
    Mehta DD, Hillman RE. Current role of stroboscopy in laryngeal imaging. Curr Opin Otolaryngol Head Neck Surg. 2012;20(6):429-36. http://dx.doi.org/10.1097/MOO.0b013e3283585f04 PMid:22931908.
    » http://dx.doi.org/10.1097/MOO.0b013e3283585f04
  • 18
    Woo P. Stroboscopy and high-speed imaging of the vocal function. 2. ed. San Diego: Plural Publishing; 2022.
  • 19
    Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159-74. http://dx.doi.org/10.2307/2529310 PMid:843571.
    » http://dx.doi.org/10.2307/2529310
  • 20
    Gerhard D. Pitch extraction and fundamental frequency: history and current techniques: history and current techniques. Regina: Department of Computer Science/University of Regina; 2003.
  • 21
    Nemr K, Simões-Zenari M, Cordeiro GF, Tsuji D, Ogawa AI, Ubrig MT, et al. GRBAS and Cape-V scales: high reliability and consensus when applied at different times. J Voice. 2012;26(6):812.E17-22. http://dx.doi.org/10.1016/j.jvoice.2012.03.005 PMid:23026732.
    » http://dx.doi.org/10.1016/j.jvoice.2012.03.005
  • 22
    Babacan O, Drugman T, d’Alessandro N, Henrich N, Dutoit T. A comparative study of pitch extraction algorithms on a large variety of singing sounds. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing; 2013 Mai 26-31; Vancouver. Anais. Nova York: IEEE. p. 7815. http://dx.doi.org/10.1109/ICASSP.2013.6639185
    » http://dx.doi.org/10.1109/ICASSP.2013.6639185
  • 23
    Lacerda EB. Detecção de frequência fundamental baseada em mecanismos laríngeos [dissertação]. Recife: Centro de Informática/Universidade Federal de Pernambuco; 2018.
  • 24
    Spazzapan EA, Cardoso VM, Fabron EMG, Berti LC, Brasolotto AG, Marino VC. Acoustic characteristics of healthy voices of adults: from young to middle age. CoDAS. 2018;30(5):e20170225. PMid:30365649.
  • 25
    Arantes P, Linhares ME. Efeito da língua, estilo de elocução e sexo do falante sobre medidas globais da frequência fundamental. Let Hoje. 2017;52(1):26-39. http://dx.doi.org/10.15448/1984-7726.2017.1.25419
    » http://dx.doi.org/10.15448/1984-7726.2017.1.25419
  • 26
    Baravieira PB. Análise do padrão vibratório das pregas vocais em sujeitos com e sem nódulo vocal por meio da videolaringoscopia de alta velocidade [dissertação]. São Carlos: Universidade de São Paulo; 2012. http://dx.doi.org/10.11606/D.82.2012.tde-17072012-142118
    » http://dx.doi.org/10.11606/D.82.2012.tde-17072012-142118
  • 27
    Nascimento UN, Santos MAR, Gama ACC. Digital videokymography: analysis of glottal closure in adults. J Voice. 2021. In press. PMid:34417083.
  • 28
    Yamauchi A, Yokonishi H, Imagawa H, Sakakibara K, Nito T, Tayama N, et al. Quantitative analysis of digital videokymography: a preliminary study on age- and gender-related difference of vocal fold vibration in normal speakers. J Voice. 2015;29(1):109-19. http://dx.doi.org/10.1016/j.jvoice.2014.05.006 PMid:25228432.
    » http://dx.doi.org/10.1016/j.jvoice.2014.05.006

Publication Dates

  • Publication in this collection
    27 Oct 2023
  • Date of issue
    2023

History

  • Received
    29 June 2022
  • Accepted
    07 Nov 2022
Sociedade Brasileira de Fonoaudiologia Al. Jaú, 684, 7º andar, 01420-002 São Paulo - SP Brasil, Tel./Fax 55 11 - 3873-4211 - São Paulo - SP - Brazil
E-mail: revista@codas.org.br