Comparison of fundamental frequency and formants frequency measurements in two speech tasks

Viegas, Flávia; Viegas, Danieli; Guimarães, Glaucio Serra; Souza, Margareth Maria Gomes de; Luiz, Ronir Raggio; Simões-Zenari, Marcia; Nemr, Katia

doi:10.1590/1982-0216/201921612819

ABSTRACT

Purpose:

to compare the measurements of fundamental frequency (F0) and frequency of the first two formants (F1 and F2) of the seven oral vowels of the Brazilian Portuguese in two speech tasks, in adults without voice and speech disorders.

Methods:

eighty participants in the age range 18 and 40 years, paired by gender, were selected after orofacial, orthodontic and auditory-perceptual assessments of voice and speech. The speech signals were obtained from carrier phrases and sustained vowels and the values of the F0 and frequencies of F1 and F2 were estimated. The differences were verified through the t Test, and the effect size was calculated.

Results:

differences were found in the F0 measurements between the two speech tasks, in two vowels in males, and in five vowels, in females. In the F1 frequencies, differences were noted in six vowels, in men, and in two, in women. In the F2 frequencies, there was a difference in four vowels, in men, and three, in women.

Conclusion:

based on the differences found, it is concluded that the speech task for evaluation of fundamental frequency and formants’ frequencies, in the Brazilian Portuguese, can show distinct results in both glottal and supraglottal measures in the production of different oral vowels of this language. Thus, it is suggested that clinicians and researchers consider both forms of emission for a more accurate interpretation of the implications of these data in the evaluation of oral communication and therapeutic conducts.

Keywords:
Voice; Speech Production Measurement; Speech Acoustics; Phonetics

RESUMO

Objetivo:

comparar as medidas de frequência fundamental (F0) e frequência dos dois primeiros formantes (F1 e F2) das sete vogais orais do português brasileiro em duas tarefas de fala em adultos sem distúrbios de voz e fala.

Métodos:

oitenta participantes entre 18-40 anos pareados por gênero foram selecionados após avaliações orofacial, ortodôntica e perceptivo-auditiva da voz e fala. Os sinais de fala foram obtidos de sentenças-veículo e vogais sustentadas e foram estimados os valores de F0 e frequências de F1 e F2. As diferenças foram verificadas por meio do teste t e foi calculado o Effect Size.

Resultados:

foram encontradas diferenças nas medidas de f0 entre as duas tarefas de fala em duas vogais no gênero masculino e em cinco vogais no gênero feminino. Nas frequências de F1 foram notadas diferenças em seis vogais nos homens e em duas nas mulheres. Nas frequências de F2 houve diferença em quatro vogais nos homens e em três nas mulheres.

Conclusão:

a partir das diferenças encontradas, conclui-se que a tarefa de fala para avaliação de frequência fundamental e frequências dos formantes no português brasileiro pode demonstrar resultados distintos tanto em medidas glóticas como supraglóticas na produção das diferentes vogais orais deste idioma. Desta forma, sugere-se que os clínicos e pesquisadores considerem ambas formas de emissão para interpretação mais apurada das implicações destes dados na avaliação da comunicação oral e no direcionamento de condutas terapêuticas.

Descritores:
Voz; Medida da Produção da Fala; Acústica da Fala; Fonética

Introduction

Technological advances contribute to enlarge the studies on speech sciences. Among the many forms of assessment, the acoustic analysis of speech and voice stands out for being noninvasive and relatively low-cost¹1. Maryn Y, Corthals P, Van Cauwenberge P, Roy N, De Bodt M. Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. J Voice. 2010;24(5):540-55., which contributes for it to be frequently used in researches conducted by different professionals, including the speech-language-hearing therapist²2. Lima MFB, Camargo ZA, Ferreira LP, Madureira S. Qualidade vocal e formantes das vogais de falantes adultos da cidade de João Pessoa. Rev. CEFAC. 2007;9(1):99-109.

3. Viegas F, Viegas D, Baeck HE. Frequency measurement of vowel formants produced by Brazilian children aged between 4 and 8 years. J Voice. 2015;29(3):292-8.^-⁴4. Braga JN, Oliveira DSF, Sampaio TMM. Frequência fundamental da voz de crianças. Rev. CEFAC. 2009;11(1):119-26..

It is possible to observe, in the literature, different methodologies for the analysis of the same phenomena. In speech-language-hearing sciences, the acoustic parameters frequently investigated are the fundamental frequency and the vowel formant frequencies³3. Viegas F, Viegas D, Baeck HE. Frequency measurement of vowel formants produced by Brazilian children aged between 4 and 8 years. J Voice. 2015;29(3):292-8.

4. Braga JN, Oliveira DSF, Sampaio TMM. Frequência fundamental da voz de crianças. Rev. CEFAC. 2009;11(1):119-26.

5. Viegas F, Viegas D, Atherino CCT, Baeck HE. Frequência fundamental das 7 vogais orais do português em vozes de crianças. Rev. CEFAC. 2010;12(4):563-70.^-⁶6. França FP, Evangelista DS, Lopes LW. Revisão sistemática sobre os formantes e a produção da voz e fala. Rev Prolíngua. 2017;12(1):2-16..

The fundamental frequency (F0) produced by the vibration of the vocal folds and its harmonics are modified in the supraglottal cavities, which work as a filter attenuating some frequencies and amplifying other ones. The amplified frequency ranges are known as the formant frequencies, of which the most studied are the first two (F1 and F2), as they furnish phonetical identity to the vowels. The frequency of the first formant (F1) presents relation to the vertical position of the tongue and with the degree of mandible opening; its value is inversely proportional to the position of the linguomandibular complex. The frequency of the second formant (F2) is influenced by the anteroposterior displacement of the tongue, the more anterior the constriction of the tongue, the greater will be the value of F2; and, the more posterior, lower will be that measure⁷7. Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015.

8. Stevens KN, House AS. An acoustical theory of vowel production and some of its implications. J Speech Hear Res. 1961;4(4):303-20.

9. Johnson K. Acoustic and auditory phonetics. 3rd ed. Oxford:Wiley-Blackwell; 2012.

10. Fant G. Acoustic theory of speech production. The Hague: Mouton; 1960.

11. Lehiste I. Suprasegmentals. Cambridge: MIT Press, 1970.^-¹²12. Kent RD, Read C. Análise acústica da fala. 1a. ed. São Paulo: Cortez Editora; 2015..

Both the values of fundamental frequency and those of the formant frequencies present correlation with the language. In Portuguese, according to the position of the tongue on the vertical axis, the vowels may be divided in: low [a], medium-low [Ԑ] and [ᴐ], medium-high [e] e [o], and high [i] and [u]. Thus, they form four F1 regions; and so, except for the vowel [a], each anterior vowel has F1 frequency similar to its correspondent posterior vowel. On the anteroposterior axis, the vowels are classified according to the following oral cavity regions: medium [a], anterior [Ԑ], [e], [i], and posterior [ᴐ], [o], [u]. Since these regions are related to F2 measures, the anterior vowels present higher F2 values, and posterior ones, lower measures of this parameter⁷7. Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015.^,¹³13. Escudero P, Boersma P, Rauber AS, Bion RAH. A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. J Acoust Soc Am. 2009;126(3):1379-93. (Figure 1). The height of the tongue also reflects on F0 values, as high vowels have higher pitch values than the lower ones. Hence, in Portuguese, the vowels with higher F0 values are [u] and [i]. The position of the vowel on the anteroposterior axis also influences this parameter, once posterior vowels usually present higher F0 values than their anterior correspondents¹³13. Escudero P, Boersma P, Rauber AS, Bion RAH. A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. J Acoust Soc Am. 2009;126(3):1379-93..

Figure 1:
Schematic representation of the position of oral vowels in the oral cavity, in the Brazilian Portuguese

In these measures, distinctions between the genders are also observed, mainly due to anatomical differences. In general, since among females the vocal folds and vocal tract are shorter, higher values for F0 and the formant frequencies are expected, in relation to men, who have longer vocal tract and vocal folds, and so, lower frequencies¹⁴14. Sundberg J. Ciência da voz: fatos sobre a voz na fala e no canto. 2a. ed. São Paulo: EdUSP; 2018..

The fundamental frequency is the most robust parameter for studying voice, and the formant frequencies are essential for the identification of the vowels, and they enable for articulatory interpretations of acoustic data⁷7. Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015..

The proposition of studying differences between acoustic parameters of two forms of emission (carrier phrases and sustained vowels) was based on the fact that these are the most used speech tasks in researches and speech-language-hearing clinic, as referred in a recent systematic review regarding formants and production of voice and speech⁶6. França FP, Evangelista DS, Lopes LW. Revisão sistemática sobre os formantes e a produção da voz e fala. Rev Prolíngua. 2017;12(1):2-16..

Studies that investigated the differences between continuous speech and sustained vowels concentrated on the perception of dysphonia degree¹1. Maryn Y, Corthals P, Van Cauwenberge P, Roy N, De Bodt M. Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. J Voice. 2010;24(5):540-55.^,¹⁵15. Maryn Y, Roy N. Sustained vowels and continuous speech in the auditory-perceptual evaluation of dysphonia severity. J Soc Bras Fonoaudiol. 2012;24(2):107-12.

16. Lu FL, Matteson S. Speech tasks and interrater reliability in perceptual voice evaluation. J Voice. 2014;28(6):725-32.^-¹⁷17. Gerratt BR, Kreiman J, Garellek M. Comparing measures of voice quality from sustained phonation and continuous speech. JSLHR. 2016;59(5):994-1001.; it is, thus, a theme little approached in individuals with unaltered voices. Only one of the researches found had analyzed glottal parameters of healthy people on two speech tasks in Brazilian Portuguese¹⁸18. Spazzapan EA, Cardoso VM, Fabron EMG, Berti LC, Brasolotto AG, Marino VC de C. Acoustic characteristics of healthy voices of adults: from young to middle age. CoDAS. 2018;30(5):1-7.. However, no data on the formant frequencies with this same outline were found.

Therefore, the comparison of measurements of F0 and of the two first formants on these speech tasks in people without articulatory and vocal disorders is relevant, especially in Brazilian Portuguese, due to the lack in the literature thereof. It is important that all vowels be analyzed due to the circumstances in the position of the articulators for them to be produced. The characterization of such aspects will collaborate to a more refined knowledge of the variants in producing speech, and it can aid the work of the speech-language-hearing therapist both in the clinic and in improving oral communication, since different results can be found depending on the form of emission chosen for assessment of the clients.

In this sense, this study aimed at comparing the fundamental frequency and the two first formant frequencies (F1 and F2) of all the seven oral vowels of the Brazilian Portuguese (BP) between the emissions in sustained vowels and in continuous speech, with the use of carrier phrases in people without dysphonia and speech disorders.

Methods

This is an observational, descriptive, cross-sectional study, whose participants were divided into two groups according to their gender. The project was approved by the Research Ethics Committees of the institutions involved: Faculdade de Medicina da Universidade de São Paulo (Authorization No. 1.540.289/2016) and Hospital Universitário Antônio Pedro of the Universidade Federal Fluminense (Authorization No. 1.585.551/2016). All the participants signed the Informed Consent Form.

Selection of the Participants

To compose the sample of this paper, 80 people were included, paired by gender, aged between 18 and 40 years (men: X = 23.3 years, SD=2.71; women: 𝑋 = 22.2 years, SD= 2.66).

The participants were interviewed by the first author of the research, and answered a questionnaire with personal data and questions related to their health conditions. Afterwards, they were evaluated by orthodontists, coauthors of this study, and underwent speech-language-hearing assessment of the orofacial structures and auditory-perceptual voice and speech assessment.

The inclusion criteria were: not having a history of respiratory, auditory, vocal or speech disorders, not being a smoker, being a native speaker of Brazilian Portuguese from the city of Rio de Janeiro, having normal occlusion or Angle Class I with balanced maxillomandibular relation in the three dimensions of space, harmonic profile, and small alterations of dental positioning. The inclusion of participants with Angle Class I was adopted once the patients with normal occlusion, i.e., without dentoskeletal alterations, are rare. The participants should present scores corresponding to grade 4 (normal) in the evaluation of the orofacial structures by means of the OMES-E Protocol¹⁹19. De Felício CM, Folha GA, Ferreira CLP, Medeiros APM. Expanded protocol of orofacial myofunctional evaluation with scores: validity and reliability. Int J Pediatr. Otorhinolaryngol. 2010;74(11):1230-9., score zero on the general dysphonia degree (G) according to the GRBAS auditory-perceptual evaluation scale²⁰20. Isshiki N, Okamura H, Tanabe M, Morimoto M. Differential diagnosis of hoarseness. Folia Phoniatr. Logoapedica. 1969;21(1):9-19., balanced resonance, and not presenting speech disorders. The exclusion factors were: presence of open-bite, anterior or posterior crossbite, absence of teeth or presence of supernumerary teeth. The participants that reported the presence of cold or allergic processes on the day the speech samples were being collected, or that for some reason could not adequately perform the emissions, were excluded from the sample.

Recording of the Speech Signals and Signal Processing

The recording of the speech signals followed a methodology tested in previous researches³3. Viegas F, Viegas D, Baeck HE. Frequency measurement of vowel formants produced by Brazilian children aged between 4 and 8 years. J Voice. 2015;29(3):292-8.^,⁵5. Viegas F, Viegas D, Atherino CCT, Baeck HE. Frequência fundamental das 7 vogais orais do português em vozes de crianças. Rev. CEFAC. 2010;12(4):563-70.^,²¹21. Viegas D, Viegas F, Atherino CCT, Baeck H. Parâmetros espectrais da voz em crianças respiradoras orais. Rev. CEFAC. 2010;12(5):820-30..

For the estimation of the fundamental frequency and the formant frequencies, the speech signals were obtained from: a) recording of carrier phrase: “Fale____ para mim” (“Say____ to me”) , filled in with the words “pápa”, “pépe”, “pêpe”, “pípi”, “pópo”, “pôpo” e “púpu”; and, b) in prolonged emission of the seven oral vowels of Brazilian Portuguese (BP) for three seconds. The participants read the instructions and performed tasks with comfortable pitch and loudness. Each speech task was repeated four times, and the two emissions with best definition of formants tracing were selected.

The recordings took place in a silent room, with the use of Praat software, version 6.0.16 (P. Boresmaand D. Weenink, University of Amsterdam, Netherlands, free, available at http://www.fon.hum.uva.nl/praat/), in mono-channel, with a sampling rate of 22,050Hz, and in .wav format. A notebook, HP brand (Hewlett-Packard, USA), was used, with Windows 10 operational system, as well as a microphone, Shure brand, model SM 58 (Shure, USA), positioned at a distance of 10 cm from the lips of the person.

The parts with best definition of formants tracing of two emissions of each task were identified, based on the overlapping LPC tracings on broadband spectrogram. The 10 milliseconds of the intermediate portion of each vowel were manually clipped for the estimation of values (Figure 2).

Figure 2:
Example of selection of the ten milliseconds of the intermediary portion of vowel [Ԑ] from a broadband spectrogram, with the aid of PRAAT software.

After the clipping, each segment was saved in a .wav extension file. For the digital processing of the signals, a script created with the Praat software was used, which had been tested in previous studies³3. Viegas F, Viegas D, Baeck HE. Frequency measurement of vowel formants produced by Brazilian children aged between 4 and 8 years. J Voice. 2015;29(3):292-8.^,⁵5. Viegas F, Viegas D, Atherino CCT, Baeck HE. Frequência fundamental das 7 vogais orais do português em vozes de crianças. Rev. CEFAC. 2010;12(4):563-70.^,²²22. Loureiro LMJ, Gameiro MGH. Interpretação crítica dos resultados estatísticos: para lá da significância estatística. Rev Enferm Ref. 2011;3(3):151-62.. The measurements were obtained from two samples of carrier phrases (CP) and sustained vowels (SV) of each vowel for all the participants. Thus, 3,360 parametric values were collected, composed of three parameters (F0, F1 and F2) X seven vowels X two samples X 80 individuals. All the clippings of the vowels were carried out by the same researcher.

The values obtained with the script were revised in three different moments to ensure that the measurements were correct. Hence, the first researcher manually conferred the measurements, while another author verified the frequencies through the script and manually conferred the values. In the cases in which there was divergence between the automatic and the manual estimations, the measurements obtained manually were considered. These procedures were adopted to avoid estimation errors, mainly in the posterior vowels, which, as they present proximities in the first two formant frequencies¹²12. Kent RD, Read C. Análise acústica da fala. 1a. ed. São Paulo: Cortez Editora; 2015., are liable to the occurrence of automatic estimation errors by the programs.

After this stage, the average between the two emissions of SV and CP of each participant was calculated. Therefore, the final value of each measurement corresponded to average of the two emissions in each modality. The formants frequencies of each gender were plotted by means of a program available at http://www.adambaker.org/formant-chart/formant-chart.html (Figures 3 and 4).

Figure 3:
Plotting of F1 and F2 in the sustained vowels and carrier phrase tasks in the male group

Figure 4:
Plotting of F1 and F2 in the sustained vowels and carrier phrase tasks in the female group

Statistical Analysis

The statistical analysis was conducted with the use of the Statistical Package for Social Sciences for Windows (SPSS®, Inc. Chicago, Illinois), and the average, median and standard deviation measures of central tendency were considered.

To verify the normality of data distribution, the nonparametric Kolmogorov-Smirnov test was used, and there were noted evidences that the variables presented normal distribution.

For the comparison of the F0, F1 and F2 measurements between the forms of emission researched, the Paired t Test was used. The level of significance adopted for rejecting the null hypothesis (frequencies in the two forms of emissions were equal) was equal to or lower than 0.05 (5%). The alternative hypothesis was that there would be differences between the two analyzed forms of emission.

The effect size was also calculated, as it is an important complement of statistical significance test. The objective was to verify the degree in which the phenomenon was present in the population studied; the greater its value, the greater was the presence of the phenomenon. The ES values are considered small (0.20≤d<0.50), medium (0.50≤d<0.80) or large (d≥0.80) ²²22. Loureiro LMJ, Gameiro MGH. Interpretação crítica dos resultados estatísticos: para lá da significância estatística. Rev Enferm Ref. 2011;3(3):151-62..

Results

A difference has been observed between the averages of the two forms of emission (CP and SV), both of the F0 and the frequencies of the first two formants in several oral vowels of the Brazilian Portuguese.

In the males, higher F0 values were found in two vowels, and higher F1 values in six vowels in the CP emission. The F2 values were lower in the CP emission in four vowels. In this group, the largest effect size value was found in F1 of the vowel [i] (Table 1).

Thumbnail

Table 1:
Descriptive values and statistical treatment of fundamental frequency and frequency of the first two formants in the emissions of carrier phrases and sustained vowels, in males

In the females, lower F0 values were observed in five vowels, higher F1 values in tow vowels, besides lower F2 values in three vowels in CP emission (Table 2).

Thumbnail

Table 2:
Descriptive values and statistical treatment of fundamental frequency and frequency of the first two formants in the emissions of carrier phrases and sustained vowels, in females

Discussion

In this study, the averages of the fundamental frequency and the frequencies of the first two formants were compared in emissions in carrier phrases and sustained vowel in people without dysphonia and speech disorders. After a bibliographical survey was conducted, it was noted a shortage of studies analyzing the differences between these two speech tasks in different vowels of the Brazilian Portuguese in vocally healthy individuals, which limits the comparison with the present results.

Fundamental Frequency

When analyzing the values of fundamental frequency, it was noted that higher pitch measurements in the CP were found only in the anterior [i] and posterior [u] high vowels, in the male group, with effect size values considered small. Therefore, a hypothesis for such findings was that these results may have been favored by elevation of the hyoid-laryngeal complex during the coarticulation process present in this kind of emission. The presence of differences only in the high vowels may present correlation with the symmetry between the height of the tongue and F0 measurements present in Portuguese¹³13. Escudero P, Boersma P, Rauber AS, Bion RAH. A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. J Acoust Soc Am. 2009;126(3):1379-93.. In the literature, two studies were found that compared measurements of the vowel [a] between sustained vowels and continuous speech. In one of them, averages of F0 close to the male voices were observed; this tendency was also observed in this vowel in this investigation. However, another study observed reduction of F0 in the sustained emission of the vowel [a] in relation to that emitted through text reading in this gender²³23. Moon KR, Chung SM, Park HS, Kim HS. Materials of acoustic analysis: sustained vowel versus sentence. J Voice. 2012;26(5):563-5..

In the female group, higher pitch measurements were found in the sustained emissions in five vowels, with medium effect size in four of them. Therefore, this was the parameter that most differed in this gender. As in men, symmetry between height of the tongue and F0 averages was also observed, with statistical differences. Hence, the anterior medium-low [Ԑ] and medium-high [e] vowels presented tendency similar to their corresponding posterior ones, medium-low [ᴐ] and medium-high [o] vowels.Likewise, higher F0 measurement in the sustained vowel was observed in the vowel [a], which, for being central, has no correspondence with another vowel on the anteroposterior axis¹³13. Escudero P, Boersma P, Rauber AS, Bion RAH. A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. J Acoust Soc Am. 2009;126(3):1379-93..

Once these differences were noted in most of the vowels in the female group in the speech task closest to usual communication situations, it was hypothesized that, by basing on physiological aspects that indicate that higher F0 values can demonstrate higher elevation of the hyoid-laryngeal complex and higher vibration speed of the vocal folds, these findings may aid in understanding some clinical implications based on the observation of which speech task was used in the assessment. Thus, assuming that the sustained vowels are historically the most investigated and most used form of emission in speech-language-hearing clinic¹1. Maryn Y, Corthals P, Van Cauwenberge P, Roy N, De Bodt M. Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. J Voice. 2010;24(5):540-55., and that the present findings demonstrated higher F0 values in SV in most of the vowels in women without dysphonia, probably this muscular adjustment may reflect on the increase of the degree of dysphonia reported in researches¹⁵15. Maryn Y, Roy N. Sustained vowels and continuous speech in the auditory-perceptual evaluation of dysphonia severity. J Soc Bras Fonoaudiol. 2012;24(2):107-12.

16. Lu FL, Matteson S. Speech tasks and interrater reliability in perceptual voice evaluation. J Voice. 2014;28(6):725-32.^-¹⁷17. Gerratt BR, Kreiman J, Garellek M. Comparing measures of voice quality from sustained phonation and continuous speech. JSLHR. 2016;59(5):994-1001., given that, physiologically, higher-pitched sounds require more muscular refinement to be produced.

A hypothesis for higher F0 values in the SV in most of the vowels produced by women would be that, as this type of emission is not part of a usual communicative context¹⁸18. Spazzapan EA, Cardoso VM, Fabron EMG, Berti LC, Brasolotto AG, Marino VC de C. Acoustic characteristics of healthy voices of adults: from young to middle age. CoDAS. 2018;30(5):1-7., there is greater probability of interference of the speaker²³23. Moon KR, Chung SM, Park HS, Kim HS. Materials of acoustic analysis: sustained vowel versus sentence. J Voice. 2012;26(5):563-5.. Higher F0 values in the sustained vowel [a] in relation to continuous speech were also observed in three age groups of women in a study that investigated normal voices; however, the differences were subtle¹⁸18. Spazzapan EA, Cardoso VM, Fabron EMG, Berti LC, Brasolotto AG, Marino VC de C. Acoustic characteristics of healthy voices of adults: from young to middle age. CoDAS. 2018;30(5):1-7.. The same tendency of increase in F0 in the sustained vowel in relation to continuous speech of female voices was observed in another study²³23. Moon KR, Chung SM, Park HS, Kim HS. Materials of acoustic analysis: sustained vowel versus sentence. J Voice. 2012;26(5):563-5..

Another possible justification for the symmetry found between anterior and posterior vowels with statistical differences may be based on the existing correlation between tongue constriction position on the vertical axis and fundamental frequency¹³13. Escudero P, Boersma P, Rauber AS, Bion RAH. A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. J Acoust Soc Am. 2009;126(3):1379-93.. Hence, even with the assessment of different emission tasks, the fundamental frequency values followed similar tendencies according with the height of the vowel in both forms of emission.

The differences found between the forms of emission are also supported in other papers²⁴24. Honda K. Relationship between pitch control and vowel articulation. Haskins Laboratories Status Report on Speech Research. 1983;73:269-82.^,²⁵25. Shaw JA, Chen W, Proctor MI, Derrick D. Influences of tone on vowel articulation in mandarin chinese. JSLHR. 2016;59(6):S1566-74. which associated the change in tone with articulatory aspects, and highlighted the interaction between glottal source and filter. By analyzing the relation between the control of frequency and articulation of vowels, a research²⁴24. Honda K. Relationship between pitch control and vowel articulation. Haskins Laboratories Status Report on Speech Research. 1983;73:269-82. reported that the changes found in fundamental frequency, in addition to being originated at the intrinsic laryngeal tensor muscles, could also partly result from the geniohyoid muscle, genioglossus muscle, and hyoid bone movements. The extrinsic tongue and laryngeal muscles influence directly and indirectly the position of the hyoid-laryngeal complex and the intralaryngeal configuration.

Formants Frequencies

In the male group, the measurements that most differed between the tasks analyzed were the frequencies of the first formant. The results demonstrated higher values in the CP in six vowels, with medium effect size in four of them (Figure 3). In the female group, the same tendency was observed, though only in the vowels [a] and [Ԑ] with medium effect size (Figure 4). Therefore, by establishing an acoustic-articulatory correspondence, it is possible to infer that the tongue was in a lower position and the mandible in more open position, besides having occurred greater narrowing of the pharynx in the carrier phrases in relation to the sustained vowels⁷7. Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015.. A hypothesis for the differentiation found between the two speech tasks would be the interference of the coarticulation phenomenon present in continuous speech, once a given segment influences adjacent segments; i.e., in the analyzed vowel, there are acoustic hints of the consonant that precedes it⁷7. Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015.^,¹²12. Kent RD, Read C. Análise acústica da fala. 1a. ed. São Paulo: Cortez Editora; 2015.^,²⁶26. Kent RD, Vorperian HK. Static measurements of vowel formant frequencies and bandwidths: a review. J Commun Disord. 2018;74:74-97.. In women, the differences found only in the vowels [a] and [Ԑ] may have occurred due to the position of the height of the tongue inherent to their production, i.e., low tongue and open mandible in vowel [a], and anterior medium-low tongue in vowel [Ԑ].

The F2 values presented lower averages in CP in all posterior vowels ([ᴐ], [o], [u]) in both genders, besides the vowel [a] in the male group (Figures 3 and 4). Based on the observation of these data, it is possible to infer, by means of an acoustic-articulatory correspondence, that the tongue constriction position was more posterior and the conformation of the pharynx was narrower⁷7. Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015. than in the sustained vowels. A hypothesis for the reduction in F2 values would be a greater interference of the coarticulation present in the continuous speech in these vowels, since the movement of the articulators to produce a sound will change because of the nearby sounds⁷7. Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015.^,¹²12. Kent RD, Read C. Análise acústica da fala. 1a. ed. São Paulo: Cortez Editora; 2015.^,²⁶26. Kent RD, Vorperian HK. Static measurements of vowel formant frequencies and bandwidths: a review. J Commun Disord. 2018;74:74-97.. The lower F2 values in the CP in all posterior vowels in both genders may have been favored by the tongue posterior constriction inherent to their production. And, in males, when analyzing the vowel [a], it was possible to observe the lowering of the oromandibular complex by the increase of F1 in the carrier phrases, which may have probably contributed to a more posterior constriction of the tongue, collaborating for the reduction of F2 frequency in this vowel.

The fact that the production of some vowels did not present statistical differences between the speech tasks is supported in the literature, which highlights that, even though different forms of emission may use different muscle adjustments for the production of the same vowels, adaptations in the articulators may occur and, thus, not produce so many differences in particular in the formant frequencies¹⁴14. Sundberg J. Ciência da voz: fatos sobre a voz na fala e no canto. 2a. ed. São Paulo: EdUSP; 2018..

The acoustic measurement values presented in this paper represent the averages of the population studied, according to a methodology tested in other studies³3. Viegas F, Viegas D, Baeck HE. Frequency measurement of vowel formants produced by Brazilian children aged between 4 and 8 years. J Voice. 2015;29(3):292-8.^,⁵5. Viegas F, Viegas D, Atherino CCT, Baeck HE. Frequência fundamental das 7 vogais orais do português em vozes de crianças. Rev. CEFAC. 2010;12(4):563-70.^,²¹21. Viegas D, Viegas F, Atherino CCT, Baeck H. Parâmetros espectrais da voz em crianças respiradoras orais. Rev. CEFAC. 2010;12(5):820-30., there not being the intention of proposing parameters of normality.

The tendencies observed of elevation of F0 in some vowels in men and lowering of pitch in several vowels in women, as well as elevation of F1 values and lowering of F2 measurements in the posterior vowels in the carrier phrases may be complemented with other studies. Therefore, it is suggested that more researches with this scope be developed in order to increase the information on these differences in Brazilian Portuguese.

Limitations

Although this study has contributed on the differences that may be found in acoustic parameters according to the speech task assessed in people without voice and speech disorders, some limitations must be recognized. Firstly, only the two most used speech tasks in research and speech-language-hearing clinic were examined; however, other forms of emission were not considered. Secondly, even though other researchers, according to their objectives, may concentrate on evaluating measurements obtained from spontaneous or semi-spontaneous speech with more complex excerpts, it was opted for the use of the most referred carrier phrases in the literature, which can, in the future, allow for the comparison of data with other researches using the same corpus.

Contributions

The results of this research can aid the speech-language-hearing therapists who work both in the clinic and on researches and on the improvement of speech and voice, as it reinforces the presuppositions that different emission tasks may produce distinct acoustic measurements. Hence, it is highlighted the importance of the speech-language-hearing therapist consider more than one emission form when assessing their clients, and that this may aid in guiding their work in accordance with the therapeutic objectives for each case. It should be also emphasized that no isolated measure in the speech-language-hearing clinic is enough to define the conduct; nonetheless, the set of information aids the clinician in making better decisions.

Conclusion

There was a difference between the measurements of fundamental frequency and F1 and F2 frequencies between the two speech tasks. In the male group, the vowels [i] and [u] were higher-pitched, and in the female group, the vowels [a], [Ԑ], [e], [ᴐ], [o], lower-pitched in the carrier phrases. The F1 frequencies were distinguished in [a], [Ԑ], [e], [i], [ᴐ] and [o], in men, and, [a] and [Ԑ], in women, being higher in the CP. The F2 frequencies demonstrated lower values in all posterior vowels in both genders, besides vowel [a], in the male group. Therefore, it is concluded that the speech task for assessing fundamental frequency and formants frequency in the Brazilian Portuguese may demonstrate distinct results both in glottal and supraglottal measurements, when producing the different oral vowels of this language. Hence, it is suggested that clinicians and researchers consider both forms of emission for a more accurate interpretation of the implications of these data in assessing oral communication, thus, guiding their therapeutic conducts.

Acknowledgement

Gratitude is extended to the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) for financing in part the accomplishment of this study (Finance Code 001).

REFERENCES

¹
Maryn Y, Corthals P, Van Cauwenberge P, Roy N, De Bodt M. Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. J Voice. 2010;24(5):540-55.
²
Lima MFB, Camargo ZA, Ferreira LP, Madureira S. Qualidade vocal e formantes das vogais de falantes adultos da cidade de João Pessoa. Rev. CEFAC. 2007;9(1):99-109.
³
Viegas F, Viegas D, Baeck HE. Frequency measurement of vowel formants produced by Brazilian children aged between 4 and 8 years. J Voice. 2015;29(3):292-8.
⁴
Braga JN, Oliveira DSF, Sampaio TMM. Frequência fundamental da voz de crianças. Rev. CEFAC. 2009;11(1):119-26.
⁵
Viegas F, Viegas D, Atherino CCT, Baeck HE. Frequência fundamental das 7 vogais orais do português em vozes de crianças. Rev. CEFAC. 2010;12(4):563-70.
⁶
França FP, Evangelista DS, Lopes LW. Revisão sistemática sobre os formantes e a produção da voz e fala. Rev Prolíngua. 2017;12(1):2-16.
⁷
Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015.
⁸
Stevens KN, House AS. An acoustical theory of vowel production and some of its implications. J Speech Hear Res. 1961;4(4):303-20.
⁹
Johnson K. Acoustic and auditory phonetics. 3rd ed. Oxford:Wiley-Blackwell; 2012.
¹⁰
Fant G. Acoustic theory of speech production. The Hague: Mouton; 1960.
¹¹
Lehiste I. Suprasegmentals. Cambridge: MIT Press, 1970.
¹²
Kent RD, Read C. Análise acústica da fala. 1a. ed. São Paulo: Cortez Editora; 2015.
¹³
Escudero P, Boersma P, Rauber AS, Bion RAH. A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. J Acoust Soc Am. 2009;126(3):1379-93.
¹⁴
Sundberg J. Ciência da voz: fatos sobre a voz na fala e no canto. 2a. ed. São Paulo: EdUSP; 2018.
¹⁵
Maryn Y, Roy N. Sustained vowels and continuous speech in the auditory-perceptual evaluation of dysphonia severity. J Soc Bras Fonoaudiol. 2012;24(2):107-12.
¹⁶
Lu FL, Matteson S. Speech tasks and interrater reliability in perceptual voice evaluation. J Voice. 2014;28(6):725-32.
¹⁷
Gerratt BR, Kreiman J, Garellek M. Comparing measures of voice quality from sustained phonation and continuous speech. JSLHR. 2016;59(5):994-1001.
¹⁸
Spazzapan EA, Cardoso VM, Fabron EMG, Berti LC, Brasolotto AG, Marino VC de C. Acoustic characteristics of healthy voices of adults: from young to middle age. CoDAS. 2018;30(5):1-7.
¹⁹
De Felício CM, Folha GA, Ferreira CLP, Medeiros APM. Expanded protocol of orofacial myofunctional evaluation with scores: validity and reliability. Int J Pediatr. Otorhinolaryngol. 2010;74(11):1230-9.
²⁰
Isshiki N, Okamura H, Tanabe M, Morimoto M. Differential diagnosis of hoarseness. Folia Phoniatr. Logoapedica. 1969;21(1):9-19.
²¹
Viegas D, Viegas F, Atherino CCT, Baeck H. Parâmetros espectrais da voz em crianças respiradoras orais. Rev. CEFAC. 2010;12(5):820-30.
²²
Loureiro LMJ, Gameiro MGH. Interpretação crítica dos resultados estatísticos: para lá da significância estatística. Rev Enferm Ref. 2011;3(3):151-62.
²³
Moon KR, Chung SM, Park HS, Kim HS. Materials of acoustic analysis: sustained vowel versus sentence. J Voice. 2012;26(5):563-5.
²⁴
Honda K. Relationship between pitch control and vowel articulation. Haskins Laboratories Status Report on Speech Research. 1983;73:269-82.
²⁵
Shaw JA, Chen W, Proctor MI, Derrick D. Influences of tone on vowel articulation in mandarin chinese. JSLHR. 2016;59(6):S1566-74.
²⁶
Kent RD, Vorperian HK. Static measurements of vowel formant frequencies and bandwidths: a review. J Commun Disord. 2018;74:74-97.

Research support source: Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES - Finance Code 001.

Publication Dates

Publication in this collection
02 Dec 2019
Date of issue
2019

History

Received
05 Aug 2019
Accepted
30 Oct 2019

This is an open-access article distributed under the terms of the Creative Commons Attribution License

[1] ¹
Maryn Y, Corthals P, Van Cauwenberge P, Roy N, De Bodt M. Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. J Voice. 2010;24(5):540-55.

[2] ²
Lima MFB, Camargo ZA, Ferreira LP, Madureira S. Qualidade vocal e formantes das vogais de falantes adultos da cidade de João Pessoa. Rev. CEFAC. 2007;9(1):99-109.

[3] ³
Viegas F, Viegas D, Baeck HE. Frequency measurement of vowel formants produced by Brazilian children aged between 4 and 8 years. J Voice. 2015;29(3):292-8.

[4] ⁴
Braga JN, Oliveira DSF, Sampaio TMM. Frequência fundamental da voz de crianças. Rev. CEFAC. 2009;11(1):119-26.

[5] ⁵
Viegas F, Viegas D, Atherino CCT, Baeck HE. Frequência fundamental das 7 vogais orais do português em vozes de crianças. Rev. CEFAC. 2010;12(4):563-70.

[6] ⁶
França FP, Evangelista DS, Lopes LW. Revisão sistemática sobre os formantes e a produção da voz e fala. Rev Prolíngua. 2017;12(1):2-16.

[7] ⁷
Barbosa PA, Madureira S. Manual de fonética acústica experimental. São Paulo: Cortez Editora; 2015.

[8] ⁸
Stevens KN, House AS. An acoustical theory of vowel production and some of its implications. J Speech Hear Res. 1961;4(4):303-20.

[9] ⁹
Johnson K. Acoustic and auditory phonetics. 3rd ed. Oxford:Wiley-Blackwell; 2012.

[10] ¹⁰
Fant G. Acoustic theory of speech production. The Hague: Mouton; 1960.

[11] ¹¹
Lehiste I. Suprasegmentals. Cambridge: MIT Press, 1970.

[12] ¹²
Kent RD, Read C. Análise acústica da fala. 1a. ed. São Paulo: Cortez Editora; 2015.

[13] ¹³
Escudero P, Boersma P, Rauber AS, Bion RAH. A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. J Acoust Soc Am. 2009;126(3):1379-93.

[14] ¹⁴
Sundberg J. Ciência da voz: fatos sobre a voz na fala e no canto. 2a. ed. São Paulo: EdUSP; 2018.

[15] ¹⁵
Maryn Y, Roy N. Sustained vowels and continuous speech in the auditory-perceptual evaluation of dysphonia severity. J Soc Bras Fonoaudiol. 2012;24(2):107-12.

[16] ¹⁶
Lu FL, Matteson S. Speech tasks and interrater reliability in perceptual voice evaluation. J Voice. 2014;28(6):725-32.

[17] ¹⁷
Gerratt BR, Kreiman J, Garellek M. Comparing measures of voice quality from sustained phonation and continuous speech. JSLHR. 2016;59(5):994-1001.

[18] ¹⁸
Spazzapan EA, Cardoso VM, Fabron EMG, Berti LC, Brasolotto AG, Marino VC de C. Acoustic characteristics of healthy voices of adults: from young to middle age. CoDAS. 2018;30(5):1-7.

[19] ¹⁹
De Felício CM, Folha GA, Ferreira CLP, Medeiros APM. Expanded protocol of orofacial myofunctional evaluation with scores: validity and reliability. Int J Pediatr. Otorhinolaryngol. 2010;74(11):1230-9.

[20] ²⁰
Isshiki N, Okamura H, Tanabe M, Morimoto M. Differential diagnosis of hoarseness. Folia Phoniatr. Logoapedica. 1969;21(1):9-19.

[21] ²¹
Viegas D, Viegas F, Atherino CCT, Baeck H. Parâmetros espectrais da voz em crianças respiradoras orais. Rev. CEFAC. 2010;12(5):820-30.

[22] ²²
Loureiro LMJ, Gameiro MGH. Interpretação crítica dos resultados estatísticos: para lá da significância estatística. Rev Enferm Ref. 2011;3(3):151-62.

[23] ²³
Moon KR, Chung SM, Park HS, Kim HS. Materials of acoustic analysis: sustained vowel versus sentence. J Voice. 2012;26(5):563-5.

[24] ²⁴
Honda K. Relationship between pitch control and vowel articulation. Haskins Laboratories Status Report on Speech Research. 1983;73:269-82.

[25] ²⁵
Shaw JA, Chen W, Proctor MI, Derrick D. Influences of tone on vowel articulation in mandarin chinese. JSLHR. 2016;59(6):S1566-74.

[26] ²⁶
Kent RD, Vorperian HK. Static measurements of vowel formant frequencies and bandwidths: a review. J Commun Disord. 2018;74:74-97.

Parameters	MALES (n =40)
	Carrier phrase (CP)			Sustained vowel (SV)			t Test	Effect size
	Average (Hz)	Median (Hz)	Standard deviation	Average (Hz)	Median (Hz)	Standard deviation	p value	Effect size
F0 [a]	116	114	14.12	118	115	15.01	0.128	- 0.139
F0 [Ԑ]	117	115	16.49	118	114	15.45	0.790	- 0.063
F0 [e]	123	120	17.80	122	119	16.29	0.258	0.059
F0 [i]	132	131	19.17	126	122	18.46	0.013*	0.322
F0 [ᴐ]	119	118	15.16	120	118	16.72	0.660	- 0.063
F0 [o]	126	124	19.37	124	122	18.70	0.384	0.106
F0 [u]	136	135	22.35	127	124	18.19	<0.001*	0.447
F1 [a]	795	791	72.80	746	753	72.09	<0.001*	0.684
F1 [Ԑ]	554	558	45.12	535	540	41.14	0.015*	0.445
F1 [e]	358	358	31.83	341	346	30.37	0.006*	0.553
F1 [i]	313	320	27.55	277	276	25.27	<0.001*	1.379
F1 [ᴐ]	588	584	41.89	552	544	54.51	<0.001*	0.750
F1 [o]	421	418	32.11	402	394	32.59	<0.001*	0.594
F1 [u]	351	347	34.90	346	343	34.62	0.510	0.145
F2 [a]	1304	1309	83.62	1349	1346	86.79	0.009 *	- 0.534
F2 [Ԑ]	1946	1952	114.86	1969	1973	134.99	0.188	- 0.185
F2 [e]	2203	2189	175.07	2187	2189	128.36	0.479	0.105
F2 [i]	2236	2251	144.33	2226	2239	134.07	0.491	0.072
F2 [ᴐ]	898	916	67.17	975	969	69.93	<0.001*	- 1.137
F2 [o]	741	724	72.23	817	813	82.68	<0.001*	- 0.801
F2 [u]	721	722	63.86	792	797	64.95	<0.001*	- 1.116

Parameters	FEMININO (n =40)
	Carrier phrase (CP)			Sustained vowel (SV)			t Test	Effect size
	Average (Hz)	Median (Hz)	Standard deviation	Average (Hz)	Median (Hz)	Standard deviation	p value	Effect size
F0 [a]	192	190	16.44	207	208	18.59	<0.001*	- 0.865
F0 [Ԑ]	192	191	15.07	206	202	19.78	<0.001*	- 0.806
F0 [e]	199	199	15.65	210	209	19.06	<0.001*	- 0.638
F0 [i]	210	209	23.41	218	214	24.25	0.062	- 0.339
F0 [ᴐ]	198	198	16.40	207	206	19.59	<0.001*	- 0.504
F0 [o]	205	207	16.14	212	209	21.07	0.004*	- 0.377
F0 [u]	222	226	19.27	222	218	27.02	0.980	0.000
F1 [a]	945	941	98.26	886	867	127.05	0.003*	0.526
F1 [Ԑ]	623	634	54.71	591	607	67.65	0.010*	0.526
F1 [e]	406	403	28.70	413	409	39.09	0.109	- 0.206
F1 [i]	306	303	36.50	306	305	37.84	0.599	1.251
F1 [ᴐ]	665	662	55.59	667	655	65.39	0.221	1.302
F1 [o]	457	455	33.05	453	441	41.49	0.566	- 0.033
F1 [u]	417	420	41.92	415	417	36.71	0.900	0.051
F2 [a]	1598	1588	176.50	1637	1645	201.03	0.219	-0.208
F2 [Ԑ]	2297	2286	174.80	2312	2288	194.89	0.452	1.310
F2 [e]	2540	2555	231.79	2577	2590	139.52	0.324	- 0.082
F2 [i]	2763	2767	127.73	2784	2806	127.48	0.190	- 0.195
F2 [ᴐ]	1015	1019	68.17	1152	1108	74.54	<0.001*	- 1.942
F2 [o]	854	847	65.25	900	891	61.09	<0.001*	- 0.737
F2 [u]	747	759	92.96	816	837	90.31	<0.001*	- 0.762