Acoustic characteristics of the metallic voice quality

Fadel, Congeta Bruniere Xavier; Dassie-Leite, Ana Paula; Santos, Rosane Sampaio; Rosa, Marcelo de Oliveira; Marques, Jair Mendes

doi:10.1590/2317-1782/20152014159

Abstracts

PURPOSE:

To characterize the fundamental frequency and the frequency of the formants F1, F2, F3, and F4 from vocal emissions of amateur singers with metallic voice quality.

METHODS:

There were 60 amateur female singers aged between 18 and 60 years old; 30 women with metallic voice quality forming the study group (SG) and 30 women without such a vocal quality forming control group (CG). The sample was selected through voice screening confirmed by reviewers after reaching a consensus. Regarding data collection, sustained vowel emissions in usual tone and at two predetermined frequencies, by which the values of F₀ and frequency of the formants F1, F2, F3, and F4 were obtained, were recorded and analyzed.

RESULTS:

Comparing the emissions in usual tone, no difference for F₀ was found, but the values of the formants F2, F3, and F4 were higher in the SG. In the preestablished tones, there was a difference between the two groups in the formants F3 and F4 for both tones.

CONCLUSION:

It is possible to characterize metallic voice quality as a normal fundamental frequency, with increasing frequency of the F2 formant, and values of frequencies of formants F2, F3, and F4 higher when compared to the CG.

Voice; Voice Quality; Acoustics; Speech Acoustics; Singing

OBJETIVO:

Caracterizar a frequência fundamental (F₀) e frequência de formantes F1, F2, F3 e F4 das emissões vocais de cantoras amadoras com qualidade vocal metálica.

MÉTODOS:

Participaram da pesquisa 60 cantoras amadoras, com idades entre 18 e 60 anos, sendo 30 mulheres de qualidade vocal metálica integrando o grupo estudo (GE) e 30 mulheres sem essa qualidade vocal compondo o grupo controle (GC). A amostra foi selecionada através de triagem vocal confirmada por avaliadores, em consenso. Para a coleta de dados, foram gravadas e analisadas emissões de vogal sustentada em tom habitual e em duas frequências pré-determinadas, pelas quais extraíram-se os valores de F₀ e frequência de formantes F1, F2, F3 e F4.

RESULTADOS:

Quanto à comparação das emissões em tom habitual, não houve diferença para F₀, mas os valores dos formantes F2, F3 e F4 foram maiores no GE. Nas tonalidades pré-estabelecidas, verificou-se diferença entre os dois grupos nos formantes F3 e F4, em ambos os tons.

CONCLUSÃO:

Foi possível caracterizar a qualidade de voz metálica como de voz de frequência fundamental normal, com frequência de formante F2 aumentado, e valores de frequências de formantes F2, F3 e F4 maiores quando comparados ao GC.

Voz; Qualidade da Voz; Acústica; Acústica da Fala; Canto

INTRODUCTION

The metallic voice quality is described by literature as being strident, thin, and unpleasant⁽¹⁾1. Boone DlR, McFarlane SC. A voz e a terapia vocal. 5ª edição. Porto Alegre: Artes Médicas; 1994.; it is also associated with the vocal resonance pattern of pharyngeal focus⁽²⁾2. Hanayama EM, Camargo ZA, Tsuji DH, Pinho SMR. Metallic voice: physiological and acoustic features. J Voice. 2009;23(1):62-70.. Its emission is related to vocal tract contraction by adjustments in pharyngeal constriction and articulators, laryngeal elevation, adductor tension⁽³⁾3. Pinho SMR. Fundamentos em Fonoaudiologia. Rio de Janeiro: Guanabara Koogan; 1998., velar lowering, aryepiglottic constriction, and lateral constriction⁽²⁾.

Voice metallization is considered to be an efficient voice projection resource by singers and actors, and it is generally used in specific singing styles, like American country music⁽³⁾3. Pinho SMR. Fundamentos em Fonoaudiologia. Rio de Janeiro: Guanabara Koogan; 1998.. However, because this vocal production involves hyperfunctional adjustments in the vocal tract⁽²⁾2. Hanayama EM, Camargo ZA, Tsuji DH, Pinho SMR. Metallic voice: physiological and acoustic features. J Voice. 2009;23(1):62-70., and is being seen as acute and annoying⁽³⁾, it is also considered to be a voice resonance disorder outside the artistic context⁽¹⁾1. Boone DlR, McFarlane SC. A voz e a terapia vocal. 5ª edição. Porto Alegre: Artes Médicas; 1994..

Glottic and supraglottic adjustments combined to the anatomic characteristics of the individual are responsible for the characterization of voice quality⁽⁴4. Laver J. Principles of phonetics. New York: Cambridge University Press; 1994. ^, ⁵5. Camardo ZA, Madureira S. Dimensões perceptivas das alterações de qualidade vocal e suas correlações aos planos da acústica e da fisiologia. DELTA. 2009;25(2):285-317. ⁾. This combination directly impacts the measurements of the formant frequencies (FF), in which F1 and F2 (lower formants) are sensitive regarding the position of the lips and the tongue in the oral cavities; the upper formants (F3 and F4) are related to the total length of the vocal tract⁽⁶6. Kent RD. Vocal tract acoustics. J Voice. 1993;7(2):97-117. ^, ⁷7. Lindblom BE, Sundberg JE. Acoustical consequences of lip, tongue, jaw, and larynx movement. J Acoust Soc Am. 1971;50(4):1166-79. ⁾.

The importance of this type of approach to a singing teacher is based on the knowledge regarding the acoustic phenomena that occur in the vocal tract; therefore, this professional can develop a more objective technical reasoning and apply scientific principles to the pedagogical practice.

The objective of this study was to characterize the fundamental frequency and formant frequencies of F1, F2, F3, and F4, of vocal emissions of amateur female singers with metallic voice quality.

METHODS

This was an observational, analytical, and cross-sectional study. The sample comprised 60 amateur female singers, aged between 18 and 60 years old; 30 women with metallic voice quality comprising the study group (SG) (mean age 32.6 years old), and 30 women without such a voice quality composing the control group (CG) (mean age 34.2 years). The project was approved by the ethics committee of Hospital de Clínicas of Universidade Federal do Paraná (UFPR), number 154.350.

The sample was selected by a perceptual-auditory voice screening test (which included voice samples of sustained emission of the vowel /ε/, continuous speech, and singing) conducted by a researcher who had 13-year experience as a singing teacher. She identified 30 metallic and 30 non-metallic voices. The voice samples were confirmed by three judges - two singing teachers with the same academic formation in music and one speech-language pathologist who was an expert in voice - having an average of eight years' experience in their respective fields. They all clearly shared the definition of the expression "metallic voice" and had previous knowledge regarding the production and the perceptual-auditory identification of the researched vocal pattern.

By consensus, the judges confirmed which would be the sample voices considered to be metallic or non-metallic. For that procedure, samples of metallic voice quality were those presenting focus on pharyngeal resonance, making the judge uncomfortable due to the characteristics that are compatible with strident voices.

After the confirmation, the collection stage started, which consisted of recorded samples of the vocal emission of the vowel /ε/, in habitual tone (HT) and in two predetermined tones (frequencies) - A3 (220.0 Hz) and C5 (523.2 Hz). Emissions were recorded with the software SONAR^(r), version 8.0.2, with unidirectional microphone Shure^(r), model SM58, placed at 45° and 4 cm away from the singer's mouth. For the predetermined tones, the auditory reference before every emission was the digital keyboard from the software SPEECHPITCH, version 1.1.

Samples were imported to the software PRAAT, version 5.3.42, to perform the acoustic analysis. From the emissions in HT, values of fundamental frequency were extracted (F₀) as well as the measurements of the formant frequencies F1, F2, F3, and F4; for the emissions in predetermined tones, values of F1, F2, F3, and F4 were extracted. For the analysis of these frequencies (in Hz), only the six central seconds of each emission were used as selection criteria for the sound records, so the initial and final seconds were not considered, and the most stable fragments were extracted.

Data were statistically analyzed with the parametric Student's t-test, with significance level of 0.05 (5%).

RESULTS

The intragroup comparison for HT showed that there were no differences for vocal parameter F₀, but values for F2, F3, and F4 were higher in the SG (Table 1).

Thumbnail

Table 1.
Comparison of the study and control groups for the variables F0, F1, F2, F3, and F4, in the emission of the vowel /e/ in habitual tone

In the preestablished tones, a difference was observed between both groups for F3 and F4, in both tones (Table 2).

Thumbnail

Table 2.
Comparison between the study and control groups for the variables F1, F2, F3, and F4, in the emission of the vowel /e/ for tones A3 and C5

DISCUSSION

In the musical field, it is possible to observe that singing teachers are not unanimous regarding the use of terminology to describe different vocal qualities and their respective vocal tract adjustments; there, they use some names based only on auditory sensations and body vibrations⁽⁸⁾8. Pacheco COLC, Marçal M, Pinho SMR. Registro e cobertura: arte e ciência no canto. Rev CEFAC. 2004;6(4):429-35.. Therefore, the association between the acoustic analysis and the perceptual-auditory analysis in the description/identification of the vocal quality can be described as a complementary resource for this practice.

In this study, the F₀ values obtained for the female singers in both groups were similar to the reference values⁽⁹9. Santos CC, Mituuti CT, Berretin-Felix G, Teles LCS. Características da fonetografia em mulheres com equilíbrio dentofacial. Rev Soc Bras Fonoaudiol. 2010;15(4):584-8. ^, ¹⁰10. Felippe ACN, Grillo MHMM, Grechi TH. Normatização de medidas acústicas para vozes normais. Rev Bras Otorrinolaringol. 2006;72(5):659-64. ⁾, so it is possible to state that the metallic voice quality cannot be considered to be the voice pattern of high F₀. However, the acute pitch sensation attributed to it is a result of the increasing frequencies and amplitudes of the formants⁽²⁾2. Hanayama EM, Camargo ZA, Tsuji DH, Pinho SMR. Metallic voice: physiological and acoustic features. J Voice. 2009;23(1):62-70..

Regarding FF values, except for F1, all the formants in HT were higher for metallic voices. It is possible to relate this feature to the shortening adjustment of the vocal tract, that is, a shorter vocal tract would lead to higher FFs when compared to longer vocal tracts⁽⁵⁾5. Camardo ZA, Madureira S. Dimensões perceptivas das alterações de qualidade vocal e suas correlações aos planos da acústica e da fisiologia. DELTA. 2009;25(2):285-317.. In terms of physiology, this vocal tract shortening would result in laryngeal elevation and hypertonicity of pharyngeal constrictor muscles, and these adjustments are associated with the production of metallic voice⁽¹1. Boone DlR, McFarlane SC. A voz e a terapia vocal. 5ª edição. Porto Alegre: Artes Médicas; 1994. ^, ²2. Hanayama EM, Camargo ZA, Tsuji DH, Pinho SMR. Metallic voice: physiological and acoustic features. J Voice. 2009;23(1):62-70. ⁾.

Among these values, the F2 of the SG was above the reference value, which was 2,062 Hz⁽¹¹⁾11. Monteiro MC. Uma análise computadorizada espectrográfica dos formantes das vogais orais do Português Brasileiro falado em São Paulo [monografia]. São Paulo: Universidade Federal de São Paulo; 1995.. This increase in F2 was related to the metallic voice production in a study that investigated the physiological adjustments related to it⁽²⁾2. Hanayama EM, Camargo ZA, Tsuji DH, Pinho SMR. Metallic voice: physiological and acoustic features. J Voice. 2009;23(1):62-70.. It is known that this formant is variable according to the placement of the tongue in the oral cavity, especially concerning the changes in its body and its placement in the anteroposterior direction⁽⁵5. Camardo ZA, Madureira S. Dimensões perceptivas das alterações de qualidade vocal e suas correlações aos planos da acústica e da fisiologia. DELTA. 2009;25(2):285-317. ^, ⁷7. Lindblom BE, Sundberg JE. Acoustical consequences of lip, tongue, jaw, and larynx movement. J Acoust Soc Am. 1971;50(4):1166-79. ⁾. Possibly, the muscle adjustments that led to this increasing frequency were a result of anteriorization and elevation of the tongue dorsum, similarly to the movement that takes place in the production of the vowel /i/. This tongue movement, besides the adjustments of lip stretching in a smile, is used for singing in order to look for voice metallization, and has been reported in literature as an adjustment that is present in this voice quality⁽³⁾3. Pinho SMR. Fundamentos em Fonoaudiologia. Rio de Janeiro: Guanabara Koogan; 1998.. However, for that hypothesis to be confirmed, it would be necessary to perform complementary radiological/imaging tests.

In the emissions of predetermined tones, only F3 and F4 presented higher values for this voice quality. These formants can change in relation to the dimension of the vocal tract cavity, and their reduction would lead to increased frequencies⁽⁷7. Lindblom BE, Sundberg JE. Acoustical consequences of lip, tongue, jaw, and larynx movement. J Acoust Soc Am. 1971;50(4):1166-79. ^, ¹²12. Sundberg J. Articulatory interpretation of the "singing formant". J Acoust Soc Am. 1974;55(4):838-44. ⁾. In that case, the adjustments in the vocal tract applied by the SG during the emission of deep and acute tones possibly led to smaller dimensions of the vocal tract when compared to the CG.

CONCLUSION

From the vocal acoustic assessment, it is possible to conclude that amateur female singers with metallic voice quality present F₀ within normality patterns. However, they present increased frequency values of formants F2, F3, and F4 in relation to the amateur female singers with non-metallic voice quality. The F2 formant seems to be mostly related to the metallic voice quality, because the mean values presented by the SG are higher than those indicated in the literature.

ACKNOWLEDGMENTS

We acknowledge the grant (number 158639/2012-0) provided by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) for conducting this study.

REFERENCES

¹
Boone DlR, McFarlane SC. A voz e a terapia vocal. 5ª edição. Porto Alegre: Artes Médicas; 1994.
²
Hanayama EM, Camargo ZA, Tsuji DH, Pinho SMR. Metallic voice: physiological and acoustic features. J Voice. 2009;23(1):62-70.
³
Pinho SMR. Fundamentos em Fonoaudiologia. Rio de Janeiro: Guanabara Koogan; 1998.
⁴
Laver J. Principles of phonetics. New York: Cambridge University Press; 1994.
⁵
Camardo ZA, Madureira S. Dimensões perceptivas das alterações de qualidade vocal e suas correlações aos planos da acústica e da fisiologia. DELTA. 2009;25(2):285-317.
⁶
Kent RD. Vocal tract acoustics. J Voice. 1993;7(2):97-117.
⁷
Lindblom BE, Sundberg JE. Acoustical consequences of lip, tongue, jaw, and larynx movement. J Acoust Soc Am. 1971;50(4):1166-79.
⁸
Pacheco COLC, Marçal M, Pinho SMR. Registro e cobertura: arte e ciência no canto. Rev CEFAC. 2004;6(4):429-35.
⁹
Santos CC, Mituuti CT, Berretin-Felix G, Teles LCS. Características da fonetografia em mulheres com equilíbrio dentofacial. Rev Soc Bras Fonoaudiol. 2010;15(4):584-8.
¹⁰
Felippe ACN, Grillo MHMM, Grechi TH. Normatização de medidas acústicas para vozes normais. Rev Bras Otorrinolaringol. 2006;72(5):659-64.
¹¹
Monteiro MC. Uma análise computadorizada espectrográfica dos formantes das vogais orais do Português Brasileiro falado em São Paulo [monografia]. São Paulo: Universidade Federal de São Paulo; 1995.
¹²
Sundberg J. Articulatory interpretation of the "singing formant". J Acoust Soc Am. 1974;55(4):838-44.

Conselho Nacional de Desenvolvimento Científico e Tecnológico - CNPq

Publication Dates

Publication in this collection
Jan-Feb 2015

History

Received
26 Aug 2014
Accepted
12 Nov 2014

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

[1] ¹
Boone DlR, McFarlane SC. A voz e a terapia vocal. 5ª edição. Porto Alegre: Artes Médicas; 1994.

[2] ²
Hanayama EM, Camargo ZA, Tsuji DH, Pinho SMR. Metallic voice: physiological and acoustic features. J Voice. 2009;23(1):62-70.

[3] ³
Pinho SMR. Fundamentos em Fonoaudiologia. Rio de Janeiro: Guanabara Koogan; 1998.

[4] ⁴
Laver J. Principles of phonetics. New York: Cambridge University Press; 1994.

[5] ⁵
Camardo ZA, Madureira S. Dimensões perceptivas das alterações de qualidade vocal e suas correlações aos planos da acústica e da fisiologia. DELTA. 2009;25(2):285-317.

[6] ⁶
Kent RD. Vocal tract acoustics. J Voice. 1993;7(2):97-117.

[7] ⁷
Lindblom BE, Sundberg JE. Acoustical consequences of lip, tongue, jaw, and larynx movement. J Acoust Soc Am. 1971;50(4):1166-79.

[8] ⁸
Pacheco COLC, Marçal M, Pinho SMR. Registro e cobertura: arte e ciência no canto. Rev CEFAC. 2004;6(4):429-35.

[9] ⁹
Santos CC, Mituuti CT, Berretin-Felix G, Teles LCS. Características da fonetografia em mulheres com equilíbrio dentofacial. Rev Soc Bras Fonoaudiol. 2010;15(4):584-8.

[10] ¹⁰
Felippe ACN, Grillo MHMM, Grechi TH. Normatização de medidas acústicas para vozes normais. Rev Bras Otorrinolaringol. 2006;72(5):659-64.

[11] ¹¹
Monteiro MC. Uma análise computadorizada espectrográfica dos formantes das vogais orais do Português Brasileiro falado em São Paulo [monografia]. São Paulo: Universidade Federal de São Paulo; 1995.

[12] ¹²
Sundberg J. Articulatory interpretation of the "singing formant". J Acoust Soc Am. 1974;55(4):838-44.

Variables	Mean (Hz)		Standard deviation		p-value
Variables	SG (n=30)	CG (n=30)	SG (n=30)	CG (n=30)	p-value
F₀	212.35	210.49	25.71	19.56	0.7543
F1	615.42	609.01	72.86	52.18	0.6969
F2	2,175.89	2,043.10	145.42	197.83	0.0044*
F3	2,976.63	2,731.48	160.46	177.85	0.0000*
F4	4,407.52	3,842.04	218.13	314.82	0.0000*

Variables	Mean (Hz)		Standard deviation		p-value
Variables	SG (n=30)	GC (n=30)	SG (n=30)	CG (n=30)	p-value
La 2
F1	605.00	620.04	76.22	45.60	0.3577
F2	2,117.17	2,069.73	171.43	187.97	0.3113
F3	2,972.23	2,839.76	175.54	244.76	0.0192*
F4	4,337.19	4,142.15	229.32	405.28	0.0254*
Do 4
F1	771.91	712.14	134.97	136.90	0.0939
F2	1,501.16	1,543.46	351.20	319.71	0.6275
F3	2,697.04	2,519.94	252.42	274.51	0.0118*
F4	3,932.86	3,612.50	382.70	353.95	0.0014*