Acessibilidade / Reportar erro

Use of the program MDVP in different contexts: a literature review

Abstract:

The purpose of this study is to describe,the reliability of acoustic analysis using Multi Dimensional Voice Program, compare their measures with other acoustic analysis programs, characterize the voices of different groups, and use it to assess the effects and effectiveness of different therapeutic procedures. The study is based on a literature review that prioritized the past five years and included articles whose purpose was according to the interest of this review. The survey was conducted in the databases of Lilacs, BIREME, PubMed, Medline, SciELO and Google Scholar, using the descriptors acoustic, speech acoustics and voice. The measures most commonly used are fundamental frequency, jitter, shimmer and harmonics-to-noise ratio. The measures of tremor do not have much reliability. The most consistent measures that present a high correlation with other acoustic analysis programs are related to fundamental frequency. Several studies aim to characterize different types of voices such as those of subjects of both sexes without voice alterations, in order to establish a normality parameter for different population groups. In addition, some studies were characterized with different voice disorders. Acoustic analysis is an objective and necessary resource for the evaluation of patients with voice disorders and especially to compare several types of treatment.

Keywords:
Voice; Speech Acoustics; Voice Quality; Speech Recognition Software; Acoustics

Resumo:

Este estudo tem como objetivo descrever, a partir de revisão de literatura, a confiabilidade da análise acústica utilizando o Multi Dimensional Voice Program, comparação de suas medidas com outros programas de análise acústica; caracterização de vozes de diferentes grupos; e sua utilização para verificar os efeitos e eficácia de diferentes procedimentos terapêuticos. Realizou-se levantamento bibliográfico que priorizaram estudos dos últimos cinco anos, incluindo-se artigos cujo objetivo estivesse de acordo com o interesse da presente revisão. A pesquisa foi realizada nos bancos de dados das bases Lilacs, BIREME, PubMed, MedLine, Scielo e Google Schoolar, por meio dos descritores acoustic, speech acoustics e voice. As medidas mais utilizadas são frequência fundamental, jitter, shimmer e proporção harmônico-ruído. As medidas de tremor não apresentam boa confiabilidade. As medidas mais consistentes que apresentam alta concordância com outros programas de análise acústica são as relacionadas com a frequência fundamental. Diversos estudos buscam caracterizar diferentes tipos de vozes tais como as de sujeitos sem alterações vocais de ambos os sexos, buscando estabelecer parâmetro de normalidade para diferentes grupos populacionais. Além disso, alguns estudos caracterizam vozes com diferentes distúrbios. A análise acústica é um recurso objetivo e necessário na avaliação de pacientes com distúrbio vocal e, principalmente, na comparação de diversos tipos de tratamento.

Descritores:
Voz; Acústica da Fala; Qualidade da Voz; Interface para o Reconhecimento da Voz; Acústica

Introduction

The human ear is prepared to perceive voice as a whole, which is advantageous from the point of view of linguistic communication. But such ability becomes limited when it is decisive for the individualization of relevant clinical aspects. For this reason, it is difficult to determine the origin of certain voice disorders when using only subjective tools such as the auditory-perceptual assessment1. González J, Cervera T, Miralles JL. Análisis acústico de la voz: fiabilidad de un conjunto de parámetros multidimensionales. Acta Otorrinolaringol Esp. 2002;53: 256-68. 3. Aguero PD, Tulli JC, Moscardi G, Gonzalez EL, Uriz AJ. Estimating RASATI scores using acoustical parameters. J Physics. 2011;332(1):1-8..

Accordingly, the number of researches in speech therapy has increased, which use techniques and tools that provide objective and increasingly reliable results. The Multi Dimensional Voice Program (MDVP) is a standard software for acoustic assessment which is widely used by many researchers in the voice field for being very comprehensive. Thus, it is important to study its particularities.

Acoustic vocal assessment consists of a noninvasive process of obtaining objective measures from signal. Because of its exceptional accuracy, it allows evaluators to recognize voice alterations early and helps compare the effects and efficiency of several vocal techniques or other voice therapeutic procedures, even if they produce subtle changes4. Finger LS, Cielo CA, Schwarz K. Acoustic vocal measures in women without voice complaints and with normal larynxes. Braz J Otorhinolaryngol. 2009; 75(3):432-40. 6. Mora R, Crippa B, Dellepiane M, Jankowska B. Effects of adenotonsillectomy on speech spectrum in children. Inter J of Pediatric Otorhinolaryngol. 2007;71:1299-304..

Among the advantages of using this method in voice assessment is the fact that it is noninvasive and deliver numerical parameters. One of the major current limitations is that is has not been possible yet to establish an exact correlation between the numerical parameters of the acoustic analysis with the auditory-perceptual aspects of voice 1. González J, Cervera T, Miralles JL. Análisis acústico de la voz: fiabilidad de un conjunto de parámetros multidimensionales. Acta Otorrinolaringol Esp. 2002;53: 256-68. 2. Tezcaner CZ, Ozgursoy SK, Sati I, Dursun G. Changes after voice therapy in objective and subjective voice measurements of pediatric patients with vocal nodules. Eur Arch Otorhinolaryngol. 2009;266:1923-7..

Given the above, this study is the result of a literature review aiming to describe the reliability of acoustic analysis using the Multi Dimensional Voice Program compare its measures with other acoustic analysis software, characterize voices from different groups, and use it to verify the effects and effectiveness of different therapeutic procedures.

Methods

A theoretical and exploratory survey was conducted using the non-systematic method of literature review. Only original research articles and those based on bibliographic review on MPVP from traditional and modern national and international sources of literature were included. The search was carried out on Lilacs (Latin-American and Caribbean Center on Health Sciences Information), Bireme (Regional Library of Medicine), Medline (Medical Literature Analysis and Retrieval System on Line), PubMed (Medical Publications), Scielo (Scientific Electronic Library Online), and Google Scholar databases. The Health Science Descriptors (DeCS, 2013) used to find the papers were: acoustic, speech acoustics and voice. The search was made using isolate descriptors and later by combining them.

In total, 102 articles were retrieved, and of this 15 were excluded at once for being studies involving the use of medications. So, 87 articles were first selected before reading the abstracts. Of these, it was included in the present survey all papers published in between 2008 to 2013, having any of the following approaches: reliability of MDVP; comparison between MDVP and other acoustic analysis tools; use of acoustic assessment to compare vocal techniques used in speech therapy.

The criteria used for exclusion were: periodicals not indexed in the above databases; articles whose data of publication exceeded five years, except two papers of 2002 and 2007 which were included by the authors for being considered relevant to the present review. In addition, articles addressing comparative acoustic measures before and after treatment of non-vocal pathologies were also excluded.

Based on the above criteria, 37 papers were selected for analysis. First, these articles were organized according to the authors, title, publication year, and kind of study. Based on this categorization, the references were grouped according to the themes that gave origin to the subtitles of the present work, namely: "Acoustic analysis using MDVP: reliability of the software and measures"; "Comparison with other acoustic assessment programs"; "Characterization of voices of different groups"; "Use of the MDVP system to verify the effect and effectiveness of different therapeutic procedures".

Literature Review

Acoustic analysis using MDVP: reliability of the software and measures

The MDVP software of Kay Pentax(r)calculates 33 measures, and those of major interest in the literature are the frequency measures: f0; maximum f0 (fhi); minimum f0 (flo); standard deviation f0 (STD); frequency perturbation measures; absolute jitter (Jita); Jitterpercentage (Jitt); Relative measure of the pitch disturbance (RAP); pitch perturbation quotient (PPQ); smoothed pitch perturbation quotient (sPPQ); f0 variation (vf0); measures of perturbation intensity: Shimmer in dB (ShdB); percentage Shimmer (Shim); amplitude perturbation quotient (APQ); smoothed amplitude perturbation quotient (sAPQ); amplitude variation (vAm); Noise-to-harmonics Ratio (NHR); Voice turbulence index (VTI); Smoothed phonation index (SPI); voice break measures: Degree of voice breaks (DVB); Number of voice breaks (NVB); mute or unvoiced segments measures: Number of unvoiced segments (NUV); Degree of unvoiced segments (DUV); sub-harmonic segments measures: Degree of sub-harmonics (DSH); Number of sub-harmonics (NSH)1. González J, Cervera T, Miralles JL. Análisis acústico de la voz: fiabilidad de un conjunto de parámetros multidimensionales. Acta Otorrinolaringol Esp. 2002;53: 256-68. 4. Finger LS, Cielo CA, Schwarz K. Acoustic vocal measures in women without voice complaints and with normal larynxes. Braz J Otorhinolaryngol. 2009; 75(3):432-40. 7. Conterno G, Cielo CA, Elias VS. Características vocais acústicas do som basal em homens com fissura pós-forame reparada. Rev CEFAC. 2011;13(1):171-81. 8. Roman-Niehues G, Cielo CA. Modificações vocais acústicas produzidas pelo som hiperagudo. Rev CEFAC. 2010;12(3):462-70..

There are a great number of studies in the literature using the MDVP software as an assessment tool for acoustic analysis. Firstly, it is possible to locate studies that evaluate the reliability of the program and its measures. The f0, jitter, shimmer and NHR measures are widely used by researchers 9. Bonilha HS, Dawson AE. Creating a Mastery Experience During the Voice Evaluation. J Voice. 2012;26(5):665-71. 1212 . Garcia MJV, Cobeta I, Martin G, Alonso-Navarro H, Jimenez-Jimenez J. Acoustic analysis of voice in Huntington's disease patients. J Voice. 2009;25(2):208-17.. Such objective parameters are important for the assessment of patients with voice disorders.

In addition, measures of tremor such as FATr and FFTr are also often analyzed by many studies1313 . Boucher VJ, Ayad T. Physiological Attributes of vocal fatigue and their acoustic effects: a synthesis of findings for a criterion-based prevention of acquired voice disorders. J Voice. 2010;24(3):324-36. 1414 . Franca MC, Simpson KO. Effects of systemic hydration on vocal acoustics of 18- to 35-year-old females. Commu Disord Quarterly. 2012;34(29):29-33.. However, such measures may show variations according to the time of emission or the analyzed text. A study aiming to reduce the effects of such measure fluctuations used constant voiced speech time of 3,000 ms as starting points and performed averages of seven consecutive measures involving FATr and FFTr1313 . Boucher VJ, Ayad T. Physiological Attributes of vocal fatigue and their acoustic effects: a synthesis of findings for a criterion-based prevention of acquired voice disorders. J Voice. 2010;24(3):324-36..

Another study1515 . Shao J, Maccallum JK, Zhang Y, Sprecher A, Jiang JJ. Acoustic analysis of the tremulous voice: Assessing the utility of the correlation dimension and perturbation parameters. J Commun Disord. 2010;43(1):35-44. investigated tremor measurements comparing 25 voices of subjects with Parkinson's disease with 24 voices of normal subjects using Jitter, shimmer, ATRI, FTRI, Fftr and Fatr measures. The jitter, shimmer, FTRI, and Fftr measures were significantly higher in the group of subjects with Parkinson's disease, suggesting that such measures possibly are more reliable to assess the tremor parameter in clinical practice.

Reliability of MDVP measures was also investigated by a study1. González J, Cervera T, Miralles JL. Análisis acústico de la voz: fiabilidad de un conjunto de parámetros multidimensionales. Acta Otorrinolaringol Esp. 2002;53: 256-68.that tested and retested the same voices and found high reliability for the frequency parameters; an acceptable reliability of jitter, shimmer noise, sub-harmonics and voice irregularity measures; and low reliability in tremor parameters. It also found that the parameters related to shimmer are more reliable than those related to jitter.

Concerning amplitude, a study 1616 . Remacle A, Finck C, Roche A, Morsomme D. Vocal Impact of a prolonged reading task at two intensity levels: objective measurements and subjective self-ratings. J Voice. 2012;26(4):177-86. had the purpose of assessing the impact of prolonged reading on voice in two different levels of vocal sound pressure. The sample consisted of collecting sustained vowel [a] in usual tone, performed in three replications, with 50 female individuals and carried out in two sessions. In the first session the voice intensity ranged from 60 to 65 dB and in the second from 70 to 75 dB. The parameters assessed in MDVP were average f0, Jitter, Shimmer and NHR. The findings of this study indicated that average f0 rises as the voice intensity increases, while the voice quality decreases. Such findings confirm the importance of duration and intensity of vocal sound as factors of voice overload. These data were already known by clinic speech therapists, but the use of an objective evaluation enabled such corroboration.

Yet, according to literature some caution is necessary when using acoustic measurements. According to a study1717 . Leong K, Hawkshaw MJ, Dentchev D, Gupta R, Lurie D, Sataloff R. Reliability of objective voice measures of normal speaking voices. J Voice. 2013;27(2):170-6. NHR is considered the less reliable measure and the cepstral peak prominence (CPP) may vary according to gender.

According to some authors1717 . Leong K, Hawkshaw MJ, Dentchev D, Gupta R, Lurie D, Sataloff R. Reliability of objective voice measures of normal speaking voices. J Voice. 2013;27(2):170-6., understanding the intrapersonal variability in objective voice measurements, and if such variations result from biological differences or measurement error, the need for a standard test protocol should be considered. Thus, further studies covering the aspects of reliability of the MDVP software and measures are still necessary.

Comparison of MDVP with other acoustic analysis software

It is known that acoustic analysis programs use their own method to calculate f0, and although each one produces minor differences in the f0 mean value, to a greater extent it may influence the perturbation measures. This fact makes the clinical practice more difficult, because the programs that are available could show different values when they analyze identical voice samples3. Aguero PD, Tulli JC, Moscardi G, Gonzalez EL, Uriz AJ. Estimating RASATI scores using acoustical parameters. J Physics. 2011;332(1):1-8.. Therefore, having full understanding of the differences between the programs and their measures is necessary to facilitate comparison and generalization of the results of studies using different programs and even to select the most suitable program to each situation.

Furthermore, it is not clear whether the normative data contained in the software (e.g., MDVP) are comparable with those obtained with other programs. Some researchers 3. Aguero PD, Tulli JC, Moscardi G, Gonzalez EL, Uriz AJ. Estimating RASATI scores using acoustical parameters. J Physics. 2011;332(1):1-8. 1818 . Manfredi C, Giordanoa A, Schoentgenb J, Fraj S, Bocchia L, Dejonckered PH. Perturbation measurements in highly irregular voice signals: Performances/validity of analysis software tools. Biomed Signal Proces. 2012;7:409-16. 2020 . Amir O, Wolf M, Amir N. A clinical comparison between two acoustic analysis softwares: MDVP and Praat. Biomed Signal Proces. 2009;4(1):202-5. have already observed such discrepancy between the results obtained by different programs.

A study1818 . Manfredi C, Giordanoa A, Schoentgenb J, Fraj S, Bocchia L, Dejonckered PH. Perturbation measurements in highly irregular voice signals: Performances/validity of analysis software tools. Biomed Signal Proces. 2012;7:409-16. compared the jitter values found using Praat and MDVP programs. It was found that the threshold is greater in using Praat than in MDVP. Both programs are efficient and have comparable sensitivity when measuring small jitter values. This same study also compares the results obtained from AMPEX and BioVoice programs and found that BioVoice has the best performance in the analysis of jitter. This is because of the time of analysis that is possible with this program. However, another research found similar values for jitter when analyzed by MDVP and BioVoice1919 . Manfredi C, Bocchi l, Cantarella G. A multipurpose user-friendly tool for voice analysis: Application to pathological adult voices. Biomed Signal Proces. 2009;4:212-20..

In a research2020 . Amir O, Wolf M, Amir N. A clinical comparison between two acoustic analysis softwares: MDVP and Praat. Biomed Signal Proces. 2009;4(1):202-5. conducted with women with and without functional dysphonia, similar results for f0 were obtained using MDVP and Praat programs. But for jitter, shimmer and NHR values, the results obtained by Praat were lower when compared to MDVP. This was different from the findings of another study 2121 . Oguz H, Kiliç MA, Safak M.A. Comparison of results in two acoustic analysis programs: Praat and MDVP. Turk J Med Sci. 2011;41(5):835-41., in which the shimmer values were similar, while jitter and NHR values were not similar. This highlights the need for further investigations on this matter, because by knowing the details of each software thoroughly it will be possible in the future to select the most appropriate program to suit the goals of the research to be conducted.

Another study2222 . Silva DG, Oliveira LC, Andrea, M. Jitter estimation algorithms for detection of pathological voices. EURASIP J Adv Sig Pr. 2009;1(1):1-9. examined the use of a new proposed measurement for frequency perturbation called LocJitt, which provides better accuracy.This measurement was considered more reliable when used either with MDVP or with Praat. The only case in which Jitt exceeds the LocJitt measurement was when relative jitter was used in Praat. In all other cases the results obtained with LocJitt are similar or better than Jitt.

A study proposes a new method to estimate short-term jtter based on a mathematical model that describes coupling of two periodic phenomena. The proposed method measures such periodic movement indirectly, considering the spectral properties as a whole. The assessment was conducted with normal and dysphonic voice samples using two databases and jitter estimates from MDVP and Praat. The study showed that traditional methods really depend on the lowest frequency information, while the proposed method takes into account the entire spectrum. The study of short duration statistical jitter measures provided by the suggested method shows that there is a greater correlation with the voice pathology compared to the other two systems2323 . Vasilakis M, Stylianou Y. Spectral jitter modeling and estimation. Biomed Signal Proces. 2009;4(1):183-93..

With respect to MDVP reliability, a research1010 . Hill AC, Ojha S, Maturo S, Maurer R, Bunting G, Hartnick CJ. Consistency of voice frequency and perturbation measures in children. Otolaryngol Head Neck Surg. 2013;148:637.compared its findings with the Voice Evaluation Suite (VES) program, analyzing voices of 50 children aged 4 to 17 years having no vocal alterations, and obtained excellent reliability for f0 measures. In the conclusion, the authors reported that measures of sustained vowels could provide consistent frequency measures. But perturbation is not measured consistently, and the average of various samples of the same subject improves the result consistency. The WPCVox program showed similar results to those found with MDVP, and the frequency and amplitude perturbation values showed minor variation, with a difference within expectations. So, WPCVox is a program that allows comparisons with data analyzed by MDVP 2424 . Godino-Llorente JI, Osma-Ruiz V, Sáenz-Lechón N, Cobeta-Marco I, González-Herranz R, Ramírez-Calvo C. Acoustic analysis of voice using WPCVox: a comparative study with Multi Dimensional Voice Program. Eur Arch Otorhinolaryngol. 2008;265:465-76..

To date, the literature has shown that the most consistent measures that are in agreement with other acoustic analysis programs are those related to f0. On the other hand, the frequency and amplitude perturbation measures have some variance, and the users should be careful when comparing the results from studies on these measures.

Characterization of voices of different groups

There are studies in literature that seek to characterize acoustically voices of different groups of individuals. Among them, a study conducted a literature review describing and discussing the aspects involving acoustic vocal characteristics (of source and filter) of men with normal larynx. Such investigation concluded that the Jitt, ShdB and NHR values tend to be higher in men, while f0, PHR/HNR and ATRI tend to be lower than in women5. Beber BC, Cielo CA. Medidas acústicas de fonte glótica de vozes masculinas Normais. Pró-Fono R Atual Cient. 2010;22(3):299-304..

Another study described acoustic measures of 56 female, young adults with normal larynx and no complaints. The analysis showed that the measures with normal distribution were: f0; Jitter (local); Jitter (local, absolute); Jitter (ppq); Jitter (ddp). The Jitter (rap) measures, all Shimmer measures, the noise-to-harmonics ratio (NHR) and harmonics-to-noise ratio (HNR) did not follow normal distribution. From this finding it can be inferred that the measures that followed normal distribution are likely to be used as reference values for the interpretation of the results from female acoustic voice analyses, with or without laryngeal disorders. In addition, the study showed that all measures with or without normal distribution had similar results of those reported in national and international literature4. Finger LS, Cielo CA, Schwarz K. Acoustic vocal measures in women without voice complaints and with normal larynxes. Braz J Otorhinolaryngol. 2009; 75(3):432-40..

In the same line, a study 2525 . Yaser S, Wingate JM. Fundamental frequency characteristics of jordanian arabic speaker. J Voice. 2009;23(5):562-6. was conducted to verify the normal vocal acoustic characteristics of Arabs. The sample consisted of 300 Arabic-speaking Jordanians (100 male adults, 100 female adults and 10 children of both sexes). The MDVP program was used for f0 analysis of sustained [a] and f0 of speech of a produced sentence. The results revealed a significant difference among the f0 values of male, female adults and children. Comparison with other ethnic groups suggests that the f0 values of Jordanian male and female Arabic-speaking adults are in general consistent with the values of Caucasian and African adults. But for children the f0 values were relatively higher when compared to children of other languages.

This study 2525 . Yaser S, Wingate JM. Fundamental frequency characteristics of jordanian arabic speaker. J Voice. 2009;23(5):562-6. suggests that speech therapists in Arabic language specifically use the new data provided by the study in the assessment and/or treatment of Arab patients with speech disorders.

Also, an investigation sought to characterize vocal emissions from adult male patients with repaired post-foramen palatine fissure in a modal and basal recording. An acoustic analysis of glottal source in basal recording showed: f0 was within the speech modal register range and increased when compared to the emission in modal register; a large increase of measures that show frequency and amplitude variation; most of the jitter and shimmer measures significantly increased; increased measures of noise, voice breaks, unvoiced segments and tremor segments; SPI significantly reduced. This shows high emission instability and noise in basal recording in these subjects, likely due to weak intensity and reduced transglotic air flow7. Conterno G, Cielo CA, Elias VS. Características vocais acústicas do som basal em homens com fissura pós-forame reparada. Rev CEFAC. 2011;13(1):171-81..

MDVP still offers two measures of sub-harmonic segments, viz degree of sub-harmonic components (DSH) and numbers of sub-harmonic components (NSH). It is expected that subjects with normal voice have null values for both parameters, despite a small percentage of people, often women, who although may not have pathology of vocal folds, have sub-harmonic segments1212 . Garcia MJV, Cobeta I, Martin G, Alonso-Navarro H, Jimenez-Jimenez J. Acoustic analysis of voice in Huntington's disease patients. J Voice. 2009;25(2):208-17.. Thus, such measures appear less frequently in the literature, especially in studies with individuals without laryngeal disorders.

A study1212 . Garcia MJV, Cobeta I, Martin G, Alonso-Navarro H, Jimenez-Jimenez J. Acoustic analysis of voice in Huntington's disease patients. J Voice. 2009;25(2):208-17.that performed acoustic voice analysis in individuals with Huntington's disease found that both parameters (NSH and DSH) of frequency and amplitude perturbation had values significantly higher than the control group. In addition, these values had a significantly positive correlation with the degree of severity of the disease. Also, the number of voiceless segments or unvoiced segments (DUV and NUV) and the NHR, FTRI and ATRI measures also presented higher values for the group of subjects with Huntington's disease, when compared to the control group. However, the VTI and SPI values did not differ significantly between the two groups of the study. In addition, none of the tremor parameters, irrespective of frequency or amplitude, showed a significant correlation with the degree of severity of the disease.

Use of the MDVP system to verify the effect and effectiveness of different therapeutic procedures

Another category of studies compares the effects of voice techniques using acoustic analysis. In this context, a study 2626 . Schwarz K, Cielo CA. Modificações laríngeas e vocais produzidas pela técnica de vibração sonorizada de língua. Pró-Fono R Atual Cient. 2009;21(2):161-6.that used the voiced tongue vibration technique of speech therapy in 24 women observed immediately after performing the technique a significant increase of f0. Other studies that investigated vocal modifications occurring after using the techniques of fricative [ž] 2727 . D'avila H, Cielo CA, Siqueira, MS. Som fricativo sonoro /Ž/: modificações vocais. Rev CEFAC. 2010;12(6):915-24.and high-pitch sound 8. Roman-Niehues G, Cielo CA. Modificações vocais acústicas produzidas pelo som hiperagudo. Rev CEFAC. 2010;12(3):462-70. in women without vocal alterations did not find significant differences regarding the acoustic parameters analyzed via MDVP.

A study with 32 adult women without vocal complaints investigated the immediate effect of lips and tongue vibration and nasal sound vocal techniques, considered SOVTE (semi-occluded vocal tract exercises) and over-articulation. Using the MDVP system it was possible to determine a significant decrease of jitter and shimmer in the voices of the subjects 2828 . Pereira EC, Silvério KCA, Marques JM, Camargo PAM. Efeito imediato de técnicas vocais em mulheres sem queixa vocal. Rev CEFAC. 2011;13(5):886-95..

Also, there are studies that aim to evaluate the efficiency or effectiveness of different voice therapeutic methods in case of subjects with dysphonia. Among them, a case study aimed to verify the effect of therapy with wind instrument in the phonatory function of patients with Parkinson's disease by means of acoustic voice analysis, which resulted in a reduction in the measures of Jita, Jitt, RAP, PPQ, sPPQ, ShdB, Shim, APQ, sAPQ, NHR after therapy2929 . Rosa JC, Cielo CA, Cechella C. Função fonatória em pacientes com doença de parkinson: uso de instrumento de sopro. Rev CEFAC. 2009;11(2):305-13., showing that such method provided positive results on the patients' voice.

With the purpose of determining the efficiency of the voice therapy, a study was conducted with 39 children who had vocal nodules, using the acoustic analysis and subjective evaluation. Each individual attended one therapy session per week during a period of three or six months. The evaluations were performed before and after the therapy, using MDVP and auditory-perceptual analysis based on the GRBAS scale. There was a significant improvement in four of five auditory-perceptual parameters and in the jitter (jitt), shimmer (shimm) measures, and in the noise-to-harmonics ratio (NHR). The study showed that acoustic analysis and GRBAS are useful evaluation tools in the comparison of therapies of children with vocal nodules 2. Tezcaner CZ, Ozgursoy SK, Sati I, Dursun G. Changes after voice therapy in objective and subjective voice measurements of pediatric patients with vocal nodules. Eur Arch Otorhinolaryngol. 2009;266:1923-7..

A study3030 . Petrovic-Lazic M, Babac S, Vukovic M, Kosanovic R, Ivankovic Z. Acoustic voice analysis of patients with vocal fold polyp. J Voice. 2011;25(1):94-7. with women who had vocal polyps assessed the acoustic correlates of the vocal quality of these patients, before and after three weeks of endolaryngeal phonosurgery. To this end, ear, nose and throat (ENT) examinations and voice recording were performed. All acoustic parameters analyzed (vf0; Jitter; shimer; NHR; VTI; PPQ and APQ) improved significantly, and after the surgery they were similar to the reference values provided by the program. Therefore, the acoustic voice analysis is also suitable to compare voice in the pre- and post-surgical period.

A randomized study3131 . Duan J, Zhu L, Yan Y, Pan T, Lu P, Ma F. The efficacy of a voice training program: a case-control study in China. Eur Arch Otorhinolaryngol. 2010; 267:101-5.examined the efficiency of a vocal training program in 36 teachers who had had vocal problems for more than three months, comparing the results with 22 individuals of the control group (without dysphonia). The therapeutic program had duration of four weeks and included vocal hygiene, vocal training with the yawn-sigh voice therapy technique, and guidance to perform the technique at home. The jitter, shimmer and NHR measures were analyzed. After administration of the therapeutic program there was a significant difference only in the NHR values.

Still regarding the use of MDVP to verify the effects of different therapeutic procedures, a group of elderly choral singers underwent a program of Vocal Function Exercises3232 . Tay EYL, Phyland DJ, Oates J. The effect of vocal function exercises on the voices of aging community choral singers. J Voice. 2012;26(5):672e19-27.. After training, MDVP showed that there was a significant improvement in the jitter, shimmer and NHR values. Such findings indicate that the program of vocal function exercises had a positive effect on the voices, and so they have the potential to attenuate the physiological effects of voice aging.

Some studies compare the effects produced in voice by different surgeries. Among them, a study3333 . Pedersen M, Eeg M. Does treatment of the laryngeal mucosa reduce dystonic symptoms? A prospective clinical cohort study of mannose binding lectin and other immunological parameters with diagnostic use of phonatory function studies. Eur Arch Otorhinolaryngol. 2012;269:1477-82.using MDVP found a positive effect in treating laryngeal symptoms, evidenced by a decrease of dystonic complaints and a reduction of the edema in the interarytenoid region after a medical intervention in a 55-year old patient with dystonia. The finding of this study 3333 . Pedersen M, Eeg M. Does treatment of the laryngeal mucosa reduce dystonic symptoms? A prospective clinical cohort study of mannose binding lectin and other immunological parameters with diagnostic use of phonatory function studies. Eur Arch Otorhinolaryngol. 2012;269:1477-82. is in agreement with other researches 2. Tezcaner CZ, Ozgursoy SK, Sati I, Dursun G. Changes after voice therapy in objective and subjective voice measurements of pediatric patients with vocal nodules. Eur Arch Otorhinolaryngol. 2009;266:1923-7. 3030 . Petrovic-Lazic M, Babac S, Vukovic M, Kosanovic R, Ivankovic Z. Acoustic voice analysis of patients with vocal fold polyp. J Voice. 2011;25(1):94-7. 3131 . Duan J, Zhu L, Yan Y, Pan T, Lu P, Ma F. The efficacy of a voice training program: a case-control study in China. Eur Arch Otorhinolaryngol. 2010; 267:101-5. where the MDVP is used as a method to analyze effectiveness of therapeutic programs.

Another similar study used MDVP in a pre- and post-operative evaluation to release Reinke's space and in a specific surgical procedure, and also to compare long-term findings in patients who had benign vocal lesions. Based on the findings of the study it was possible to see that the use of the operative technique in analysis produced good vocal and laryngeal results in the patients. In addition, MDVP showed to provide progressive post-operative improvement during four years 3434 . Finck CL, Harmegnies B, Remacle A, Lefebvre P. Implantation of esterified hyaluronic acid in microdissected Reinke's space after vocal fold microsurgery: short- and long-term results. J Voice. 2010;24(5):626-35..

A study used the following parameters for MDVP analysis: mean values of f0, jitter, shimmer and NHR to determine the effectiveness of injection of laryngoplasty technique using calcium hydroxyapatite, comparing the cricotyroid (CT) approach with the thyrohyoid (TH) approach in patients with unilateral vocal fold paralysis3535 . Woo SH, Young-Ik S, Lee SH, Park JJ, Kim JP. Comparative analysis on the efficiency of the injection laryngoplasty technique using calcium hydroxyapatite (CaHA): The thyrohyoid approach versus the cricothyroid approach. J Voice. 2013;27(2):236-41.. The videostroboscopic, acoustic and auditory-perceptual parameters improved significantly after the injection using either the CT or the TH approach.

Another study with patients undergoing total thyroidectomy (TT) aimed to assess vocal functionality and swallowing in pre- and post-operative patients3636 . Lombardi CP, D'alatri L, Marchese MR, Maccora D, Lo Monaco, Me Crea C et al. Prospective electromyographic evaluation of functional postthyroidectomy voice and swallowing symptoms. World J Surg. 2012;36:1354-60.. An acoustic analysis was conducted with four parameters: mean frequency of noise-to-harmonics ratio (NHR), jitter (%), fundamental frequency (f0) and shimmer (%). The result showed that the patients often have subjective complaints as the first symptom after TT. In addition, the laryngeal electromyography (LEMG) showed absence of subclinical lesion of the laryngeal nerve in all patients.

Literature also contains articles that compare the acoustic effects on the voice of subjects who underwent other treatments not directly related to the voice.

A study had the objective of assessing the acoustic changes before and after adenotonsillectomy in 40 children aged five to 24 years, with and hypertrophic adenoids and palatine tonsils, using the MDVP system, and compared them with 40 healthy children of the control group. The investigated parameters were average fundamental frequency (f0), Jitt, Shim, NHR, voice turbulence index (VTI), soft phonation index (SPI), degree of unvoiced segments (DUV), and degree of voice break (DVB). After the surgery of the group of study there was a reduction of all parameters under analysis compared to the control group, suggesting that the surgery provides improvement of the voice quality with reduction of the inadequate nasal resonance, even without voice therapy6. Mora R, Crippa B, Dellepiane M, Jankowska B. Effects of adenotonsillectomy on speech spectrum in children. Inter J of Pediatric Otorhinolaryngol. 2007;71:1299-304..

A study verified the effects of deep brain stimulation on the voice of 19 patients with Parkinson's disease, comparing them with a control group, i.e., 10 patients who did not agree in undergoing deep brain stimulation, and also with a group of 11 subjects without Parkinson's disease and who had normal voice. The results indicated improvement in the voice of the patients with Parkinson's disease treated with deep brain stimulation, showing that this method can be used to provide significant improvement in patients with severe vocal dysfunction 3737 . Lee VS, Zhou XP, Rahn DA, Wang EQ, Jiang JJ. Perturbation and nonlinear dynamic analysis of acoustic phonatory signal in Parkinsonian patients receiving deep brain stimulation. J Comm Disord. 2008;41(1):485-500..

Another study1414 . Franca MC, Simpson KO. Effects of systemic hydration on vocal acoustics of 18- to 35-year-old females. Commu Disord Quarterly. 2012;34(29):29-33. investigated the effects of body hydration on vocal acoustic by analyzing the RAP and shimmer values and using the MDVP system. The sample comprised 38 female individuals aged 18 to 35 years. The subjects were evaluated in both the conditions of dehydration and hydration. For the first condition, the individuals were examined after 12 hours off liquids and foods. For the second condition, the individuals were examined after ingesting one liter of water for 20 minutes and waiting 70 minutes for the water to be absorbed by the body. Samples of sustained vowels for both conditions were collected and analyzed.

With respect to the findings of this study1414 . Franca MC, Simpson KO. Effects of systemic hydration on vocal acoustics of 18- to 35-year-old females. Commu Disord Quarterly. 2012;34(29):29-33., the authors found a decrease (improvement) in all comparisons made in both dehydration and hydration conditions regarding the RAP and shimmer values. Thus, according to these findings, the effect of water on vocal acoustic is positive, even though no significant difference was found in all vowels comparisons.

In general, it is possible to see that MDVP is largely used when the purpose is to compare the effects of diverse treatments on the voice, and that it has been very effective in providing numerical data that reveal even the subtlest changes that a perceptual-auditory analysis would no be able to show.

Conclusion

The measures most commonly used by researches are f0, jitter, shimmer and NHR because they are considered the most important objective measures for assessing patients with voice disorders. On the other hand, tremor measures such as FATr and FFTr are measures that may fluctuate depending on the sample length and the positioning of measurement points for the analysis, and so they are not very reliable.

The literature shows that the most consistent measures that present high agreement with other acoustic analysis programs are those related to f0. But frequency and amplitude perturbation measures show some divergences, and some studies found variability when comparing the results with other programs.

Various studies sought to characterize different types of voices, such as those of subjects of both sexes without vocal alterations, in an attempt to establish a parameter of normality of different population groups. Such studies are very important in order to set parameters that define voice alterations via acoustic analysis. In addition, some studies characterize voices with different disorders.

The literature showed that acoustic analysis is an objective and necessary resource to assess patients with vocal disorders and, particularly, to compare different types of treatment. In addition, it can also be used in clinical practice as a tool for monitoring surgical procedures or in speech therapy. Based on the advantages in using acoustic analysis and the possibility of MDVP in analyzing diverse quantitative parameters, the use of this program can provide great benefits in the practice of speech therapy.

Referências

  • 1
    González J, Cervera T, Miralles JL. Análisis acústico de la voz: fiabilidad de un conjunto de parámetros multidimensionales. Acta Otorrinolaringol Esp. 2002;53: 256-68.
  • 2
    Tezcaner CZ, Ozgursoy SK, Sati I, Dursun G. Changes after voice therapy in objective and subjective voice measurements of pediatric patients with vocal nodules. Eur Arch Otorhinolaryngol. 2009;266:1923-7.
  • 3
    Aguero PD, Tulli JC, Moscardi G, Gonzalez EL, Uriz AJ. Estimating RASATI scores using acoustical parameters. J Physics. 2011;332(1):1-8.
  • 4
    Finger LS, Cielo CA, Schwarz K. Acoustic vocal measures in women without voice complaints and with normal larynxes. Braz J Otorhinolaryngol. 2009; 75(3):432-40.
  • 5
    Beber BC, Cielo CA. Medidas acústicas de fonte glótica de vozes masculinas Normais. Pró-Fono R Atual Cient. 2010;22(3):299-304.
  • 6
    Mora R, Crippa B, Dellepiane M, Jankowska B. Effects of adenotonsillectomy on speech spectrum in children. Inter J of Pediatric Otorhinolaryngol. 2007;71:1299-304.
  • 7
    Conterno G, Cielo CA, Elias VS. Características vocais acústicas do som basal em homens com fissura pós-forame reparada. Rev CEFAC. 2011;13(1):171-81.
  • 8
    Roman-Niehues G, Cielo CA. Modificações vocais acústicas produzidas pelo som hiperagudo. Rev CEFAC. 2010;12(3):462-70.
  • 9
    Bonilha HS, Dawson AE. Creating a Mastery Experience During the Voice Evaluation. J Voice. 2012;26(5):665-71.
  • 10
    Hill AC, Ojha S, Maturo S, Maurer R, Bunting G, Hartnick CJ. Consistency of voice frequency and perturbation measures in children. Otolaryngol Head Neck Surg. 2013;148:637.
  • 11
    Zheng YK, Zhang BR, Gong J, Yuan MQ, Ding KL, Rao SK. Laryngeal aerodynamic analysis in assisting with the diagnosis of muscle tension dysphonia. J Voice. 2012;26(2):177-81.
  • 12
    Garcia MJV, Cobeta I, Martin G, Alonso-Navarro H, Jimenez-Jimenez J. Acoustic analysis of voice in Huntington's disease patients. J Voice. 2009;25(2):208-17.
  • 13
    Boucher VJ, Ayad T. Physiological Attributes of vocal fatigue and their acoustic effects: a synthesis of findings for a criterion-based prevention of acquired voice disorders. J Voice. 2010;24(3):324-36.
  • 14
    Franca MC, Simpson KO. Effects of systemic hydration on vocal acoustics of 18- to 35-year-old females. Commu Disord Quarterly. 2012;34(29):29-33.
  • 15
    Shao J, Maccallum JK, Zhang Y, Sprecher A, Jiang JJ. Acoustic analysis of the tremulous voice: Assessing the utility of the correlation dimension and perturbation parameters. J Commun Disord. 2010;43(1):35-44.
  • 16
    Remacle A, Finck C, Roche A, Morsomme D. Vocal Impact of a prolonged reading task at two intensity levels: objective measurements and subjective self-ratings. J Voice. 2012;26(4):177-86.
  • 17
    Leong K, Hawkshaw MJ, Dentchev D, Gupta R, Lurie D, Sataloff R. Reliability of objective voice measures of normal speaking voices. J Voice. 2013;27(2):170-6.
  • 18
    Manfredi C, Giordanoa A, Schoentgenb J, Fraj S, Bocchia L, Dejonckered PH. Perturbation measurements in highly irregular voice signals: Performances/validity of analysis software tools. Biomed Signal Proces. 2012;7:409-16.
  • 19
    Manfredi C, Bocchi l, Cantarella G. A multipurpose user-friendly tool for voice analysis: Application to pathological adult voices. Biomed Signal Proces. 2009;4:212-20.
  • 20
    Amir O, Wolf M, Amir N. A clinical comparison between two acoustic analysis softwares: MDVP and Praat. Biomed Signal Proces. 2009;4(1):202-5.
  • 21
    Oguz H, Kiliç MA, Safak M.A. Comparison of results in two acoustic analysis programs: Praat and MDVP. Turk J Med Sci. 2011;41(5):835-41.
  • 22
    Silva DG, Oliveira LC, Andrea, M. Jitter estimation algorithms for detection of pathological voices. EURASIP J Adv Sig Pr. 2009;1(1):1-9.
  • 23
    Vasilakis M, Stylianou Y. Spectral jitter modeling and estimation. Biomed Signal Proces. 2009;4(1):183-93.
  • 24
    Godino-Llorente JI, Osma-Ruiz V, Sáenz-Lechón N, Cobeta-Marco I, González-Herranz R, Ramírez-Calvo C. Acoustic analysis of voice using WPCVox: a comparative study with Multi Dimensional Voice Program. Eur Arch Otorhinolaryngol. 2008;265:465-76.
  • 25
    Yaser S, Wingate JM. Fundamental frequency characteristics of jordanian arabic speaker. J Voice. 2009;23(5):562-6.
  • 26
    Schwarz K, Cielo CA. Modificações laríngeas e vocais produzidas pela técnica de vibração sonorizada de língua. Pró-Fono R Atual Cient. 2009;21(2):161-6.
  • 27
    D'avila H, Cielo CA, Siqueira, MS. Som fricativo sonoro /Ž/: modificações vocais. Rev CEFAC. 2010;12(6):915-24.
  • 28
    Pereira EC, Silvério KCA, Marques JM, Camargo PAM. Efeito imediato de técnicas vocais em mulheres sem queixa vocal. Rev CEFAC. 2011;13(5):886-95.
  • 29
    Rosa JC, Cielo CA, Cechella C. Função fonatória em pacientes com doença de parkinson: uso de instrumento de sopro. Rev CEFAC. 2009;11(2):305-13.
  • 30
    Petrovic-Lazic M, Babac S, Vukovic M, Kosanovic R, Ivankovic Z. Acoustic voice analysis of patients with vocal fold polyp. J Voice. 2011;25(1):94-7.
  • 31
    Duan J, Zhu L, Yan Y, Pan T, Lu P, Ma F. The efficacy of a voice training program: a case-control study in China. Eur Arch Otorhinolaryngol. 2010; 267:101-5.
  • 32
    Tay EYL, Phyland DJ, Oates J. The effect of vocal function exercises on the voices of aging community choral singers. J Voice. 2012;26(5):672e19-27.
  • 33
    Pedersen M, Eeg M. Does treatment of the laryngeal mucosa reduce dystonic symptoms? A prospective clinical cohort study of mannose binding lectin and other immunological parameters with diagnostic use of phonatory function studies. Eur Arch Otorhinolaryngol. 2012;269:1477-82.
  • 34
    Finck CL, Harmegnies B, Remacle A, Lefebvre P. Implantation of esterified hyaluronic acid in microdissected Reinke's space after vocal fold microsurgery: short- and long-term results. J Voice. 2010;24(5):626-35.
  • 35
    Woo SH, Young-Ik S, Lee SH, Park JJ, Kim JP. Comparative analysis on the efficiency of the injection laryngoplasty technique using calcium hydroxyapatite (CaHA): The thyrohyoid approach versus the cricothyroid approach. J Voice. 2013;27(2):236-41.
  • 36
    Lombardi CP, D'alatri L, Marchese MR, Maccora D, Lo Monaco, Me Crea C et al. Prospective electromyographic evaluation of functional postthyroidectomy voice and swallowing symptoms. World J Surg. 2012;36:1354-60.
  • 37
    Lee VS, Zhou XP, Rahn DA, Wang EQ, Jiang JJ. Perturbation and nonlinear dynamic analysis of acoustic phonatory signal in Parkinsonian patients receiving deep brain stimulation. J Comm Disord. 2008;41(1):485-500.
  • Financial support: CAPES; FAPERGS

Publication Dates

  • Publication in this collection
    Aug 2015

History

  • Received
    29 Jan 2014
  • Accepted
    21 July 2014
ABRAMO Associação Brasileira de Motricidade Orofacial Rua Uruguaiana, 516, Cep 13026-001 Campinas SP Brasil, Tel.: +55 19 3254-0342 - São Paulo - SP - Brazil
E-mail: revistacefac@cefac.br