Executives ’ speech expressiveness : analysis of perceptive and acoustic aspects of vocal dynamics

Purpose: To analyze speech expressiveness in a group of executives based on perceptive and acoustic aspects of vocal dynamics. Methods: Four male subjects participated in the research study (S1, S2, S3, and S4). The assessments included the Kingdomality test to obtain the keywords of communicative attitudes; perceptive-auditory assessment to characterize vocal quality and dynamics, performed by three judges who are speech language pathologists; perceptiveauditory assessment to judge the chosen keywords; speech acoustics to assess prosodic elements (Praat software); and a statistical analysis. Results: According to the perceptive-auditory analysis of vocal dynamics, S1, S2, S3, and S4 did not show vocal alterations and all of them were considered with lowered habitual pitch. S1: pointed out as insecure, nonobjective, nonempathetic, and unconvincing with inappropriate use of pauses that are mainly formed by hesitations; inadequate separation of prosodic groups with breaking of syntagmatic constituents. S2: regular use of pauses for respiratory reload, organization of sentences, and emphasis, which is considered secure, little objective, empathetic, and convincing. S3: pointed out as secure, objective, empathetic, and convincing with regular use of pauses for respiratory reload and organization of sentences and hesitations. S4: the most secure, objective, empathetic, and convincing, with proper use of pauses for respiratory reload, planning, and emphasis; prosodic groups agreed with the statement, without separating the syntagmatic constituents. Conclusion: The speech characteristics and communicative attitudes were highlighted in two subjects in a different manner, in such a way that the slow rate of speech and breaks of the prosodic groups transmitted insecurity, little objectivity, and nonpersuasion.


INTRODUCTION
With regard to executive professionals, organizations have invested in the improvement of their collaborators' speech expressiveness.This action has been considered a strategic factor to achieve competitive differential, credibility, and positive reputation.Such an investment includes not only the opening of new communication channels, but also the collaborators' training, especially executives, whose speeches will represent the identity of a company (1) .
As the present investigation comprises aspects related to executives' speech expressiveness, it is important to point out some concepts about this role.Executives work inside an organization with the role of planning, organizing, managing, and distributing tasks that are performed by other employees.They establish a formal leadership and clarify to the collaborators about what is expected from them (2,3) .
On the basis of the definition of expressiveness as a level of information in the communication process (4) , speech expressiveness is built from interactions between segmental (vowels and consonants) and prosodic (rhythm, intonation, voice quality, elocution rate (ER), pauses, and stress) elements and between sound and meaning.Perception of the resources used in speech expressiveness happens in the interaction of what is the speaker's intention and what is the listener's interpretation (5) .
Prosody and constituting elements mold the enunciation; therefore a "manner of speaking" is printed to what has been speaking, which may be intentionally or not intentionally and listener-directed.With regard to the communicative role, there is a reference to a hearable aspect of the prosodic elements.In the expressiveness plan, there is a distinction between the prosodic roles, comprising expression of attitude, interpersonal posture, and kind of elocution; the affection roles, such as sadness, joy, anger, and affections such as humor; and indicial roles, which are signs of gender, social origin, and dialect (6) .
According to speech language pathologists' practice regarding speech expressiveness of radio professionals, the term speech expressiveness refers to the transmission of emotions and intentions in the message (7) .
The study (8) about persuasive effects of voice quality and dynamics in advertising broadcasters demonstrated that the assessed speakers with the most convincing, enthusiastic, and pleasant speech presented lower pitch, strategic demarcation of pauses, accurate articulation, and average ER.However, speakers considered as little convincing and pleasant had higher pitch and fast ER.
The variation in parameters constituting vocal dynamics and effective indication of frontiers between enunciations allow the comprehension of the enunciation and favor the listener/ speaker's attention focus (9) .
The study with teachers (10) was carried out applying the speech expressiveness assessment in pedagogy students, using, in order of preference, classifications such as motivating, pleasant, catch the attention, and transmits solidity and hesitation.Speech language pathologists evaluated the use of pauses and duration of enunciations, ER, articulation, intonation, and occurrence of melodic standard repetition.The teacher, whose speech expressiveness was positive, presented accurate articulation, ER with average variation, and pitch variability, and his or her speech seemed objective and secure.The teacher, whose speech expressiveness was negative, presented inaccurate articulation, slow ER, and little pitch variability, with a speech that seemed insecure for the listeners.
With the aim of studying emotion effects on the voice, speech, and fluency when speaking in front of an audience, a study (11) analyzed the speech of four subjects in the following situations: seminar/lecture; interview; moments before the lecture and interview; right after the lecture.The perceptiveauditory assessment for the analysis of voice and speech fluency parameters and discourse analysis was done.In conclusion, the changed emotional status interfered in the pneumo-phonoarticulatory coordination, in the production of vices of the language, in speech intelligibility, in the appearance of hesitations, repetitions, extensions, and filling sounds.
With regard to essential frequency, deeper voices are considered more pleasant (12) and transmit more safety, authority, and credibility (13) .The ER can be classified as slow, average, or fast and is directly related to temporal standards.Its variations affect not only the duration but also the quality of segments (14,15) .
The articulation rate (AR) is the proportion between the speech total time (including filled pauses, syllable extensions, and silent pauses) and the number of phonetic syllables per second.The AR can be considered as slow, average, or fast, in which 3.7-4.8syllables per second is average and 4.6-6.3syllables per second is fast (16) .
Pause is a prosodic element that helps constructing and comprehending the discourse meaning, favors the change of shifts between speakers, and enables message processing, as it delimitates the prosodic continuity.Pauses coincide with synthetic frontiers, which, in general, mark frontiers between sentences.Their use is indicated for the movement of synthetic elements and for signaling a change in the semantic content.Unexpectedly, the use of pauses may represent a hesitation that reveals speech reorganization.Furthermore, it can be used as a resource for emphasis, to call the attention to what will be said (17) .
According to some authors (8,(18)(19)(20) , pauses can be studied regarding their structure, role, distribution, and time.With regard to structure, pauses can be silent when there is actually a silence that can be used for different purposes; filled, when they happen in nonlinguistic content productions, such as extension of some parts of a word, in sounds like [m] and breathing in with a hearable noise; or signalized, when there is f0 variation or break in the speech flow.Pauses can be divided based on their role into discursive pauses of planning to demarcate the constituents of sentences (words, groups of words or sentences and they are used to organize parts of the discourse, such as the beginning and end of a report); expressive pauses have the role of highlighting words that the speaker wants to emphasize, thus producing a certain effect; and respiratory pauses happen due to a physiological reason.Pauses may happen between or in the enunciations, with different roles according to the speaker's intention and needs, and they can be brief or long regarding their temporal classification (21) .
In the study (22) about the use of pauses in different television styles, pauses were classified into delimitative for the planning and structure of the discourse; and planning to arrange speech continuity and discursive structuration, which organizes parts of the discourse.Speech samples from two journalists in five television shows were chosen.The measures were taken by means of the acoustic analysis, with segmentation of the narration times and silent pauses in milliseconds (ms).Brief silent pauses presented measures between 50 and 250 ms and a plosive element together with the pause; pauses were those above 200 ms.Long silent pauses showed more than 250 ms.The results point out changes when using pauses due to the kind of narrative.In conclusion, pauses, in the style of television interactive shows, are meant to build speech shifts and, among other prosodic elements, their appropriate use is essential to build speech expressiveness as they perform a delimitative and signalizing role of meaning effects.
The objective of this research was to analyze the speech expressiveness in a group of executives based on perceptive and acoustic data of vocal dynamics.

METHODS
The Ethics Committee from Pontifícia Universidade Católica de São Paulo (PUC-SP) approved this study, using the protocol number 423/2011.
The Kingdomality test (23) , which was created for a job placement program, was used with the aim of characterizing subjects regarding the corporative styles.Based on them, semantic keywords corresponding to communicative aspects used in the analysis of perceptive-auditory impressions about subjects' speech expressiveness were chosen.The program comprises four groups of corporative styles, namely supporters, challengers, maintainers, and explorers.In the Kingdomality test, the authors mention terms related to speech expressiveness and communicative skills to describe the corporative style of each group, such as being a good listener, empathetic, objective, convincing, polite, and secure, among others.
For speech sample collection, the speech was recorded and it was characterized through reports with suggestions of investment products.It ended with a persuasive sentence about the products presented in the advisory service.
Each subject was individually recorded for 10 min, in a silent room, using a Sony digital recorder, model ICD-PX312/ PX312F, attached to a head microphone that was positioned at 10 cm from the left oral rhyme.Recording was done through the acoustic analysis software from Vocalgrama program.Digital representation and edition of data were performed through the Sony Sound Organizer program, version 1.1, installed in a laptop computer from Dell Inspiron.
In the perceptive-auditory analysis of vocal quality, some aspects of vocal production were considered for the assessment, such as presence or absence of vocal quality alteration; balance or imbalance of the resonance system, and accuracy or inaccuracy in articulation (speaking comprehension or unintelligibility).
The perceptive-auditory analysis of the voice dynamics had as aim the assessment of voice dynamic aspects, such as pitch variation, loudness variation, speech continuity, ER, and respiratory support.The voice dynamic fragment of the instrument Vocal Profile Analysis Scheme (VPAS) was used for this purpose (24) .Three speech language pathologists, acting as judges, took part in the analyses.They are graduated in phonetics and specialized specifically in assessment under phonetic perspective (VPAS-PB).They received via e-mail the recordings with the persuasion sentence of the four subjects and the assessment instrument.
For surveying the impressions caused by executives' speech expressiveness, related to communicative attitudes, four qualitative keywords described in the Kingdomality test were chosen, namely secure, empathetic, objective, and convincing.A semantic questionnaire was prepared to evaluate 30 judges, who are considered here as probable investors of several occupations and different ages, and they performed the judgment together with the recordings.
The Praat program, version 5.2.21 (www.praat.org),was used for the acoustic analysis -this software is freely distributed for acoustic phonetic research.The following measurements were achieved through the acoustic analysis: measures of essential frequency -f0 (minimum f0, maximum f0, and f0 extension in Hertz, Hz), which were taken in all the sentence extension by writing the minimum and maximum f0 values, and extracting extension measures of f0; measures of durationvowel-to-vowel unit measure (VV unit), which comprises the extension of a vowel until the beginning of the following vowel.In every sentence, the unities of produced enunciations were delimitated and segmented by each of the four subjects.After eliminating the unities, division of the number of VV unities by the duration of sentence time was done.Thereunto, the Z-score statistic test was applied to these measures to obtain a mitigation of the VV unities duration contour, which facilitated verifying the highlighted duration peaks through separation of the prosodic groups.The results considered as the most important or significant peaks those that surpassed that red upper line of the graphics 5, 6, 7, and 8 for the subjects 1, 2, 3, and 4 respectively.The ER measurement of the enunciation was calculated as the number of syllables per second; whereas the AR of the enunciation measurement was obtained through the division of phonetic syllable number by second; pauses were perceptively marked with support in the inspection of sound wave and of the wideband spectrogram.
Pauses were classified regarding their structure into silent, filled, and signalized pauses.With regard to their role, they were divided into respiratory, discursive, and expressive pauses, and also, as to time characteristics, into long or short lengths and in regular or irregular intervals.
In this study, the pause classification model (21) was used (Chart 1).
Data descriptive analysis was performed through absolute and relative frequencies, central (mean and median) and dispersion (standard deviation, minimum, and maximum) tendency measurements.The 95% confidence interval (95%CI) was calculated as estimation per interval to verify the difference between proportions.
To evaluate similarity among the research subjects, the multivariate analysis of hierarchical agglomerative cluster was used applying the technique of closest neighbor and Euclidian distance dissimilarity, through simple connection.The graphic representation was presented in a dendrogram.In such process, all variables were standardized, namely a) nominal variable, simple transformation of alpha-numerical to numerical character; b) ordinal variable, transformation through equation (real value (value above scale 1); and c) continuous variable, transformation through equation ((real value -mean)/standard deviation).
For values regarding the division of VV unities number by the sentence time duration, for each subject, the Z-score test and mitigation of the curve through mobile mean were performed.The subjects were compared using the Kruskal-Wallis test.Nonparametric tests were used because the continuous variables did not present a normal distribution in the Kolmogorov-Smirnov test.
A 5% descriptive level for statistical significance was considered.The statistical analyses were done in the statistical program Statistical Package for the Social Sciences (SPSS) software for Windows, version 17.0, and in EstatCamp.

RESULTS
The characterization of subjects shows that they were especially male; there were two administrators, and two economists working as managers.With regard to the Kingdomality corporative style, two of them were challengers and the other two were helpers.
Emissions of subjects were described in speech fragments and pause moments (indicated with a slash -/ -, the parentheses indicated the kind regarding structure; role; time had its length signalized between # and in ms) and are found in Appendix 1.

Perceptive-auditory analyses
With regard to quality and dynamics of the voice and impressions about speech expressiveness related to semantic keywords and acoustic analysis, the conclusions were seen as follows.
S1 presented appropriate vocal quality, precise articulation, and laryngopharyngeal resonance; lowered habitual pitch, pitch with decreased extension, decreased variability of pitch, increased habitual loudness, neutral extension of loudness, decreased variability of loudness, interrupted continuity, slow ER, and inappropriate respiratory support.S1 was also more rejected because the subject was pointed out as insecure, nonobjective, nonempathetic, and nonconvincing.There were also lower values of elocution and ARs; four moments of pause, in which one was a respiratory pause that lasted 143 ms and the others were characterized by hesitations, i.e., signalized pause, with long syllabic duration in plosive sound (586 ms); filled, breathing in with hearable noise (378 ms); and filled, characterized by hesitation and total obstruction of articulators in the production of plosives (1,688 ms).Intervals between pauses were irregular because the pauses were used with breakings of prosodic groups and separation of syntagmatic constituters.
S2 presented appropriate vocal quality, precise articulation, and balanced resonance; neutral habitual pitch, decreased extension of pitch, decreased variability of pitch, neutral habitual In comparison with S1, S3 presented intervals more regularly, used pauses of respiratory reload, and emphasis pauses at speaking, even though S3 had some hesitation moments similarly to S2. S4 showed appropriate vocal quality, precise articulation, and balanced resonance; lowered habitual pitch, decreased extension of pitch, decreased variability of pitch, neutral habitual loudness, neutral extension of loudness, decreased variability of loudness, uninterrupted continuity, quick ER, and appropriate respiratory support.S4 received higher score, being assessed as the most secure, objective, empathetic, and convincing; values indicated lower variation of f0; values were considered average for ER and AR; there were five moments of pause, three discursive, for planning and structuring of speech (991, 532, and 208 ms), and two expressive ones, characterized as a resource of word emphasis.Intervals between pauses were regular, did not present hesitation, and prosodic groups coincided with the enunciation without separating the syntagmatic constituters.
Using the distribution of subjects, according to the assessment of three speech language pathologists working as judges, the closeness matrix was calculated (Chart 2).Based on such data, we verified a formation in which S3 and S4 are close, creating a group with S2 that is a little farther (Figure 1).
With regard to impressions caused by executives' speech expressiveness, associated with communicative attitudes, the calculation of number and percentage of research subjects was done based on the characteristics of impressions on speech expressiveness related to semantic keywords, which were attributed by 30 laymen (Table 1).
In the comparison performed applying the 95%CI, it was seen a statistically significant difference (p<0.050)among all subjects when "secure" was assessed.There was no difference between S1 and S2 regarding "objective".S3 and S4 presented a difference among themselves and with S1 and S2.In the presentation of "empathetic" and "convincing" features, it was seen a difference among all subjects (p<0.005; Figure 2).

Acoustic analysis
The distribution of absolute values for essential frequency measures (Table 2) and values associated with essential frequency measures for the four subjects (Figure 3) was seen.
Using the absolute values of the analyzed measures of duration, S2, S3, and S4 were in a group and S1 in another (Table 3) in the division per groups.
There was no difference among subjects regarding the number of VV unities removed (p=0.761).However, when such variable was continuously analyzed based on the subjects, peaks of duration that marked important moments of pause use were seen, both for aspects considered negative at speaking, such as hesitations, and also for positives ones at speaking, such as sentence emphases and structures (Table 4 and Figure 4).

DISCUSSION
With regard to the perceptive-auditory analysis of voice dynamics through the assessment of the majority of judges, S1, S2, S3, and S4 showed lowered habitual pitch.This characteristic may be associated with the fact that they were all men with f0 values, which is in agreement with what was found in the literature related to the male gender and vocal alteration (12) .
With the exception of S1, the other subjects had positive results as to speaking.One of the acceptance factors may be associated with lower pitch, since some studies showed that a deeper voice is more pleasant and creates a higher sensation of safety to the listeners (8,9,13) .
CoDAS 2015;27(2):160-9 As to ER values, both in the perceptive-auditory and acoustic analyses, S1 had a quite slow ER, while S2, S3, and S4 showed a quick ER in the perceptive-auditory assessment performed by the speech language pathologist judges, and mean ER in the acoustic assessment.Subjects with mean ER had higher acceptance with regard to speaking by judges, while the slow ER of a speaker may distract the listener and transmit insecurity (8,10) .
We have seen that S1 had a slow AR, S2 had a mean AR, and S3 and S4 had a quicker AR; these results are similar to those mentioned in another research (19) .
Based on results from the removal of VV unities through sentence time duration, which were demonstrated in the statistical analysis, it was possible to see duration peaks that happened in the emissions of enunciations of every subject concerning separation of prosodic groups, and which were marked when subjects used pauses.
In S1's speech, duration peaks that characterize hesitation and disfluency at speaking can cause the listener a feeling of insecurity, impression of lack of expertise on the content and comprehension difficulty, through the inappropriate separation of prosodic groups with breaking of syntagmatic constituents (9,17,18) .In S4's speech, duration peaks happened to organize the word formation to finish sentences and emphasize words.Such data demonstrate the efficient use to separate prosodic groups and favor the message comprehension with lower possibility of ambiguity (9,17,18) .
Respiratory reload and hesitations formed the pause moments done by S1.Intervals between pauses were irregular, because they were used with breakings of prosodic groups and separation of syntagmatic constituents.This speech formation reinforces the impression pointed out by judges of insecure, very little objective, and nonconvincing speech, because breaking of prosodic groups did not coincide with intervals of word groups.Use of pauses in some speech fragments was inappropriate, resulting in the loss of discourse meaning and difficulty of comprehension of the message by the listener, also causing negative impressions regarding speech expressiveness (8,9,14,(18)(19)(20)(21)(22) .
S2 had several pause moments with some hesitations.Intervals happened more regularly if compared to those from S1, because S2 used pauses for respiratory reload and for structure and organization of sentences and emphasis in the speech.These speech characteristics from S2 received more acceptance because they enabled a better comprehension of the message and transmitted positive impressions, such as convincing, empathetic, and secure.Hesitation moments may be responsible for the impression of being considered as very little objective (8,9,14,(18)(19)(20)(21)(22) .Like S2, S3 presented good use of pauses with more regular intervals, used respiratory reload and speech emphasis pauses, even though it had some hesitation moments.The way as S3 made use of pauses enabled a better comprehension of the message and collaborated for being the second one best ranked as positive impressions, i.e., convincing, empathetic, secure, and objective (8,9,14,(18)(19)(20)(21)(22) .
S4 used pauses for speech planning and structuring and as a word emphasis resource.Intervals between pauses were regular, did not present hesitation, and prosodic groups coincided with the enunciation, without separating the syntagmatic components.Pauses that were performed by S4 may be considered strategic ones, because speech was articulated and created unanimity among judges when they evaluated speech expressiveness of this subject as positive, therefore as convincing, secure, empathetic, and objective (8,9,14,(18)(19)(20)(21)(22) .In addition to the reference of pause uses, the discursive pauses applied by S2 and S4 revealed  Caption: VV = vowel to vowel these executives' ability to identify relevant parts of the speech context and sensitivity in highlighting them to the listener, making the message process easier (8,9,14,15,(18)(19)(20)(21)(22)25,26) . The exressive pause, according to literature (19) , can be considered as emphatic, because it has the aim of emphasizing the provided information.Expressive pauses used by S2, S3, and S4 showed subjects' ability to use the emphasis resource when marking certain words, and showed the intention of reinforcing the transmitted ideas.
With regard to the executive professional (2,3) , it is expected that this professional is a secure, convincing, and objective person, which are characteristics of persuasion.Based on this, it is important to emphasize that S4 was distinguished due to positive aspects, whereas S1 showed up due to negative ones, associated with communication resources.

Figure 2 .
Figure 2. Rate and 95% confidence interval (95%CI) for the four research subjects for the categories (A) secure, (B) objective, (C) empathetic, and (D) convincing Closeness matrix according to the assessment of vocal quality and dynamics characteristics, performed by judges who are speech language pathologists for the four subjects 3.2.Change of vocal quality: change in voice quality of the end of the enunciation Functional 1. Respiratory 2. Discursive: speech planning or structure of parts 3. Expressive: emphatic Temporal 1. Length: short or long 2. Intervals: regular or irregular Chart 1. Classification of pauses Chart 2.

Table 1 .
Number and percentage of research subjects according to the characteristics of the impressions regarding the speech expressiveness related to semantic keywords, attributed by 30 non-expert judges

Table 2 .
Absolute frequency of the essential frequency measures for the four subjects

Table 3 .
Absolute frequency of duration measures for the four subjects