1 Introduction
Studies on scales measuring attitudes and behavior have a relatively long history, since Thurstone in 1928 proposed a theory on the measurement of attitudes based on psychophysics models (THURSTONE; CHAVE, 1928). Thurstone’s approach to psychophysics is understandable, since in scientific thinking any observation must be translated into a quantitative result to be considered precise and measurable (BUNGE, 2000). In fact, the classical scientific concept of measurement is to assign numerical values to objects and events, according to defined rules (KERLINGER; LEE, 2002).
It is generally accepted, however, that psychological objects have specificities that must be taken into account, when you want to assign numerical values to its specific expression, the human behavior. Every psychic phenomenon – self, consciousness, expectation, attitude, motivation, etc. – comes from interaction with the biological context of the human individual; but this interaction varies along life: it is not the same in childhood, in adolescence, in adulthood or in old age. No doubt this diversity is a condition that enhances complexity; yet there are other dimensions that must be taken into account, that is, social, cultural, educational and economic conditions (SULBARAN, 2009). Any attempt to explain – and predict – this behavior must combine all these, and perhaps, other dimensions. So it is not possible to confine the measurement of behavior to the simple classical paradigm: interestingly, the explanation of behavior must rely on the paradigm of complexity (MORIN; LE MOIGNE, 1999) and must try to combine these different dimensions, namely: the qualitative and quantitative dimensions of behavior. The paradigm of complexity leads us to understand the subject as a living system whose structures, processes and behavior, occur at an established level of complexity. This level of complexity is determined by the system’s position on four factors:
Level of thought and information processed in the system: The higher the level of thought and information in the human system, the greater its complexity (LE MOIGNE, 2011).
Internal variety of the system: The more diverse the experiences and fields of reality constructed and cognitively interpreted by the human system, the greater its complexity (SIMON, 1987).
External Variety system: The greater the variety of the entities with which the system has continuous relationships, the more complex the human system in this criterion (VAZ, 2003).
Integration of the informational variety: The integration of different structures, processes, and patterns of behavior leads to enlargement of the conceptual boundaries and to richer meaning of the constructed reality. The living system develops the ability to deal with uncertainty in all domains of knowledge, and its reasoning becomes probabilistic, not simply deterministic (LE MOIGNE, 2011).
2 Theoretical Framework
Complex thinking recommends that the interpretation of human behavior takes account of all the data collected through the measuring instruments, and that each type of data is specifically analyzed and interpreted in the light of its level of complexity:
First, recording and interpretation of data that allow qualitative view of the subject (the informational dimension of behavior).
Second, data on the magnitude of behavior (the quantitative dimension): the frequency of significant behaviors; the frequency and intensity of emotions experienced by the subject, his energetic dimension (PARREIRA, 2006).
The study aimed at answering both concerns above, seeking to combine the qualitative and quantitative dimensions and reporting such knowledge to the level of complexity in which the subject is positioned.
The researchers adopted the format scale proposed by Likert, as these are the most used scales today and the easiest to construct (BOZAL, 2006). They are ordinal scales, in general, as Stevens points out (1946, quoted in BOZAL, 2006). Stevens categorized scales according to statistical operations allowed by them, and this categorization has become classic. Those focused in this article are ordinal and interval scales.
Ordinal: Allowed transformations are strictly monotone, the scale only preserves the order between variable positions;
Interval: elements maintain the same known distance between them, but zero and origin points of the scale are arbitrary.
The instruments to capture data about behavior should adapt to the specificity of the psychological; only in this way we can create instruments with higher levels of validity and reliability. In the psychological field, the specificities scales must answer are mainly three:
The diversity of subjects is large, specifically the related to demographic variables;
A real zero does not exist; behavioral diversity is largely qualitative, although it also expresses quantitative values;
The spontaneous evaluation of items by the subject is based on a quasi-quantitative assessment, that is, it is based on the use of adverbs of quantity or frequency, which mix qualitative and quantitative thinking (in fact, the words are information – a qualitative dimension – but express intensities – a quantitative dimension).
To achieve this closer adjustment to the reality of the human subject, we start from the spontaneous evaluative behavior of the subjects, who currently use adverbs of quantity and frequency. This study is a continuation of another carried out in 2003 and published in 2006 (PARREIRA, 2006), whose objective was the substitution of ordinal scales by interval scales, in order to reach an adequate mix of qualitative and quantitative factors, and to create scales of accrued validity and reliability. The obtained results are quite similar to those found, for the same adverbs, in the present study, as can be confirmed comparing the Tables 1 and 5 below.
Table 1 Statistical results from 2003 study.
Quantity adverbs | N | mean | sd | min | max | range |
---|---|---|---|---|---|---|
Extremely | 240 | 9.443 | 0.747 | 9 | 10 | 1 |
Quite enough | 240 | 7.418 | 0.850 | 5 | 9 | 4 |
Medially | 240 | 4.765 | 0.860 | 2.5 | 8 | 5.5 |
Little | 240 | 2.207 | 0.868 | 0.5 | 5 | 4.5 |
Nothing at all | 240 | 0.249 | 0.531 | 0 | 3 | 3 |
Source: Parreira (2006).
Table 5 Quantity Adverbs and their observed Statistics in this Sample.
Original adverbs | Translated adverbs | Mea | Median | Mode | Sd | min | max | |
---|---|---|---|---|---|---|---|---|
A- Totalmente | Totally | 9.52 | 10 | 10 | 0.833 | 7 | 10 | 3 |
B- Completamente | Completely | 9.34 | 10 | 10 | 0.892 | 7 | 10 | 3 |
C- Perfeitamente | Perfectly | 9.36 | 10 | 10 | 0.822 | 7 | 10 | 3 |
D- Inteiramente | Entirely | 9.44 | 10 | 10 | 0.821 | 7 | 10 | 3 |
E- Extremamente | Extremly | 9.25 | 9 | 10 | 0.844 | 7 | 10 | 3 |
F- Muito | Very much | 7.62 | 8 | 8 | 1.541 | 4 | 10 | 6 |
G- Bastante | Quite enough | 7.08 | 7 | 8 | 2.121 | 2 | 10 | 8 |
H- Medianamente | Medially | 4.62 | 5 | 5 | 0.841 | 3 | 6 | 3 |
I- Moderadamente | Moderatly | 5.28 | 5 | 5 | 0.898 | 4 | 7 | 3 |
J- Razoavelmente | Reasonably | 5.42 | 5 | 5 | 0.899 | 4 | 7 | 3 |
K- Mais ou menos | More or less | 4.18 | 5 | 5 | 1.211 | 1 | 6 | 5 |
L- Pouco | Little | 2.27 | 2 | 2 | 1.072 | 0 | 5 | 5 |
M- Nada | Nothing at all | 0.54 | 0 | 0 | 0.873 | 0 | 3 | 3 |
Source: Authors research (2013).
These similar scores show that there is a high stability of the proposed evaluations: even very different composition and staggered in time (10 years) samples produce very close numerical scores. Thus, the use of these qualitative/quantitative scales seems both quite reliable and characterized by a solid empirical validity.
On this basis, the authors decided to conduct a similar research, aiming to gain a more extensive evidence for the following questions:
3 The Methodology
In this study, the authors started by choosing a set of quantity and frequency adverbs currently used by people when they want to mean the quantitative dimension of a cognition or an emotion. This is the starting hypothesis: A scale based on a stable measurement of the numerical value of these adverbs will present consistent and known distances between the various positions; thus it will be effectively an interval scale.
Any interpretation or action based on it will thus be more precise than the mere ordering and will be more adjusted to what is expressed by the subjects, that is, its validity will be accurated. A list of adverbs of quantity/intensity- those most frequently used in Likert scales – was presented to a sample of people. People were asked to attribute a numerical value to each adverb or adverbial phrase on a scale of 0 (meaning 0% intensity) to 10 (meaning 100% intensity), in order to be used as a quantitative scale measuring attitudes, emotions and behavioral patterns.
This procedure is similar to Thurstone’s, who also asked respondents to evaluate the numerical value of a sentence (the difference being that in this case, it is a quantity or frequency adverb or adverbial expression). Its advantage is the possibility of being used with any sample of people who will answer a test or questionnaire, as the scale is independent of the content of the evaluated sentence. The same procedure was used with frequency adverbs; in this case, however, the numerical scale used frequency reasoning, evaluating the frequency adverbs within a continuum from 0% frequency to 100% frequency.
3.1 The Sample
The sample, originally composed by 219 subjects, included mainly University students, some of them workers. Sample treatment: Evidently incongruent subjects or participants with missing data were excluded from the original sample, using the listwise method. Subjects that were not able to correctly understand Portuguese language, that is, the meaning of some adverbs, and detected outliers were also excluded. The final sample was composed by 198 subjects, although the following tables are based on 219 observations (Tables 2, 3 and 4).
Table 2 Frequency observed by gender.
Gender | Frequency | Percent |
---|---|---|
0 | 146 | 66.9 |
1 | 72 | 32.9 |
NA | 1 | 0.5 |
| ||
Total D | 37 | 100 |
0 = Male
1 = Female
Source: Authors research (2013).
Table 3 Frequency of age by classes.
Classes | Frequency | Percent |
---|---|---|
[19,34[ | 114 | 52.1 |
[34,47[ | 62 | 28.2 |
[47,60[ | 35 | 16 |
[60,73[ | 7 | 3.2 |
NA | 1 | 1.5 |
| ||
Total | 219 | 100 |
Source: Authors research (2013).
Table 4 Level of education by groups.
Classes | Frequency | Percent |
---|---|---|
1- Basic (5 years) | 3 | 1.4 |
2- First cycle (6 years) | 4 | 1.8 |
3- Second cycle (9 years) | 4 | 1.8 |
4- Secondary level (12 years) | 20 | 9.1 |
5- University level | 168 | 76.7 |
6- Post-Graduate and PhD | 14 | 6.4 |
NA | 6 | 2.8 |
| ||
Total | 219 | 100 |
Source: Authors research (2013).
3.2 Procedure
Two adverb lists were presented to the subjects of the study: one about quantity adverbs or adverbial expressions, and another about frequency adverbs or corresponding adverbial expressions. Subjects were asked to evaluate the numerical significance of the adverbs in the list, in accordance with the following.
3.3 The results
3.3.1 Quantity adverbs
As one can see (Table 5), the adverbs that indicate extreme positive or negative intensities have a more precise evaluation, with less dispersion; those in the middle of the scale are less precise and are more disperse. The same occurred with the frequency adverbs. This is a result compatible with theories on psychological judgement:
The study is complemented by the graphs presented in Figures 1, 2 and 3, resulting from the use of the Principal Component Analysis methodology for ordinal data as described in Borg and Groenen (2005), (procedure CATPCA in IBM SPSS-Statistics); it confirms the idea that it is possible to build an interval scale based on quantity adverbs, as could be expected.

Figure 1 Main component analysis and discrimination measures of different categories of quantity adverbs.
3.3.2 Frequency adverbs
In the case of frequency adverbs, the procedure was similar to that of quantity adverbs. It was presented an example of a sentence using a frequency adverb to subjects and they were asked to evaluate it in a scale as the shown in the box.
The Table 6 shows the results obtained with this procedure.
Table 6 The evaluated frequency adverbs and their observed statistics in this sample.
Adverbs | Mean | Median | Mode | Std | Min. | Max. |
---|---|---|---|---|---|---|
Always | 88.387 | 90 | 100 | 14.39 | 50 | 100 |
Extremely frequent | 83.785 | 90 | 90 | 13.33 | 50 | 100 |
Frequently | 68.536 | 70 | 80 | 16.85 | 30 | 100 |
Many times | 67.177 | 70 | 70 | 16.19 | 30 | 100 |
Quite enough times | 68.491 | 70 | 70 | 15.83 | 30 | 100 |
sometimes | 39.625 | 40 | 40 | 16.59 | 10 | 80 |
Rarely | 16.012 | 10 | 10 | 9.71 | 10 | 40 |
Never | 1,283 | 0 | 0 | 3.36 | 0 | 10 |
Source: Authors research (2013).
3.4 Practical Applications
The obtained results enable the construction of different equivalent scales, which can be adapted by the researcher to: respondents sample, issue and situation, according to the research objectives. This condition facilitates the transfer of Thurstone´s perspective to different situations and instruments, enhancing the researcher’s flexibility.
Two examples of these different scales (with 5 or 6 degrees based on the two categories of adverbs) can be found below (Tables 7 and 8).
Table 7 . First example: a six position scale.
A 6 position scale, adjusted to a questionnaire with items is presented below (responses and their treatment serve only as an example): | ||||||
---|---|---|---|---|---|---|
Extremely important | (E) | |||||
Very important | (M) | |||||
Enough important | (B) | |||||
Medially important | (md) | |||||
Little important | (P) | |||||
Not at all important | (N) | |||||
| ||||||
E | M | B | md | P | N | |
1. Is it important that the teacher gives incentives, praises and demonstrates personal esteem to his pupils, to motivate them? | x | |||||
2. Is it important that the teacher gives a permanent example of openness to new ideas? | x | |||||
| ||||||
These items focus the skill to motivate the students to develop more complete and open perceptions of reality. The respondent evaluates item 1 at level M = 7,62; and item 2 at level E = 9,25 |
Source: Authors research (2013).
Table 8 Second example: a five position scale.
A 5 position scale, adjusted to a questionnaire with items is presented below: | |||||
---|---|---|---|---|---|
Totally true | (E) | ||||
Enough true | (B) | ||||
Medially true | (md) | ||||
Little true | (P) | ||||
Not at all true | (N) | ||||
| |||||
T | B | md | P | N | |
1. My interactions with other people are not very positive. | x | ||||
2. My productive activities effectively create the resources for my subsistence. I am indeed an effective worker | x | ||||
| |||||
The subject has a positive and an assertive selfimage as a productive worker, with a score of 9,52; a positive image relying mainly on economic productivity; the relational dimension is not so positive; its distance to the top of the scale is quite large (numerical score 4,62). It must be said that current computational devices offer no difficulties to using these scales. |
Source: Authors research (2013).
3.4.2. The Frequency adverb scales
The based on frequency adverbs scales are presented below. A first example of this scale, showing the numerical values of the adverbs and the distance between positions can be observed in Table 9.
Table 9 First example: an example of a frequency adverbs scale.
Adverbs | Numerical values | Distances |
---|---|---|
A- Always | 88,387 | 19,8984 |
E- Enough times | 68,491 | 28,7663 |
F- Sometimes | 39,625 | 23,6132 |
G- Rarely | 16,012 | 14,7291 |
H- Never | 1,283 | - |
Source: Authors research (2013).
d1= G-H; d2 = F-G; d3 = E-F; d4 = A-E.
A second example: the frequency adverbs scale applied to a motivation test1
In its first part, the motivation test confronts value objects and situations, in sets of three. The purpose of this part is to confront the person with affectively guided choices – motivational dilemmas – which is the way motivation works, according to the motivational theory on which the test is based. In this part, the scale is composed by quantity adverbs.
The test
First part: the subjects mark the sentence they consider as the most important in the set; then, they choose the second one in importance and mark its position in the scale; finally, they mark the third one in relevance.
The questionnaire includes 21 sets, like the presented above, covering the most important areas of daily life: Personal life and family (11 sets); Work (5 sets); Leisure and free time (2 sets); Friends and friendships (3 sets).
Second part: the test measures the affective tone of the subject’s life, by registering - in the frequency adverbs scale – the frequency of emotions felt in his personal life and at work (results shown in Table 10).
Table 10 Frequency adverbs: results from the motivation test.
Situations | Frequency in personal life (%) | Frequency in work (%) |
---|---|---|
A- Situations that generate feelings of warmth, sympathy, friendship | 71.39 | 59.69 |
R- Situations that generate feelings of anger, irritation | 26.09 | 35.97 |
C- Situations that arise feelings of curiosity, desire to know, lust for information | 57.12 | 65.54 |
N- Situations that arise feelings of rejection, revulsion, disgust | 9.04 | 23.99 |
M- Situations that arise feelings of fear | 23.30 | 27.87 |
D Situations that arise relax and certainty feelings | 58.88 | 46.95 |
S- Situations that arise feelings of satisfaction, joy, enthusiasm | 62.17 | 46.96 |
T- Situations that arise feelings of sadness, abandon, depression | 19.51 | 26.24 |
O- Situations that arise feelings of pride and a sense of personal importance | 63.69 | 86.44 |
V- Situations that arise feelings of guilt and shame | 9.47 | 11.14 |
The items with substantial meaning differences are presented in italic black.
Source: Authors research (2013).
Using the scale below, please mark the frequency of each feeling in your personal life and in work situations;
Scale for this part:
Table 11 shows that this procedure adequately evidences the contribution of interval scales to a more precise evaluation of behavioral factors and variables.
Table 11 Paired samples test: the variables with substantial mean differences.
Pair differences | t | df | Sig2 extremes) | |||||
---|---|---|---|---|---|---|---|---|
| ||||||||
Mean | St | 95% difference reliable interval | ||||||
| ||||||||
Inferior | Superior | |||||||
Pair | A | 11.713 | 16,963 | 39.918 | 19.434 | 3,164 | 20 | 0,005 |
Pair4 | N | -14.957 | 23,151 | -252.222 | -46.923 | -3,03 | 21 | 0,006 |
Pair6 | D | 11.936 | 28,377 | -0.6454 | 245.181 | 1,973 | 21 | 0,062 |
Pair7 | S | 15.211 | 29,525 | 21.204 | 283.022 | 2,416 | 21 | 0,025 |
Source: Authors research (2013).
3.2.1 Internal consistency
The value of the realibity for the internal consistency (SIJTSMA, 2009) is 0.861 for the 61 items, which means the motivation test presents internal consistency.
4 Conclusion
This study opens a path to the elaboration of interval scales suited to several types of psychological and sociological questionnaires, as they can be adapted to different speeches, ages, professional experiences, and cultural settings. No doubt it will be interesting for the researcher to have different options of scales, adjusted to the issue and to the sample under study. If those scales combined the words used by the subjects in their daily life, and if they could be quantified as true interval scales, we would obtain a fiable and valid instrument for behavioral research.
So this procedure allows us affirmatively answering the first question: the use of adverbs is an interesting basis for the construction of scales combining qualitative and quantitative approaches in a valid and reliable way. It allows avoiding the inaccuracies sometimes seen in Likert scales (LIKERT; ROSLOW; MURPHY, 1993) used by the authors, as shown in the two examples shown below:
In the first example, the position undecided does not contain an intensity position: it actually means a refusal of expressing a position; it is away of expressing frequency reasoning. The second example contains three positions out of the pretended scale: agree and disagree express the qualitative meaning of the position, but do not indicate its quantitative meaning; and the word neutral is even more explicitly away from the scale than undecided: they are expressions out of a scaling reasoning.
According to that, it is believed that the use of adverbs and of the frequency scale construction increases the accuracy eliminating ambiguities.
The results of this study constitute also an affirmative answer to the specific question about validity and reliability, as it was shown along the paper and especially in the presented examples.
In behavioral studies, it is quite inappropriate to speak about a zero position; but we hope that this study helps to define a minimum position (equivalent to zero in each population to whom the scale is applied). Actually this is what is used in every behavioral reasoning: there is a zero pint to each population in each variable. In face of these results, the objectives of this study have been reached and this could be useful to other behavioral scientists.