THE MINIMAL WORD IN EUROPEAN PORTUGUESE: THE ORALIZATION OF ABBREVIATED FORMS

■ ABSTRACT: The phonological constraints that define which phonetic chains are eligible to become a word in a given language generally include a Minimality Condition (MC). MC imposes a minimal amount of phonological substance that must be included by any single word of the language. Traditionally, MC is measured in terms of syllable weight or syllabic extension of the word. It has been much debated whether MC corresponds to a truly universal constraint; as for Portuguese, previous studies have been conflicting between the acceptance and the refusal of its productivity in the phonology of the language. In this study, we will assess the functionality of MC for the building of Portuguese lexical entries by means of observing the oralization of abbreviations. This is assumed here as a word-productive process in Portuguese. Based upon an experimental study with a group of native speakers of Contemporary European Portuguese (CEP), we conclude that (i) MC is an operative constraint in the phonology of Portuguese, and (ii) its defining criterion is purely linear and segmental. According to our proposal, MC is respected in CEP when a phonetic chain contains a minimum of three segments, regardless of syllable weight and syllabic extension.


Introduction
A satisfactory definition of the word, as it is well-known, is one of the main difficulties for linguists. A substantial part of such difficulty lies, first of all, on the lack of universal, objective, formal criteria that make it possible to identify this concept in an unambiguous way and to segment larger linguistic structures in units unanimously recognized as "words". In addition, we lack further criteria allowing an intrinsic distinction and characterization, based solely on linguistic arguments, of the various types of meaning-bearer units, like affixes, clitics, clitic groups, among others. Different approaches, comprehending typologically different languages, have discussed this paradox: why an intuitive, basic unit such as the word proves to be so hard to define in purely technical terms? Examples of such reflections on this issue, some of which focusing on this very contradiction between the "easiness" of the self-intuitive notions of the word and the difficulties of a technical explicitation of this concept, may be found in Jones (1931), Krámský (1969), Juilland;Roceric (1972), Halle (1973), Aronoff (1976), Bisol (2000;, Julien (2006), , Veloso (2009;2010;2016), Haspelmath (2011;2012a,b), Ferreira (2012), Villalva (2012), Mendes (2013), Ulrich (2013), Elordieta (2014), Ibarretxe-Antuñano; Mendívil-Giró (2014) and others.
Among the requirements commonly mentioned as a fundamental criterion for a satisfactory definition of the word, the Minimality Condition (MC) is found very often: in a given language, a sound chain will be a good candidate to be recognized as a word if and if only, together with other properties, it contains a minimum amount of phonological material. If below such limit, no units can be admitted as words of that language (see, for a review of these arguments, McCARTHY;PRINCE, 1995). MC is the central issue of the present study.
In some previous analyses of the same problem (see again, e.g., McCARTHY; PRINCE, 1995), it has been argued that MC is not a universal property: some languages could not include any minimality constraints in their phonologies, whereas crosslinguistic MC parametrization may vary importantly from grammar to grammar.
As far as Portuguese is concerned, two positions are assumed by the literature: on the one hand, some authors postulate the ineffectiveness of any minimality constraint in this language; oppositely, other analyses advocate MC as a mandatory constraint in the phonology of Portuguese: 1 In languages like Portuguese (either in its European or Brazilian varieties), the occurrence of words like pé 'foot', mi 'mi' or nu 'naked' has led several authors -e.g., Bisol (2000), Vigário (2003) -to the conclusion that such constraint [Minimal Word Constraint] is inoperative in Portuguese. However, Vigário (2003: 159) (2004) to argue that Portuguese, in fact, admits minimality constraints, even though a small amount of words can violate them.
(…………………….) [Our own] results show that the proportion of monomoraic/monosyllabic words in relation to the other word formats is higher than [previously] demonstrated […]. The effective usage of monomoraic/monosyllabic forms suggests, thus, that the prosodic word in (European) Portuguese is not conditioned by minimal extent constraints." (VIGÁRIO; MARTINS; FROTA, 2005, p.903; our translation, our italics) In our study, it is our aim to gather evidence that could lead us to a better evaluation of two specific, inter-related questions: is MC a functional constraint in Portuguese, and, if so, how is it parametrized in this language?
Our analysis will be limited to European Portuguese (EP) 2 and follow a methodology previously adopted: the oralization of abbreviations as initialisms or acronyms.
Throughout the paper, the terms "abbreviation" and "abbreviated form" will be used to make reference to the process and to the result of a specific word-formation routine which converts special phonetic chains into "new" lexical words by means of one of the following ways 3 (which are not the only abbreviation procedures available in Portuguese): 4 (i) acronymy: phonetization of a longer expression into a shorter one (="acronym") through the combination of morphologically unmotivated subsegments of the several words found in the original, larger expression; this gives rise to a phonetic chain similar to a phonological word (ω) of the language, since it respects its phonotactic rules and is in accordance with all phonological, morphological, syntactic and semantic conditions (stress, lexical class, thematic class, inflection, gender, meaning, etc.). Example: Organização das Nações Unidas 'United Nations Organization'  "ONU"=["nu] ω _ Noun_Feminine ; (ii) initialization: phonetization of a longer expression into a shorter one (="initialism") through the spelling of the initial letters of each lexical word 2 For this reason, only initialisms and acronyms that are current in the European variety of Portuguese will be considered here; accordingly, all phonetic transcriptions will take the standard norm of European Portuguese as the pattern. 3 Bearing in mind that abbreviations are functionally equivalent to the longer expressions they replace and the fact that they absorb the essential grammatical properties of morphosyntactic words (see examples and analyses in the text), we assume them here as true entries of the speakers' mental lexicon. In fact, most of them are largely shared by the speakers' community and acquire a stable meaning (very often, they are even admitted by lexicographers as entries of current dictionaries). Their lexical representation, though, is inevitably subject to the natural variability that is found when individual lexicons are compared. For a discussion of this topic, see, among others. (CORREIA; LEMOS, 2005). 4 See, for instance, and limiting the literature on this subject to the studies referring to European Portuguese exclusively, Correia e Lemos (2005), Villalva (2008) and Pereira (2013), who offer more detailed descriptions of abbreviation (and other non-concatenative) processes in Portuguese and discuss their role as word-formation -hence, as truly morphological -processes in this language. contained in the original, longer expression, so that a prosodic word group (VIGÁRIO; FERNANDES-SVARTMAN, 2010) is formed, behaving as a sole morphosyntactic word though (one sole gender, one sole lexical class, etc. In the second part of our study, an empirical exploration will be presented. We shall then observe the linguistic behaviour of a group of native speakers of EP when confronted with this kind of specific chains and asked to treat them as "words" of their language. Subjects will be given a list of written representations 5 , introduced as abbreviated forms and embedded in syntactically invariant sentences. They will be asked to read that list. Therefore, we shall try to find cues giving us relevant information about the linguistic processing of this kind of stimuli and, simultaneously, about the obligatory conditions of the language's words. More specifically, it is our purpose to check the eventual functionality of MC in the implicit phonological knowledge of EP native speakers/listeners. In brief, this way to investigate this specific issue coincides with Plénat's (1993) methodology to study the same question with French speakers.
Based on these preliminary questions, our article will be divided as follows: -in section 2, a review of some basic assumptions related to MC and other phonological constraints, either at a general level or focusing on Portuguese, will be made; -in section 3, we shall discuss the relevance of the methodological option that will be followed in our empirical study -the oralization of abbreviations -as a valid means to evaluate MC and other phonological constraints as mandatory requirements for the wordiness in Portuguese; -the empirical study, with data and subjects of Contemporary EP, will be presented in section 4; -section 5 will present the final observations of the study.

Phonological constraints and wordiness
As said in the Introduction, an exhaustive inventory of all linguistic criteria leading to a clear definition, identification and delimitation of words as linguistic units, even for inflectional languages like Portuguese, has challenged and motivated different interpretations.
Studies such as the aforementioned have attempted to identify the wordiness conditions (ARONOFF; FUDEMAN, 2005, p. [36][37][38] which make it possible, at 5 As explicitly admitted by authors such the researchers mentioned in note 4, in languages with an alphabetic, conventional system of writing -as it is the case of Portuguese -the abbreviation processes are regularly grounded on the written forms of the larger words or expressions that are the input for abbreviation. least partially, to assign or to refuse the status of "word" to given phonetic chains in a given language. Wordiness conditions, in fact, are shared by the different grammar modules (namely, phonology, morphology and syntax). In the present study, however, as said before, we shall not deal with all these wordiness dimensions: we shall focus, deliberately and exclusively, on Minimality Condition.

Minimality Condition: general formulation
MC is the phonological constraint that, interacting with others, imposing a minimum amount of phonological material so that a phonetic chain may be admitted as a word in a given language. Although its parametrization varies cross-linguistically, we assume, as a general formulation of MC potentially valid for a great number of languages of the world, a formulation like (1), 6 based on McCarthy and Prince (1995, p.321-322). 7 (1). Generic formulation of the MINIMALITY CONDITION (ap. McCARTHY; PRINCE, 1995, p.321-322) (1a). In languages WITH quantitative distinctions: the minimal word must contain at least one heavy syllable. (1b). In languages WITHOUT quantitative distinctions: the minimal word must contain at least two syllables.

The Minimality Condition in Portuguese
We shall now enter the discussion of the specific issue of the relevance of MC in EP. Since Portuguese does not admit quantitative distinctions, we should expect, according to McCARTHY; PRINCE's (1995) postulate referred to in (1) (see (1b)), that all words of Portuguese would correspond, at least, to two syllables.
Contemporary EP lexicon includes, though, a non-neglectable amount of monosyllabic words. In addition to clitics, 8 EP has many monosyllabic lexical words, as shown in Figure 1. Words in this list are categorized in light monosyllables (non- 6 This formulation -taken here and among many other authors as generic -is not absolutely consensual. Vigário; Martins; Frota (2005), e.g., can be counted among the authors who discuss it very critically. 7 "In quantity-sensitive languages, which distinguish syllable weight, the minimal word is bimoraic; in quantityinsensitive languages, all syllables are presumptively monomoraic, and so the minimal word is disyllabic." (McCARTHY; PRINCE, 1995, p.321-322). 8 The special status of clitics as real "words" has been largely discussed by previous literature, both with respect to Portuguese and to other languages as well. See, for example: Vigário (1998;2003; branched rhyme) and heavy monosyllables (branched rhyme, according to the several prosodic and morphological conditions explicated in the right-foremost columns of Figure 1); the reasons for this categorization will be made more clear later on. Branched nucleus + Segmentally filled dois 'two' pois 'then' Source: author's elaboration.
At first sight, these examples could lead us to conclude that EP would belong to the set of languages for which MC, as formulated by McCarthy and Prince (1995) (see (1)), does not play any role as a phonotactic constraint or as a wordiness condition. In fact, according to such argument, if EP does not admit quantity-based vowel oppositions, its phonology should not allow any phonetic chains with less than two syllables. This is the main reason for several authors to refute the relevance of MC in Portuguese, as summarized in the aforementioned quotation from Vigário, Martins and Frota (2005). 10 These conflicting, contradicting phonological interpretations feed the discussion that will be developed in the following sections of this paper. 9 OBS.: 1. This table includes as "words" inflected forms of verbs and nouns. 2. In the examples of segmentally filed codas, we excluded all possible cases of inflected forms with a non-lexical final /S/. 3. VG diphthongs were assumed as branched nuclei. 4. In the present study, we deliberately exclude nuclei with phonetically nasalized vowels. 5. This table does not include forms traditionally described or classified as "clitics". 10 Other interpretations, referred to by the same study that is quoted here, see the scarcity of this kind of words -seen as exceptional, marked cases, then -as an argument to go on accepting MC as a wordiness condition in Portuguese. This is the explanation found in Booij (2004), as cited by Vigário, Martins and Frota (2005) (see quotation in Introduction).

Syllable weight and wordiness in Portuguese
In our opinion, a complete evaluation of the effectiveness of MC in the phonology of Portuguese requires a deeper examination of dimensions that cannot be reduced to the minimum number of syllables per word. Thus, we shall pay attention, in the next paragraphs, to the specific question of syllable weight and its relation to wordiness in Portuguese, bearing in mind the connection between such property and MC, according to the formulation of this constraint as found in (1) and McCarthy and Prince's (1995) thoughts on the distinction between the language types under consideration there. 11 The importance of syllable weight in Portuguese will be examined here based on several arguments related to two different phonological phenomena: stress-assignment and historical evidence attested in Medieval Portuguese (MP), that is to say, the cases of Latin etyma that almost invariably originated heavy monosyllables.

Syllable weight and word stress
Traditionally, phonological descriptions of EP consider syllable weight as a neglectable factor in domains where it is accepted to operate quite systematically as far as other languages are concerned, as it is the case of stress-assignment. Studies such as Pereira's (1999), Roca's (1999), Mateus and D'Andrade (2000) and Mateus et al.'s (2003), among others, describe word stress in Portuguese as exclusively governed by morphological conditioning: nouns and verbs follow different stress patterns; within each word class, stress-assignment is sensitive to particular morphemes or morphological/ morphosyntactic combinations. As we said before, this is the most consensual viewpoint regarding word-stress assignment among phonologists dealing with EP.
It is worthy to confront this interpretation, though, with some arguments provided by alternative explanations that do not exclude so categorically the interference of syllable weight on word-stress assignment. Brandão de Carvalho (1988;1989;2011), focusing on European Portuguese, as well as Wetzels (2007), considering the Brazilian varieties of the language, are among the authors who emphasize the inexistence of words stressed on the antepenultimate with a heavy syllable on the penultimate position, in addition to the statistical predominance of last-syllable-accented words with a heavy final syllable, as two key arguments in favour of the phonological conditioning of word stress in Portuguese and its sensitivity to syllable weight, mainly in the class of nouns and adjectives. This makes it a little more difficult to categorize Portuguese within the set of languages whose phonology does not admit the importance of syllable weight, a very pertinent criterion for the typological distinction proposed by McCarthy and Prince (1995) in (1). It should also be highlighted here, in accordance with Wetzels (2007), that quantity-based vocalic oppositions are not the sole evidence of syllable 11 See quotation in note 7.
weight effectiveness in the phonology of any language, contrarily to what could be inferred from a quicker interpretation of McCarthy and Prince (1995).

Syllable weight and heavy-monosyllable words in Medieval Portuguese
Our second argument in favour of the phonological importance of syllable weight in Portuguese stems from language diachrony. In Medieval Portuguese (MP), a nonneglectable number of words consisting of one sole heavy syllable is found. Tables  2 and 3 show us a few examples of such type of words. Quite interestingly, we could notice that many of such words descend from Latin monosyllabic etyma (see Table  2). In our view, however, a much greater interest comes from the observation of the cases found in Table 3, where the Portuguese heavy monosyllable (=monosyllabic word) is not explainable through etymology, as these words cannot be viewed as the historical inheritance of Latin monosyllables. Instead, they derive from other prosodic word formats; what is more, their formation in Medieval Portuguese preserves -in some cases, it even introduces -certain phonological structures that were generally deleted in regular historical changes from Latin to Portuguese. This is the case of heavy monosyllabic words which preserved final nasals (lost in Vulgar Latin itself) -e.g., matre>mãi 'mother', fine>fim 'end') -or an intervocalic lateral alveolar, as illustrated by examples such sale>sal 'salt', sole>sol 'sun'). A plausible explanation for these exceptional cases of marked structure preservation could be the need for Portuguese to comply prosodically with MC prohibiting the existence of words containing less than one heavy syllable. This could allow us to postulate a MC constraint for MP such as found in (2); accordingly, we should accept that constraint (1a) would be available and mandatory at this early stage of Portuguese (the same postulate would allow us to reject the existence of quantity-based distinctions in Medieval Portuguese, too).

Other phonotactic constraints related to wordiness requirements in Portuguese
In previous sections, CM and syllable weight were given particular emphasis in our discussion of wordiness conditions of EP, in view of the special relation of such constraints to the central questions of this study.
Nonetheless, as many other studies on the same subject have highlighted (see, a.o., ELORDIETA, 2014, VELOSO, 2016), other phonological constraints apply at the word-level. These will be the focus of the following sections of our paper, emphasizing those that are more relevant for our main purpose.
In (3), we give a short summary of the main phonological constraints that, in languages like Portuguese, a morphological word has to comply with, mandatorily: (3). Phonological constraints mandatorily observed by any phonetic chain admittable as morphological words in Portuguese.
(3a). One and one only primary stress, in full compliance with the stress-assignment prosodic rules (with the notable exception of Clitics).
(3b). Strict accordance with the phonotactic rules of the language and with the Structure Preservation Principle (KIPARSKY, 1985).
As for the phonological constraint found in (3a), let us recall that stress-assignment is one of the major phonological processes that take the word as a domain of application; therefore, it contributes for the identification/delimitation of this linguistic unit AND for the distinction between phonological words (bearing one, and not more than one only, primary stress), clitic words (inherently unstressed) and prosodically more complex, longer units, such as the clitic group (comprehending more than one single morphological word, but bearing just one single primary stress, according to Vigário (1998;2003;; 2010), Bisol (2000;, Veloso (2012; and others, as far as Portuguese is concerned) or the prosodic word group (grouping different phonological words and clitics in one single, complex, longer prosodic unit, according, for instance, to VIGÁRIO; FERNANDES-SVARTMAN, 2010).
In relation to the phonotactic rules mentioned in (3b), we highlight here the fact that candidates to wordiness in EP should also obey a specific set of rules governing the selection and combination of segments. Such rules correspond, basically, to the Syllabification Algorithm of Mateus and D'Andrade (2000, p. [60][61][62][63][64]. In short, these rules specify that Portuguese words be in accordance with the basic principles found in (4).

The importance of abbreviations' oralization for the assessment of Minimality Condition
In the following paragraphs, it is our aim to make it explicit why abbreviations' oralization was chosen as a way to assess the importance of MC as a wordiness condition in Portuguese. As said in the Introduction, the same methodological choice was followed by Plénat (1993) in his study about French; Plénat's (1993) methodology is the main basis for the empirical study that will be presented in section 4.
We come back, at this point of our study (see Table 4), to part of the information that was given before regarding the two main possibilities of abbreviations' oralization in Portuguese -initialization and acronymization. Current examples of EP will be also given.
The cases under A in Table 4 disallow the acronymization modality, since all of them contain gross violations of the major phonotactic rules of Portuguese like the ones that are given in (4): for example, if these potential phonetic chains do not seem to include vocoid nuclei, their prosodization as acronyms 12 becomes impossible; this makes it irrelevant to check any application of MC in their eventual oralization. For this reason, cases like those of column A, Table 4, are excluded from further analysis.
Examples under B and C of Table 4 correspond to a distinct case: both situations respect the constraint of vocoid-filled nuclei, and this makes their production as 12 See the arguments that are summarized in note 3 for the acceptance of abbreviated forms as particular cases of phonological words. acronyms possible. The question that arises from a careful comparison between both situations is as follows: if, due to the respect of that constraint, both types of candidates can be oralized as acronyms, how can we explain that this . Evidence has been gathered from data of language acquisition, children's early written production, abbreviation procedures, hypocoristic formation and other modalities of truncation. In this study, attention will be paid only to abbreviation procedures as a way to assess MC in Portuguese.
It should be borne in mind that, in the cases of column C of Table 4, C2 represents a specific subset of examples which, in spite of respecting the constraint of vocalic nuclei (see (4a)), do not agree with other important phonotactic constraints of EP, since they show codas potentially filled by obstruents or by sonorants unallowed in coda position. These are the cases, e.g., of FENPROF, [fEn"p4Of], and REN, ["En]). The formation of this particular type of acronyms has been taken into consideration by previous studies (see, e.g., PEREIRA, 2013) and might be explained, for instance, by the frequent phonetic deletion of final unstressed vowels in contemporary EP. This originates, at the phonetic level, the high frequency of occurrence of this type of "irregular" codas. Such result could be interpreted as a manifestation of their representation in the speakers' phonological knowledge as "regular" syllables" and viewed as a first hint of their possible on-going phonologization in the current stage of the language. Just as in the cases under A in Table 4, these examples will not be considered further on in the present study, since they raise specific questions that do need the prior understanding of the more general, central topics that constitute the central research of our study. On the basis of these assumptions, we hypothesized that speakers, when faced with the task of reading letter sequences presented as abbreviations, could follow one of two possible paths: 14 -Option 1. The speaker recognizes the chain as a current abbreviation of his/her language (as it the case of the abbreviations found in Table 4 for native speakers of CEP). In a way, these abbreviations correspond to lexicalized words; thus, they are processed as any other word pertaining to the speaker's lexical inventory (although, as mentioned before, individual lexical repertoires vary considerably). Therefore, the main processing way activated by the speaker in the oralization of these stimuli is their lexical knowledge (which is conceived of here as the set of linguistic information related to all entries of the individual's "mental lexicon", which includes indications about the phonetic form of such entries); 13 OBS.: 1. All acronyms of this table are often used in EP. So, they are assumed as potential lexical entries of the individual lexicons of a great number of speakers of the language. 2. Phonetic transcriptions (in this table and in all passages of our text) assume that each initialism contains more than one prominence; since the prosodic status of these units is not the main focus of our study, we do not make any distinction, within each initialism, between primary-and secondary-stress bearing syllables. 3. Each acronym is assumed as one sole prosodic word (bearing, thus, just one main stress). 14 See Abreu (2009, p.48 ss.) for a broader, more detailed discussion on the various psycholinguistic mechanism that are activated by the processing of written abbreviations.
-Option 2. In the case of an unknown abbreviation, the speaker must activate other resources of his/her implicit linguistic knowledge, namely his/her phonological knowledge (PhK). Excluding the cases of column A, Table 4 -which would never be oralized as acronyms, for the reasons that were explained above -, we assume those abbreviations could be read either as acronyms OR as initialisms as the result of the activation of constraints such as MC, plausibly one of the main responsible factors, in the speakers' PhK, for the wordiness evaluation of any given phonetic chain.
Option 2 admits that the competing oralization procedures (acronymization vs. initialization) are not selected at random. Following Plénat's (1993) proposal for French, it is admitted that minimality is the factor that determines the choice of either strategy. This will be the basic assumption for our empirical research, even though other phonological variables are not excluded. 15 This assumption seems to be specially enhanced if we compare, for instance, one of the pairs found in Table 4. Indeed, if we compare the current oralization of "BE" (σ L ) as an initialism (=["be"E]) and the oralization of an abbreviation which differs from this one in just one segment and in syllable weight -"BES" (σ H ), oralized as an acronym (=["bES]) -, we could infer that syllable weight is, indeed, the key factor eliciting the Portuguese speakers' preference for the acronymization (i.e., for the oralization of abbreviations as phonological words: these will emerge only with abbreviations with a syllable weight equaling σ H ). This inference would lead us to accept constraint (2) (postulated for Medieval Portuguese) as still operational in CEP, too, somehow contradicting McCarthy and Prince (1995)'s constraint (1), which limits the importance of syllable weight to quantity-sensitive languages.
All these questions will be assessed by the empirical study that will be presented in the next section.

Empirical study Preliminary remarks
In order to understand the linguistic processing of phonetic chains candidate to be treated as words and to gather evidence regarding the role of MC in Portuguese, we developed the empirical study that will be presented in this section. A group of monolingual speakers of CEP was asked to read a list of 88 abbreviations specifically prepared for this experiment. As far as we could know, these abbreviations were not previously existent -they were assumed thus as unavailable to the subjects' linguistic knowledge. We will refer to them as "pseudoabbreviations". The use previously 15 Other methodologies could have been followed, of course. Abreu and Rosa (2006) e Abreu (2009), e.g., in their studies of the linguistic processing of this kind of stimuli by native speakers of Brazilian Portuguese, take into consideration data obtained from grammaticality judgments and from latency times of stimuli reading. Such studies conclude that variables such as frequency and familiarity also play a role on the processing of written abbreviations. unattested stimuli was chosen in order to cancel the interference of any level of lexical knowledge (that could comprehend any information regarding the phonological representation of such forms, acceptable as lexical entries, comparable to the ones which are found in Table 4). That is to say, it was our intention to make the subjects follow mandatorily the processing path previewed by Option 2 (see above), i.e., we wanted to activate the subjects' phonological knowledge as their only means of processing these stimuli. Only the isolation of this processing strategy could lead us to clear information about the role of MC and other phonological constraints on the phonological representation of wordiness conditions among Portuguese speakers.

Hypotheses
The three following hypotheses were formulated. (5). Experimental Hypotheses: Hypothesis A: Monosyllables corresponding to a heavy syllable (and respecting the language's phonotactic rules), when presented as written abbreviations, will be oralized as ACRONYMS. Hypothesis B: Monosyllables corresponding to a light syllable, when presented as written abbreviations, will be oralized as INITIALISMS. Hypothesis C: Disyllables and trisyllables, when presented as written abbreviations, will be oralized as ACRONYMS.
Hypotheses A and B, related to the processing of monosyllabic stimuli only, are motivated by our previous analysis of data shown in Table 4 (see section 3).
As for Hypothesis C, dealing with the processing of stimuli with 2 or more syllables, its motivation is found in de McCarthy and Prince's (1995) Minimality Condition (see (1b)).

Method Participants
Subjects were 12 adult (8 women, 4 men), monolingual, native speakers of CEP, mostly speakers of the Northern dialects of this language. All of them are undergraduate students at the University of Porto. Their participation was free. None was previously informed of the specific purposes or questions of the study. Participants' age ranged from 16 to 60 years old (mean=38 years; SD=16,06 years).

Linguistic material
Pseudoabbreviations were embedded in a set of 130 sentences bearing a syntactic and lexical invariant structure: the only spot that varied was where abbreviations were inserted. In 88 sentences, such abbreviations corresponded to the aforementioned pseudoabbreviations. The remaining 42 sentences included abbreviations assumed as common in everyday language for a native speaker of European Portuguese. In all sentences, grammatical gender was attributed to the abbreviated form: in the case of current abbreviations, the current grammatical gender was maintained in the corpus. As abbreviations and pseudoabbreviatons were always embedded as the nucleus of a noun phrase, gender agreement was overtly given by the definite article ("masculine"/"feminine") determining the abbreviation.
The structure of the carrier sentence of our stimuli is given in (6). (6) (x=definite article, singular, masculine or feminine) [Z]=word or expression that, in certain sentences, was introduced for the sake of semantic plausibility) 16 The complete list that subjects were asked to read is found in (7). The order in which stimuli appear in this list corresponds to the order of presentation of all sentences to each participant; it was randomly obtained during the preparation of the linguistic material.
Including abbreviations and pseudoabbreviations in the stimuli list was meant to create a distractor effect so that participants were induced to oralize each abbreviation/ pseudoabbreviation either as an initialism or as an acronym according to their own lexical knowledge (in the case of abbreviations) or to their implicit phonological knowledge (in the case of pseudoabbreviations). That is to say, we aimed at eliciting more than one single path in the processing of the stimuli, assuming that participants would choose the most adequate way of processing each stimulus according to its linguistic status.
Thus, each participant was asked to read a set of 130 sentences; 88 sentences contained written chains corresponding to pseudoabbreviations. 42 sentences contained abbreviations which are current among speakers of Portuguese -assumed as probably 16 Example: "O Presidente da Comissão da CREL demitiu-se." previously known by most of the participants, then 17 . These current abbreviationscontaining cases of sequences in accordance with the phonotactic rules of the language and cases that violate such rules -comprehend 23 current initialisms and 19 current acronyms.
As for the 88 pseudoabbreviations found in the corpus, all respected the phonotactic rules of Portuguese as formulated in (3) and (4). That is to say, all pseudoabbreviations of the stimuli list -on which our analysis will be exclusively focused -correspond to the cases under B and C1 of Table 4. The more debatable cases of A and C2 of Table  4 were then excluded from the present study.
In our analysis, we will pay attention to the results regarding the oralization of pseudoabbreviations only. In (7), such pseudoabbreviations are found in the underlined written sentences; it should be borne in mind, however, that the materials given to all participants during the experiment did not in any manner emphasise or differentiate pseudoabbreviations from supposedly real abbreviations. All instructions and materials were absolutely clueless regarding this matter. 17 In fact, it is rather difficult to determine which current abbreviations do or do not belong to the specific individual lexicon of an individual speaker, due to the aforementioned variability among individual lexicons. Similarly, we cannot exclude categorically that some of the pseudoabbreviations that were coined on purpose for our study did not belong to the individual lexicon of some of our participants: for instance, they could correspond to the current designation of small, local companies or institutions eventually familiar to some or few of our subjects. For the list of (7), our own linguistic intuition and knowledge was strongly taken into consideration. In addition, for each (real) abbreviation that was assumed as current in Portuguese, we looked for at least one occurrence in any printed materials, in the internet and/or in general dictionaries of the language. The pseudoabbreviations were explicitly forged on the basis of the phonological variables that we wanted to test; their inoccurrence in the same sources (general press, internet, dictionaries) was also controlled. The 88 pseudoabbreviations contained in the corpus are prosodically controlled: monosyllables for syllable weight; polysyllables for syllable type of initial syllable and syllabic extension (number of syllables). As for syllable type, all unmarked syllable formats of Portuguese were included, either as the only syllable of monosyllabic pseudoabbreviations or as the initial syllable of polysyllabic pseudoabbreviations. In these positions, nasal vowels were excluded. As for syllabic extension, two sets of pseudoabbreviations were created, so that we could obtain results with monosyllables and results with di-and trisyllables, in order to test the validity of (1b) for EP. All stimuli whose written form could potentially elicit the production of one single vocalic nucleus (≅ one single written vowel) were assumed here as monosyllabic; stimuli whose written forms contained more than one potential vocalic nucleus (≅ two or more written vowels) were assumed as polysyllabic.
Therefore, we aimed at gathering information on the roles of syllable weight (see Hypotheses A and B) and syllabic extension (see Hypothesis C) on the definition of wordiness conditions in Portuguese.
Crossing these 3 variables throughout our corpus, the 88 pseudoabbreviations can be categorized as it is done in Table 5.

Method
Each subject was tested individually in a silent room at the Faculty of Arts of the University of Porto (Portugal).
All sentences were presented in written form and displayed on A4 sheets, printed out in Times New Roman, 14 pt. type, double-spaced. Each abbreviation was capitalized, without any signs separating the constituting letters. Sentences were sequentially numbered, from 1 to 130, with Arabic numerals.
Each subject was asked to read each sentence as naturally as s/he could. All were previously informed that all sentences had the same structure; this structure was explicitly explained beforehand. In the pre-test session, each participant was also told that sentences would vary in just one point of the word-chain, after "Presidente de" ('president of'). They were then informed that, after this sequence, they would be shown an "abbreviation". The instruction given at this point was as follows: if they recognized such abbreviation, they should read it according to the usual form they pronounced it. They were also told that many abbreviations could be unknown, since they designated small companies or institutions which were not very well-known. This precision was made in order to prepare the participants for the oralization of pseudoabbreviations. 19 (*): C=Consonant (contoid); V=Vowel (vocoid); G=Glide; S=Coronal Fricative (corresponding to written <S>).
They were asked to read such abbreviated forms so that they sounded "more natural" to them. In the beginning of each session, participants were given a brief explanation of the two possible ways of "reading" an abbreviated form (either as an initialism or as an acronym, without using this explicit terminology). Participants were allowed to make pauses during their reading sessions whenever they felt like it. Each individual session lasted about 15 minutes, approximately. Participants' productions of abbreviated forms were immediately transcribed in IPA by the experimenter as the reading session was going on.

Results
Table 1 displays the results for each syllable type, both with monosyllabic and polysyllabic pseudoabbreviations.
Each cell shows the mean number (and standard deviation) of the oralization of the 88 pseudoabbreviations, either as initialisms or as acronyms. Values were obtained from the 1056 recorded productions (=88 pseudoabbreviations X 12 participants). Both types of response (initialisms vs. acronyms) are statistically compared separately for monosyllabic and di-/trisyllabic stimuli. Results are restricted to the cases for which it was possible to obtain a statistical comparison on the basis of a T-Student test. As for the variables syllable type and syllable weight of disyllables and trisyllables, results regard first syllable only. -by the other hand, CCV monosyllables (light syllables, too) were very often oralized as acronyms, contrarily to was previewed by Hypothesis B, although the difference between both responses was not significant; -finally, it should be noticed that the only cases where a comparison between both types of response (initialism vs. acronym) is not significant are found in light syllables and with VGS-syllables: with other syllable types (=σ H ), results always show a clear statistical preference towards either of response.
Such preference, for monosyllabic stimuli only, is as follows: -with (heavy) syllables with the structure VC, VG and VGS (for the former, without a statistically significant difference between the two types of response), initialisms are statistically preferred over acronyms (contrarily to Hypothesis A); -with syllables CVC, CVG, CVGS, CCVC, CCVG and CCVGS (also heavy syllables), acronyms are statistically predominant; contrarily to results obtained with VC-, VG-and VGS-syllables, these results could be interpreted as a confirmation of Hypothesis A.
Finally, it should be borne in mind that, within monosyllabic stimuli, the syllable type that reaches the higher score of oralizations as acronyms is VC, a heavy syllable; let us remind here that, according to Hypothesis A, acronymization was more expectable than initialization in these cases.

Disyllabic and trisyllabic stimuli
As far as di-and trisyllabic stimuli are concerned, the predominant strategy that is followed by most participants (see Table 1) is acronymization. This seems to confirm Hypothesis C. For each syllable type, and just looking at the first syllable of each stimulus (σ 1 ), the mean number of acronymizations is always statistically higher than the number of initializations (with one single exception: the disyllable where {σ 1 =VG}, which attains the highest scores of acronymization, although the difference between this kind of oralization and initialism-type is not statistically significant). ***** In our view, these results need further examination: more data, from a larger number of participants, would give us a clearer picture of the several, interacting variables.
Notwithstanding, a few general comments can be drawn. First, on the one hand, we could conclude that syllable weight effect is not entirely clear for determining which candidates are better or worse to be accepted as words in European Portuguese: heavy monosyllables are very often oralized as acronyms (see the results for CV and VG monosyllables, for instance), and this is not absolutely clear for a definite confirmation or refutation of Hypothesis A. On the other hand, light monosyllables were not systematically produced as initialisms, contrarily to the initial assumptions of Hypothesis B: with CV stimuli, indeed, initialisms are predominant, but this is not the case for CCV (mostly produced as acronyms). In these stimuli, comparisons between both types of response were not statistically significant either.
If we confront these results with our initial hypotheses, the only hypothesis that seems minimally confirmed is Hypothesis C: when processing stimuli with two or more syllables, participants show a clear preference towards acronymization in detriment of initialisms.
This preliminary observation -not allowing a clear refutation or confirmation of Hypotheses A and C, but supporting, at least in a limited way, the acceptance of Hypothesis C -could be seen as empiric evidence in favour of McCarthy and Prince's (1995) proposal as found in (1b): in languages without quantitative oppositions (as it is the case of CEP), Minimality Condition obeys the word's syllabic extension, not syllable weight.
However, a deeper analysis of this rough conclusion brings us a few more supplementary arguments to be taken into consideration for us to understand it better how monosyllabic stimuli are processed by CEP native speakers. These arguments challenge in some ways the state of the art as well as the experimental hypotheses that were explored earlier in this paper. The most striking finding that such analysis provides us with has to do with the stimuli extension, not in terms of syllables, but in terms of segments.
Indeed, there seems to be a highly interesting, suggestive regularity regarding the possible effect of a Minimality Condition of any potential word in European Portuguese -be that minimality be measured, though, in number of segments, not in syllable weight or number of syllables. In our view, such regularity could explain, quite satisfactorily, why monosyllabic pseudoabbreviations are systematically oralized according to one single type of production. Native speakers of CEP process phonetic chains with three or more segments preferably as acronyms, and process phonetic chains with less than three segments preferably as initialisms -regardless of such stimuli being mono-or polysyllabic, and regardless of syllable weight, too. If this interpretation proves true, this is the explanation why stimuli like, e.g., "BLO" and "PRU", are preferably oralized as acronyms ([blO], [p4u]) and seldom as initialisms. This would lead us to postulate a linear MC constraint in CEP, contrary to Hypothesis B. The same explanation would apply to the results obtained with stimuli such as "AU" and "EI". Although they correspond to heavy monosyllables, they were mainly oralized as initialisms ([a"u], [E"i]), not as acronyms, contrarily to Hypothesis A. Figure 7, with results of the 40 monosyllabic pseudoabbreviations only, illustrates how the score of acronymizations increases as the stimuli extension measured in number of segments also gets higher: stimuli with just two segments (CV, VC, VG) show an average number of acronymizations also lesser than 1,5. Stimuli with 3 or more segments show higher scores of acronymization, even when the only case of light syllable within this set is considered: CCV, in spite of showing a lesser number of acronymizations than other stimuli with 3 or more segments, was processed as an acronym more often than light or heavy monosyllables (formed by 2 segments only) were. We could then postulate a Minimality Condition, which would apply specifically in CEP, clearly grounded on the number of segments of the word, regardless of syllable weight. This constraint is formulated in (8). (8). Minimality Condition in Contemporary European Portuguese CEP: {WMinExtSg ≥3} WMinExtSg=Word Minimal Extension, measured in number of segments This constraint, in our view, would also explain results such as the following: 1 -Disyllables and trisyllables are produced, quite regularly, as acronyms, not as initialisms (as previewed by Hypothesis C and admitted by McCarthy and Prince (1995) Minimality Condition found in (1b)); 2 -Monosyllables with just 2 segments are produced as initialisms, not acronyms, regardless of syllable weight (see Table 1 and Figure 7); 3 -Monosyllables with 3 or more segments are mainly produced as acronyms, not as initialisms, also regardless of syllable weight (see again Table 1 and Figure 7).
We admit that a formulation like (8) does not explain the presence of Portuguese words that are formed just by a light monosyllable (see examples at 1 st column of Table  1). In fact, it remains unanswered how a constraint such (8) is violated by this set of words of the language, even though it consists of a not very large set. We cannot ignore, first, that the answer for this question, very crucial for the phonological description of Portuguese, cannot be categorically given by a study like this one and on the sole basis of the oralization of abbreviated forms. Notwithstanding, we will try to gather some possible explanations for this interrogation.
First of all, it should be reminded that these words form -as it is explicitly acknowledged by several authors who were previously mentioned -a very small subset of the Portuguese lexicon. In an oversimplified manner, they could be presented as exceptions to MC in Portuguese, in line with interpretations such as Vigário's (2003) and Booij's (2004), for instance.
Therefore, we cannot exclude completely that such monosyllables preserve, in their underlying forms, an abstract structure that maintains their status as heavy syllables: for example, pé 'foot'=/pE/ σH , má 'bad_FEM'=/ma/ σH , their syllable heaviness (σ H ) being kept by higher degrees of vowel sonority, according to Brandão de Carvalho (2011, p.59). In some cases, we can even admit the presence, at the maximally abstract level of phonological representation, of 3 phonological segments (nu 'naked'=/nuu/, dó 'sorrow, mourning'=/dOO/). This even becomes segmentally very transparent in morphological derivations of these stems, in cases like desnudo 'naked [poetic]', dolorido 'consumed by sorrow', etc. (accordingly, then, with (8)). Such explanation would render these words more compatible with the various MC versions that were analyzed in this study, cancelling the need for ad hoc explanations strongly based on the exceptionality of those forms as the only explanation for their insertion in the Portuguese lexicon (even if we cannot ignore that these cases do correspond to very particular cases and further investigation would be necessary to reach more robust conclusions about them).

Concluding remarks
Based on our data and our discussion, and not ignoring the limitations of our study, we think it possible to draw some final observations that can shed some light on the initial questions and hypotheses of our research.
The main conclusion is that it seems indeed possible to advocate the pertinence of a Minimality Condition in Contemporary European Portuguese, mandatorily obeyed by the gross majority of lexical entries of this language.
Contrarily to the general understanding about the MC parametrization in Portuguese and other languages, though, CEP's MC does not depend on syllable weight or syllabic extension: according to our data, CEP's MC seems to be peculiarly conditioned by one single parameter, different and independent from those -the number of segments of the word. We propose, in this paper, that phonetic chains with 3 or more segments are good candidates to wordiness in CEP, regardless of their syllable weight and syllabic extension. The participants' behavior in the experimental task they were confronted with showed a consistent tendency, statistically probable, confirming this proposal. In several moments of our paper, we acknowledged that this MC formulation does not comply with most previous versions of minimality for Portuguese and other languages, precisely because of the apparent cancelling of syllable weight and/or syllabic extension, which are generally assumed as the most important variables related to wordiness and minimality (McCARTHY; PRINCE, 1995 -see (1)).
A second conclusion that can be drawn from our study is that MC, as any other phonological property of a given language, is subject to historic variation. The MC that is proposed here for CEP (|CEP: {WMinExtSg ≥3}| -see (8)) is not the same as the MC that was eventually productive in Medieval Portuguese (|MP: {ω≥σ H }| -see (2)), much more sensitive to syllable weight.
We emphasize here that the apparent lack of importance of syllable weight as a crucial criterion of MC in CEP does not equate its elimination from the set of phonological constraints of this language. The conclusion that, at the current stage of the language, syllable weight does not play a decisive role for the parametrization of MC has to do with data that show that monosyllables with at least 3 segments, be they light (with a branched onset but an unbranched rhyme, as in "BLO", "PRI" or "CLA") or heavy (with a branched rhyme, as in "EUS" and "OIS"), are preferably oralized as acronyms, contrasting with monosyllables with just 2 segments, also regardless of their syllabic weigh, which are preferably oralized as initialisms. At other levels -namely as for non-verb stress-assignment, for instance -arguments in favour of syllable weight as a productive phonological property seem to be very relevant, as we have briefly seen in 2.2.1 and according to the descriptions of Brandão de Carvalho (1988;1989;2011) or Wetzels (2007). This specific subject, however, was not addressed as a central question of this study.
Four questions remain unanswered in this study, in need of future research: -the processing of pseudoabbreviations phonologically similar to clitics; -the specific case of words corresponding to light monosyllables formed by 2 segments only (underneath, thus, the MC in terms of syllable weight, syllabic extension or number of segments); -the oralization of phonetic chains that respect important phonotactic constraints of the language but violate others, namely the prohibition of segments in principle unadmitted in coda position (e.g.: FENPROF, REN; see examples C2 in column C, Table 4); -the oralization of pseudoabbreviations potentially ending in a nasal segment, that were deliberately excluded from the scope of our study, as previously explained. VELOSO