Clinical signs of pneumonia in children: association with and prediction of diagnosis by fuzzy sets theory

Pereira, J.C.R.; Tonelli, P.A.; Barros, L.C.; Ortega, N.R.S.

doi:10.1590/S0100-879X2004000500012

Abstract

The present study compares the performance of stochastic and fuzzy models for the analysis of the relationship between clinical signs and diagnosis. Data obtained for 153 children concerning diagnosis (pneumonia, other non-pneumonia diseases, absence of disease) and seven clinical signs were divided into two samples, one for analysis and other for validation. The former was used to derive relations by multi-discriminant analysis (MDA) and by fuzzy max-min compositions (fuzzy), and the latter was used to assess the predictions drawn from each type of relation. MDA and fuzzy were closely similar in terms of prediction, with correct allocation of 75.7 to 78.3% of patients in the validation sample, and displaying only a single instance of disagreement: a patient with low level of toxemia was mistaken as not diseased by MDA and correctly taken as somehow ill by fuzzy. Concerning relations, each method provided different information, each revealing different aspects of the relations between clinical signs and diagnoses. Both methods agreed on pointing X-ray, dyspnea, and auscultation as better related with pneumonia, but only fuzzy was able to detect relations of heart rate, body temperature, toxemia and respiratory rate with pneumonia. Moreover, only fuzzy was able to detect a relationship between heart rate and absence of disease, which allowed the detection of six malnourished children whose diagnoses as healthy are, indeed, disputable. The conclusion is that even though fuzzy sets theory might not improve prediction, it certainly does enhance clinical knowledge since it detects relationships not visible to stochastic models.

Epidemiologic methods; Stochastic models; Fuzzy models; Clinical signs; Diagnosis; Data analysis

Braz J Med Biol Res, May 2004, Volume 37(5) 701-709

Clinical signs of pneumonia in children: association with and prediction of diagnosis by fuzzy sets theory

J.C.R. Pereira¹, P.A. Tonelli³, L.C. Barros⁴ and N.R.S. Ortega²

¹Departamento de Epidemiologia, ²Laboratório de Investigação Médica, Hospital das Clínicas, Faculdade de Saúde Pública, and ³Instituto de Matemática e Estatística, Universidade de São Paulo, São Paulo, SP, Brasil

⁴Instituto de Matemática, Estatística e Computação Científica, Universidade Estadual de Campinas, Campinas, SP, Brasil

References

Correspondence and Footnotes ^{Correspondence and Footnotes} Correspondence and Footnotes

Abstract

The present study compares the performance of stochastic and fuzzy models for the analysis of the relationship between clinical signs and diagnosis. Data obtained for 153 children concerning diagnosis (pneumonia, other non-pneumonia diseases, absence of disease) and seven clinical signs were divided into two samples, one for analysis and other for validation. The former was used to derive relations by multi-discriminant analysis (MDA) and by fuzzy max-min compositions (fuzzy), and the latter was used to assess the predictions drawn from each type of relation. MDA and fuzzy were closely similar in terms of prediction, with correct allocation of 75.7 to 78.3% of patients in the validation sample, and displaying only a single instance of disagreement: a patient with low level of toxemia was mistaken as not diseased by MDA and correctly taken as somehow ill by fuzzy. Concerning relations, each method provided different information, each revealing different aspects of the relations between clinical signs and diagnoses. Both methods agreed on pointing X-ray, dyspnea, and auscultation as better related with pneumonia, but only fuzzy was able to detect relations of heart rate, body temperature, toxemia and respiratory rate with pneumonia. Moreover, only fuzzy was able to detect a relationship between heart rate and absence of disease, which allowed the detection of six malnourished children whose diagnoses as healthy are, indeed, disputable. The conclusion is that even though fuzzy sets theory might not improve prediction, it certainly does enhance clinical knowledge since it detects relationships not visible to stochastic models.

Key words: Epidemiologic methods, Stochastic models, Fuzzy models, Clinical signs, Diagnosis, Data analysis

Introduction

Poincaré, in the preface to his XIX century "Science and Hypothesis", remarked that "the aim of science is not things themselves, as the dogmatists in their simplicity imagine, but the relations between things; outside those relations there is no reality knowable" (1). In the medical sciences, relations among phenomena are mainly studied as associations assessed by a stochastic or deterministic paradigm in order to provide cause and effect relationships. On the basis of Hume's principle of uniformity of nature ("like objects placed in like circumstances will always produce like effects") (2), predictions are made for future situations on the basis of relations drawn from past experience.

Much knowledge has undoubtedly been obtained in the medical sciences using these frames of reference. Nonetheless, it may be proposed that inspecting any given phenomenon from different standpoints should afford additional information and eventually more complete and accurate knowledge. Conversely, restricting choice to specific frames of reference should lead to restrictions in mapping relationships and, thus, ultimately, to restrictions in knowledge. In this respect, Susser (3), when discussing causality, stated that "to choose a frame of reference is to choose a limited set of causal relationships within an ecological system".

The theory of fuzzy sets was introduced by Lotfi A. Zadeh in the 1960's as a means to model the uncertainty that is present in natural language, e.g., expressions like big, small, strong, weak, etc. This was a turning point in what Klir and Yuan (4) called a grand paradigm shift, remarking that "Among the various paradigmatic changes in science and mathematics in this century, one such change concerns the concept of uncertainty. &ldots; According to the traditional view, science should strive for certainty in all its manifestations (precision, specificity, sharpness, consistency, etc.); hence, uncertainty (imprecision, non-specificity, vagueness, inconsistency, etc.) is regarded as unscientific. According to the alternative (or modern) view, uncertainty is considered essential to science."

According to Rouvray (5), the problem of vagueness and rigidity of the fundamental axioms in Aristotelian logic reasoning was probably first discussed by the logician George Boole in 1854. At the beginning of the XX century, Peirce acknowledged that "All that exists is continuous and such continuum governs knowledge" and some years later, in 1923, Russell stated that "both vagueness and precision are features of language, not reality. Vagueness clearly is a matter of degree" (6). But, again according to Rouvray (5), it was Lukasiewicz who, in 1930, took the first step towards a formal model of vagueness, an early logic based on more values than true and false, and was later followed by Black, in 1937, who outlined his proto-fuzzy logic with the "suggestion that degrees of vagueness could be measured by a consistency function". Eventually, it was Zadeh (7) who, in 1965, settled the matter of vagueness setting forth the mechanics of fuzzy set theory. Zadeh's key concept is graded membership, according to which a set can have members that partly belong to it. So, if one assumes that X is a set serving as the universe of discourse, a fuzzy subset A of X is associated with a function: µA: X ® [0,1] which is generally called membership function. The idea is that for each x, µA(x) indicates the extent to which x is a member of the fuzzy set A. This membership degree indicates the degree of compatibility of the assertion "x is A".

Ever since Zadeh outlined the first principles of fuzzy sets theory, both its contents and applications have experienced an extraordinary development. From then to June 2003, when these annotations were made, the databases of scientific literature of the Institute for Scientific Information (Science Citation Expanded^®, Social Sciences Citation Index^®, and Arts and Humanities Citation Index) record that 21,187 articles containing the term "fuzzy" were published. Concerning the medical sciences, Medline^®recorded 1,777 such articles, beginning with just one in 1971 and increasing exponentially to 181 in 2002 (a mean yearly increment rate of 15%).

The present study was conceived as a proposal to determine whether fuzzy relations could add information to that provided by customary stochastic relations about the association and prediction of events of medical interest.

Material and Methods

Data were taken from a study about the relationship between clinical signs and diagnosis (8). The study comprised 153 children who were randomly divided into an analysis sample (115 cases) and a validation sample (38 cases). No criteria other than random allocation separated these two samples, the former being meant to draw relationships and the latter conceived as a trial sample to test the value of such relationships in terms of prediction. Since testing predicted values yielded by any function against the same real values used to derive such a function is not more than assessing residuals and goodness-of-fit, a validation sample is required to properly assess prediction, so that data processed to give prediction have nothing to do with the way they are processed.

Diagnoses (pneumonia, non-pneumonia diseases, healthy) were originally ascertained as either present (1) or absent (0), and were mutually exclusive. Two pediatricians, on grounds of identical clinical investigations, independently made the diagnoses, and an X-ray was required for the diagnosis of pneumonia. Entry condition, apart from ethical issues, was agreement between the two specialists.

The following clinical signs were considered for analysis: dyspnea, measured on a scale from absent (0) to severe (4) taking into account the following signs: mild discomfort, lower rib in-drawing with tachypnea, intercostal in-drawing with severe tachypnea and/or presence of nasal flaring, full retraction of ribs plus cyanosis and/or poor peripheral blood perfusion; toxemia, measured as a scale from absent (0) to severe (4), according to the presence of pallor, pallor and listlessness, irritability, drowsiness; radiological signs, measured as a counting scale for the presence of signs from absent (0) up to seven: alveolar and interstitial infiltrates, atelectasis, pleural effusion, pneumatoceles, airtrapping, pneumothorax; auscultation signs, measured on a counting scale for the presence of signs from absent (0) up to three: rales, crackles, bronchial breathing; temperature, measured on a scale from normal (0) to severe fever (3): normal (£37ºC), mild fever (>37ºC and <38.5ºC), fever (³38.5ºC and <40ºC), severe fever (³40ºC); heart rate, according to age group, measured on an ordinal scale from normal (0) to highly tachycardic (4); respiratory rate, according to age group, measured on an ordinal scale from normal (0) to highly tachypneic (4).

All of these measurements were made using a scale in order to have a single definition of a fuzzy membership function to map original values into grades of membership: scales were all normalized to the unit so that full membership would mean a clinical sign present at its most severe expression. This license was allowed by taking into account that the focus of the study was neither the diagnosis nor the signs, but, as indicated above, the comparison of two different methodological approaches, namely stochastic and fuzzy. Indeed, for such a goal any other sort of measurement should equally do, which would not be true if knowledge about the subject was being sought. Under these circumstances, one would better consider signs as linguistic variables and endeavor to develop a specific fuzzy membership function for each variable according to its symbolic and semantic characteristics (9). Thus, the fact that measurement precision is excused, should not jeopardize the clinical appreciation of the argument concerning an alternative reasoning for decision about the diagnosis of children's pneumonia, or any other sort of diagnosis.

To obtain relationships between clinical signs and diagnoses under a stochastic model, multi-discriminant analysis (10) was conducted, following the pattern of the original study from which the data were taken. Mahalanobis distances were used, and functions best discriminating diagnoses were derived by stepwise selection of variables. Relationships between multi-discriminant functions and clinical signs were examined in the rotated structure matrix. These functions were used to predict the diagnoses of patients in the validation sample and overall agreement was calculated.

To run a similar analysis under fuzzy theory, a max-min composition (11) was used to combine information from two fuzzy binary relations organized as membership matrices: one concerning the relationships between clinical signs (x) and patients (y) and the other concerning the relationships between patients (y) and diagnoses (z). This yielded a matrix of relationships between clinical signs (x) and diagnoses (z) [R(x,z)]. This max-min composition may be described as follows: given S and T, two binary relations of U x V and V x W (e.g., signs x patients and patients x diagnosis), the max-min composition S*T of U x W (e.g., signs x diagnoses) is a fuzzy binary relationship with membership function given by

(Eq. 1)

To predict the diagnoses (D) of patients from the validation sample the following function was used: for each patient (P_n), his/her relationship with each diagnosis (d_m) was drawn from the composition of his/her vector of clinical signs (s_i) with the composite relationship previously identified (R(x,z), relationships between signs and diagnoses), as follows:

(Eq. 2)

The D(P_n)(d_m) value can be seen as the possibility of diagnosis d_m for patient P_n, from his signs s_i, 1 £ i £ 7, since seven signs were considered. Hence, D(P_n) stands for the membership function of patient P_n in the universe of diagnoses.

To finalize the allocation of a patient to a single diagnosis, a defuzzification rule is needed to make a choice from the relationships he shows with each diagnosis category. This rule was defined as the highest value of the resulting membership functions. In other words, one patient should be allocated to the diagnosis for which he had highest membership. Since for allocation of a patient to the healthy category he/she should have no relation with any clinical sign, healthy patients were defined as those whose membership for pneumonia or other disease was null. In the case of ties, a complementary analysis treating diagnosis as a multi-response variable should be considered.

Results

Multi-discriminant analysis identified two functions that together could represent 100% of total variance (function 1 = 98.4%, function 2 = 1.6%). As shown in Figure 1, function 1 separates pneumonia cases from other cases, and function 2 separates healthy cases from others, with increasing values towards patients with any type of disease (Figure 1).

The rotated structure matrix provided information about the relationships between clinical signs and diagnoses as shown in Table 1.

Thumbnail