SciELO - Scientific Electronic Library Online

vol.44 issue2Spatial and seasonal analysis on leptospirosis in the municipality of São Paulo, Southeastern Brazil, 1998 to 2006Recovery of the main causes of death in the Northeast of Brazil: impact on life expectancy author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand




Related links


Revista de Saúde Pública

Print version ISSN 0034-8910


MALUCELLI, Andreia et al. Classification of risk micro-areas using data mining. Rev. Saúde Pública [online]. 2010, vol.44, n.2, pp.292-300. ISSN 0034-8910.

OBJECTIVE: To identify, with the assistance of computational techniques, rules concerning the conditions of the physical environment for the classification of risk micro-areas. METHODS: Exploratory research carried out in Curitiba, Southern Brazil, in 2007. It was divided into three phases: the identification of attributes to classify a micro-area; the construction of a database; and the process of discovering knowledge in a database through the use of data mining. The set of attributes included the conditions of infrastructure; hydrography; soil; recreation area; community characteristics; and existence of vectors. The database was constructed with data obtained in interviews by community health workers using questionnaires with closed-ended questions, developed with the essential attributes selected by specialists. RESULTS: There were 49 attributes identified, 41 of which were essential and eight irrelevant. There were 68 rules obtained in the data mining, which were analyzed through the perspectives of performance and quality and divided into two sets: the inconsistent rules and the rules that confirm the knowledge of experts. The comparison between the groups showed that the rules that confirm the knowledge, despite having lower computational performance, were considered more interesting. CONCLUSIONS: The data mining provided a set of useful and understandable rules capable of characterizing risk areas based on the characteristics of the physical environment. The use of the proposed rules allows a faster and less subjective area classification, maintaining a standard between the health teams and overcoming the influence of individual perception by each team member.

Keywords : Databases as Topic; Databases, Factual; Knowledge Bases; Artificial intelligence; Environmental Indicators; Environmental Risks; Risk Map.

        · abstract in Portuguese | Spanish     · text in English | Portuguese     · English ( pdf ) | Portuguese ( pdf )


Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License