SciELO - Scientific Electronic Library Online

 
Transinformação

Print version ISSN 0103-3786

Abstract

XAVIER, Bruno Missi; SILVA, Alcione Dias da and GOMES, Geórgia Regina Rodrigues. A hybrid architecture for document indexing of the Municipal Official Gazette in Cachoeiro de Itapemirim. Transinformação [online]. 2015, vol.27, n.1, pp.83-95. ISSN 0103-3786. http://dx.doi.org/10.1590/0103-37862015000100008.

Text mining techniques have been widely used to process large volumes of documents. However, there is still a large gap in defining architectures for systems that combine transactional components with elements of computational intelligence. This paper outlines a proposed architecture for a computational system that uses text mining techniques to index content from the database of the Official Gazette of the city of Cachoeiro de Itapemirim, in the state of Espírito Santo, transforming information previously available in natural language into a structured format that can be persisted. To validate the architecture, we developed a Web-accessible prototype in Java. To evaluate the tool, a case study used a database of 22 documents containing 198 normative acts from the Official Gazette, for which good levels of accuracy and coverage in information retrieval were observed. The study's contribution is a hybrid architecture that combines components of the transactional systems model with elements of text mining and makes use of software design patterns.

Keywords: Official Gazette Cachoeiro de Itapemirim; Indexing documents; Text mining; Information retrieval.
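
As an illustration of the kind of transformation the abstract describes (natural-language gazette text turned into structured, indexable records), the following is a minimal sketch in Python. It is not the authors' implementation: their prototype was written in Java and its design is not detailed here. The heading pattern `ACT_HEADING`, the `NormativeAct` record, the helper functions, and the sample gazette text are all hypothetical and chosen only for illustration.

```python
import re
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical heading pattern: an act type, "Nº", and a number
# (e.g. "DECRETO Nº 25.001"). Real gazette headings may differ.
ACT_HEADING = re.compile(r"^(DECRETO|PORTARIA|LEI)\s+N[ºo°]\s*([\d.]+)", re.MULTILINE)


@dataclass
class NormativeAct:
    """Structured form of one act, ready to be persisted."""
    kind: str
    number: str
    body: str


def split_acts(gazette_text):
    """Split one gazette issue into structured normative acts."""
    matches = list(ACT_HEADING.finditer(gazette_text))
    acts = []
    for i, m in enumerate(matches):
        # An act's body runs until the next heading (or the end of the text).
        end = matches[i + 1].start() if i + 1 < len(matches) else len(gazette_text)
        body = gazette_text[m.end():end].strip()
        acts.append(NormativeAct(m.group(1), m.group(2), body))
    return acts


def build_index(acts):
    """Map each lowercase token to the positions of the acts containing it."""
    index = defaultdict(set)
    for pos, act in enumerate(acts):
        for token in re.findall(r"\w+", act.body.lower()):
            index[token].add(pos)
    return index


# Invented sample text, not taken from the actual Official Gazette.
sample = """DECRETO Nº 25.001
Nomeia servidor para cargo em comissão.
PORTARIA Nº 310
Concede férias a servidor municipal.
"""

acts = split_acts(sample)
index = build_index(acts)
```

Here `acts` holds one record per detected act, and looking up a term such as `index["servidor"]` returns the positions of the acts whose text mentions it. A production system would likely add stemming, stop-word removal, and a persistent store, none of which are shown.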
