Mañana, JUEVES, 24 DE ABRIL, el sistema se apagará debido a tareas habituales de mantenimiento a partir de las 9 de la mañana. Lamentamos las molestias.
Adapting Searchy to extract data using evolved wrappers
Entity
UAM. Departamento de Ingeniería InformáticaPublisher
Pergamon PressDate
2012-02Citation
10.1016/j.eswa.2011.08.168
Expert Systems with Applications: An International Journal 39.3 (2012): 3061-3070
ISSN
0957-4174 (print); 1873-6793 (online)DOI
10.1016/j.eswa.2011.08.168Funded by
The authors gratefully acknowledge Mart´ın Knoblauch for his useful suggestions and valuable comments. This work has been partially supported by the Spanish Ministry of Science and Innovation under the projects ABANT (TIN 2010-19872), COMPUBIODIVE (TIN2007-65989) and by Castilla-La Mancha project PEII09-0266-6640.Editor's Version
http://dx.doi.org/10.1016/j.eswa.2011.08.168Subjects
Genetic Algorithms; Information extraction; Wrappers; InformáticaNote
This is the author’s version of a work that was accepted for publication inExpert Systems with Applications: An International Journal. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Expert Systems with Applications: An International Journal, 39, 3 (2012) DOI: 10.1016/j.eswa.2011.08.168Rights
© 2012 Elsevier B.V. All rights reservedEsta obra está bajo una licencia de Creative Commons Reconocimiento-NoComercial-SinObraDerivada 4.0 Internacional.
Abstract
Organizations need diverse information systems to deal with the increasing requirements in information storage and processing, yielding the creation of information islands and therefore an intrinsic difficulty to obtain a global view. Being able to provide such an unified view of the -likely heterogeneous-information available in an organization is a goal that provides added-value to the information systems and has been subject of intense research. In this paper we present an extension of a solution named Searchy, an agent-based mediator system specialized in data extraction and Integration. Through the use of a set of wrappers, it integrates information from arbitrary sources and semantically translates them according to a mediated scheme. Searchy is actually a domain-independent wrapper container that ease wrapper development, providing, for example, semantic mapping. The extension of Searchy proposed in this paper introduces an evolutionary wrapper that is able to evolve wrappers using regular expressions. To achieve this, a Genetic Algorithm (GA) is used to learn a regex able to extract a set of positive samples while rejects a set of negative samples.
Files in this item
Google Scholar:Barrero, David F.
-
R-Moreno, María Dolores
-
Camacho, David
This item appears in the following Collection(s)
Related items
Showing items related by title, author, creator and subject.
-
Variable length-based genetic representation to automatically evolve wrappers
Barrero, David F.; González-Pardo, Antonio; R-Moreno, María Dolores; Camacho, David
2010 -
An empirical study on the accuracy of computational effort in Genetic Programming
Barrero, David F.; R-Moreno, María Dolores; Castaño, Bonifacio; Camacho, David
2011