Adapting Searchy to extract data using evolved wrappers

Biblos-e Archivo/Manakin Repository

Show simple item record

dc.contributor.author Barrero, David F.
dc.contributor.author R-Moreno, María Dolores
dc.contributor.author Camacho, David
dc.contributor.other UAM. Departamento de Ingeniería Informática es_ES
dc.date.accessioned 2015-05-29T15:21:30Z
dc.date.available 2015-05-29T15:21:30Z
dc.date.issued 2012-02
dc.identifier.citation Expert Systems with Applications: An International Journal 39.3 (2012): 3061-3070 en_US
dc.identifier.issn 0957-4174 (print) en_US
dc.identifier.issn 1873-6793 (online) en_US
dc.identifier.uri http://hdl.handle.net/10486/666489
dc.description This is the author’s version of a work that was accepted for publication inExpert Systems with Applications: An International Journal. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Expert Systems with Applications: An International Journal, 39, 3 (2012) DOI: 10.1016/j.eswa.2011.08.168 en_US
dc.description.abstract Organizations need diverse information systems to deal with the increasing requirements in information storage and processing, yielding the creation of information islands and therefore an intrinsic difficulty to obtain a global view. Being able to provide such an unified view of the -likely heterogeneous-information available in an organization is a goal that provides added-value to the information systems and has been subject of intense research. In this paper we present an extension of a solution named Searchy, an agent-based mediator system specialized in data extraction and Integration. Through the use of a set of wrappers, it integrates information from arbitrary sources and semantically translates them according to a mediated scheme. Searchy is actually a domain-independent wrapper container that ease wrapper development, providing, for example, semantic mapping. The extension of Searchy proposed in this paper introduces an evolutionary wrapper that is able to evolve wrappers using regular expressions. To achieve this, a Genetic Algorithm (GA) is used to learn a regex able to extract a set of positive samples while rejects a set of negative samples. en_US
dc.description.sponsorship The authors gratefully acknowledge Mart´ın Knoblauch for his useful suggestions and valuable comments. This work has been partially supported by the Spanish Ministry of Science and Innovation under the projects ABANT (TIN 2010-19872), COMPUBIODIVE (TIN2007-65989) and by Castilla-La Mancha project PEII09-0266-6640. en_US
dc.format.extent 12 pág. es_ES
dc.format.mimetype application/pdf en
dc.language.iso eng en
dc.publisher Pergamon Press en_US
dc.relation.ispartof Expert Systems with Applications en_US
dc.rights © 2012 Elsevier B.V. All rights reserved en_US
dc.subject.other Genetic Algorithms en_US
dc.subject.other Information extraction en_US
dc.subject.other Wrappers en_US
dc.title Adapting Searchy to extract data using evolved wrappers en_US
dc.type article en_US
dc.subject.eciencia Informática es_ES
dc.relation.publisherversion http://dx.doi.org/10.1016/j.eswa.2011.08.168
dc.identifier.doi 10.1016/j.eswa.2011.08.168
dc.identifier.publicationfirstpage 3061
dc.identifier.publicationissue 3
dc.identifier.publicationlastpage 3070
dc.identifier.publicationvolume 39
dc.type.version info:eu-repo/semantics/acceptedVersion en
dc.contributor.group Análisis de Datos e Inteligencia Aplicada (ING EPS-012) es_ES
dc.rights.cc Reconocimiento – NoComercial – SinObraDerivada es_ES
dc.rights.accessRights openAccess en
dc.authorUAM Camacho Fernández, David (261274)


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record