dc.contributor.author | Granados Fontecha, Ana | |
dc.contributor.author | Cebrián Ramos, Manuel | |
dc.contributor.author | Camacho, David | |
dc.contributor.author | Rodríguez Ortiz, Francisco Borja | |
dc.contributor.other | UAM. Departamento de Ingeniería Informática | es_ES |
dc.date.accessioned | 2015-10-02T15:59:40Z | |
dc.date.available | 2015-10-02T15:59:40Z | |
dc.date.issued | 2008 | |
dc.identifier.citation | Coding Theory and Applications: Second International Castle Meeting, ICMCTA 2008, Castillo de la Mota, Medina del Campo, Spain, September 15-19, 2008. Proceedings. Lecture Notes in Computer Science, Volume 5228. Springer, 2008. pp. 69-79. | en_US |
dc.identifier.isbn | 978-3-540-87447-8 (print) | en_US |
dc.identifier.isbn | 978-3-540-87448-5 (online) | en_US |
dc.identifier.issn | 0302-9743 (print) | en_US |
dc.identifier.issn | 1611-3349 (online) | en_US |
dc.identifier.uri | http://hdl.handle.net/10486/668387 | |
dc.description | The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-540-87448-5_8 | en_US |
dc.description | Proceedings of Second International Castle Meeting, ICMCTA 2008, Castillo de la Mota, Medina del Campo, Spain, September 15-19, 2008. | en_US |
dc.description.abstract | In this paper we apply several information distortion techniques to a set of classical books written in English. We study the impact of these distortions on the Kolmogorov complexity of the books and on the clustering by compression technique, which is based on the Normalized Compression Distance (NCD). We show how to decrease the complexity of the books by introducing several modifications in them, and we measure how well the information contained in each book is maintained using a clustering error measure. We find experimentally that the best way to maintain the clustering error is to modify the most frequent words. We explain the details of these information distortions and compare them with other kinds of modification, such as random word distortions and infrequent word distortions. Finally, we present some phenomenological explanations of the empirical results. | en_US |
dc.description.sponsorship | This work was supported by TIN 2004-04363-CO03-03, TIN 2007-65989, CAM S-SEM-0255-2006, TIN2007-64718 and TSI 2005-08255-C07-06. We would also like to thank Francisco Sánchez for his useful comments on this draft. | en_US |
dc.format.extent | 12 pages | en_US |
dc.format.mimetype | application/pdf | en |
dc.language.iso | eng | en |
dc.publisher | Springer Berlin Heidelberg | en_US |
dc.relation.ispartof | Lecture Notes in Computer Science | en_US |
dc.rights | © Springer-Verlag Berlin Heidelberg 2008 | en_US |
dc.subject.other | Coding and Information Theory | en_US |
dc.subject.other | Algebra | en_US |
dc.subject.other | Algebraic Geometry | en_US |
dc.title | Evaluating the impact of information distortion on normalized compression distance | en_US |
dc.type | conferenceObject | en |
dc.type | bookPart | en |
dc.subject.eciencia | Informática | es_ES |
dc.relation.publisherversion | http://dx.doi.org/10.1007/978-3-540-87448-5_8 | |
dc.identifier.doi | 10.1007/978-3-540-87448-5_8 | |
dc.identifier.publicationfirstpage | 69 | |
dc.identifier.publicationlastpage | 79 | |
dc.identifier.publicationvolume | 5228 | |
dc.relation.eventdate | September 15-19, 2008 | en_US |
dc.relation.eventnumber | 2 | |
dc.relation.eventplace | Castillo de la Mota (Spain) | en_US |
dc.relation.eventtitle | 2nd International Castle Meeting on Coding Theory and Applications, ICMCTA 2008 | en_US |
dc.relation.projectID | Comunidad de Madrid. S2006/SEM-0255/OLFACTOSENSE | es_ES |
dc.relation.projectID | Gobierno de España. TIN2004-04363-CO03-03 | es_ES |
dc.relation.projectID | Gobierno de España. TIN2007-65989 | es_ES |
dc.relation.projectID | Gobierno de España. TIN2007-64718 | es_ES |
dc.relation.projectID | Gobierno de España. TSI2005-08255-C07-06 | es_ES |
dc.type.version | info:eu-repo/semantics/acceptedVersion | en |
dc.contributor.group | Herramientas Interactivas Avanzadas (ING EPS-003) | es_ES |
dc.contributor.group | Neurocomputación Biológica (ING EPS-005) | es_ES |
dc.rights.accessRights | openAccess | en |
dc.authorUAM | Camacho Fernández, David (261274) | |
dc.facultadUAM | Escuela Politécnica Superior | |
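
The abstract refers to the Normalized Compression Distance and to distorting the most frequent words of a text. As a rough illustration only, the sketch below computes the standard NCD, (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), using zlib as a stand-in compressor, and applies one possible word distortion. The choice of zlib, the function names, and the distortion rule (overwriting each occurrence of a frequent word with asterisks of the same length) are assumptions made for this example; the record does not state which compressor or substitution the authors used.

```python
import zlib
from collections import Counter


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (assumed compressor)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized Compression Distance between two byte strings."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def distort_most_frequent(text: str, n_words: int) -> str:
    """Hypothetical distortion: mask the n most frequent words with '*'.

    Words are split on whitespace; each masked word keeps its length.
    """
    words = text.split()
    top = {w for w, _ in Counter(words).most_common(n_words)}
    return " ".join("*" * len(w) if w in top else w for w in words)


if __name__ == "__main__":
    a = b"the quick brown fox jumps over the lazy dog " * 20
    b = bytes(range(256)) * 4
    print("NCD(a, a):", ncd(a, a))
    print("NCD(a, b):", ncd(a, b))
    print(distort_most_frequent("the cat and the dog and the bird", 1))
```

With a real compressor the NCD of a text with itself is small but not exactly zero, and values for unrelated inputs tend towards 1. A clustering experiment of the kind described in the abstract would compute the pairwise NCD matrix before and after distortion and compare the resulting clusterings.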