Show simple item record

dc.contributor.authorGranados Fontecha, Ana
dc.contributor.authorCebrián Ramos, Manuel
dc.contributor.authorCamacho, David
dc.contributor.authorRodríguez Ortiz, Francisco Borja 
dc.contributor.otherUAM. Departamento de Ingeniería Informáticaes_ES
dc.date.accessioned2015-10-02T15:59:40Z
dc.date.available2015-10-02T15:59:40Z
dc.date.issued2008
dc.identifier.citationCoding Theory and Applications: Second International Castle Meeting, ICMCTA 2008, Castillo de la Mota, Medina del Campo, Spain, September 15-19, 2008. Proceedings. Lecture Notes in Computer Science, Volumen 5228. Springer, 2008. 69-79.en_US
dc.identifier.isbn978-3-540-87447-8 (print)en_US
dc.identifier.isbn978-3-540-87448-5 (online)en_US
dc.identifier.issn0302-9743 (print)en_US
dc.identifier.issn1611-3349 (online)en_US
dc.identifier.urihttp://hdl.handle.net/10486/668387
dc.descriptionThe final publication is available at Springer via http://dx.doi.org/10.1007/978-3-540-87448-5_8en_US
dc.descriptionProceedings of Second International Castle Meeting, ICMCTA 2008, Castillo de la Mota, Medina del Campo, Spain, September 15-19, 2008.en_US
dc.description.abstractIn this paper we apply different techniques of information distortion on a set of classical books written in English. We study the impact that these distortions have upon the Kolmogorov complexity and the clustering by compression technique (the latter based on Normalized Compression Distance, NCD). We show how to decrease the complexity of the considered books introducing several modifications in them. We measure how the information contained in each book is maintained using a clustering error measure. We find experimentally that the best way to keep the clustering error is by means of modifications in the most frequent words. We explain the details of these information distortions and we compare with other kinds of modifications like random word distortions and unfrequent word distortions. Finally, some phenomenological explanations from the different empirical results that have been carried out are presented.en_US
dc.description.sponsorshipThis work was supported by TIN 2004-04363-CO03-03, TIN 2007-65989, CAM S-SEM-0255-2006, TIN2007-64718 and TSI 2005-08255-C07-06. We would also like to thank Franscico Sánchez for his useful comments on this draft.en_US
dc.format.extent12 pág.es_ES
dc.format.mimetypeapplication/pdfen
dc.language.isoengen
dc.publisherSpringer Berlin Heidelbergen_US
dc.relation.ispartofLecture Notes in Computer Scienceen_US
dc.rights© Springer-Verlag Berlin Heidelberg 2008en_US
dc.subject.otherCoding and Information Theoryen_US
dc.subject.otherAlgebraen_US
dc.subject.otherAlgebraic Geometryen_US
dc.titleEvaluating the impact of information distortion on normalized compression distanceen_US
dc.typeconferenceObjecten
dc.typebookParten
dc.subject.ecienciaInformáticaes_ES
dc.relation.publisherversionhttp://dx.doi.org/10.1007/978-3-540-87448-5_8
dc.identifier.doi10.1007/978-3-540-87448-5_8
dc.identifier.publicationfirstpage69
dc.identifier.publicationlastpage79
dc.identifier.publicationvolume5228
dc.relation.eventdateSeptember 15-19, 2008en_US
dc.relation.eventnumber2
dc.relation.eventplaceCastillo de la Mota (Spain)en_US
dc.relation.eventtitle2nd International Castle Meeting on Coding Theory and Applications, ICMCTA 2008en_US
dc.relation.projectIDComunidad de Madrid. S2006/SEM-0255/OLFACTOSENSEes_ES
dc.relation.projectIDGobierno de España. TIN2004-04363-CO03-03es_ES
dc.relation.projectIDGobierno de España. TIN2007-65989es_ES
dc.relation.projectIDGobierno de España. TIN2007-64718es_ES
dc.relation.projectIDGobierno de España. TSI2005-08255-C07-06es_ES
dc.type.versioninfo:eu-repo/semantics/acceptedVersionen
dc.contributor.groupHerramientas Interactivas Avanzadas (ING EPS-003)es_ES
dc.contributor.groupNeurocomputación Biológica (ING EPS-005)es_ES
dc.rights.accessRightsopenAccessen
dc.authorUAMCamacho Fernández, David (261274)
dc.facultadUAMEscuela Politécnica Superior


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record