Hierarchical text clustering applied to taxonomy evaluation
Author
Muñoz Hidalgo, SamuelAdvisor
Camacho, DavidEntity
UAM. Departamento de Ingeniería InformáticaDate
2014Subjects
Biología - Clasificación; Ontología; Gestión del conocimiento; Informática
Esta obra está bajo una licencia de Creative Commons Reconocimiento-NoComercial-SinObraDerivada 4.0 Internacional.
Abstract
In computer science, the use for taxonomies is widely embraced in fields such as Artifial
Inteligence, Information Retrieval, Natural Language Processing or Machine Learning.
This concept classifications provide knowledge structures to guide algorithms on the
task to find an acceptable-to-nearly-optimal solution on non deterministic problems.
The main problem with taxonomies is the huge amount of effort that requires to build
one. Traditionally, this is done by human means and involves a team of experts to assure
the quality of the result. Since this is evidently the way to get the best taxonomy
possible (knowledge is an exclusive quality of humans), due to the manpower factor, it
seems to be neither the fastest nor the cheapest one.
This thesis makes an extensive review of the state of the art on taxonomy induction
techniques as well as ontology evaluation methods. It claims the need for a fast, automatic
and arbitrary-domain taxonomy generation method and justifies the chose of the
Wikipedia encyclopedia as the dataset. A framework to deal with taxonomies is proposed
and implemented. In the experiments chapter, two statements are successfully
refuted: the Wikipedia categorization system forms an acyclic directed graph, and the
longest path between two nodes is equivalent to the taxonomic organization. Finally
the framework is used to explore three arbitrary domains.
Files in this item
Google Scholar:Muñoz Hidalgo, Samuel
This item appears in the following Collection(s)
Related items
Showing items related by title, author, creator and subject.
-
Alberta Stroke Program Early CT Score applied to CT angiography source images is a strong predictor of futile recanalization in acute ischemic stroke
Kawiorski, Michal M.; Martínez-Sánchez, Patricia; García-Pastor, Andrés; Calleja, Patricia; Fuentes Gimeno, Blanca Eulalia; Sanz-Cuesta, Borja E.; Lourido, Daniel; Marín, Begoña; Díaz-Otero, Fernando; Vicente, Agustina; Sierra-Hidalgo, Fernando; Ruiz-Ares, Gerardo; Díez Tejedor, Exuperio
; Fandiño, Eduardo; Alonso de Leciñana, María
2016-05-01