A study on semantic similarity and its application to clustering

by Montserrat Batet
Abstract:
In recent years, the amount of textual electronic information available has been increasing rapidly. Computer understanding of texts has become an important trend in computational linguistics. Proper processing of this kind of information requires an interpretation of its meaning at a semantic level. This work presents novel measures to estimate the degree of semantic similarity between words using one or more knowledge sources. The measures are based on the exploitation of the knowledge modelled in one or several ontologies and on the estimation of the information distribution of terms on the Web. They have been applied to clustering, computing the similarity/distance between individuals described by textual attributes. Results show that a proper interpretation of textual data at a semantic level improves the quality of the clusters and eases their interpretability.
Reference:
A study on semantic similarity and its application to clustering (Montserrat Batet), VDM Verlag Dr. Müller, 2011.
Bibtex Entry:
@book{Batet2011,
abstract = {In recent years, the amount of textual electronic information available has been increasing rapidly. Computer understanding of texts has become an important trend in computational linguistics. Proper processing of this kind of information requires an interpretation of its meaning at a semantic level. This work presents novel measures to estimate the degree of semantic similarity between words using one or more knowledge sources. The measures are based on the exploitation of the knowledge modelled in one or several ontologies and on the estimation of the information distribution of terms on the Web. They have been applied to clustering, computing the similarity/distance between individuals described by textual attributes. Results show that a proper interpretation of textual data at a semantic level improves the quality of the clusters and eases their interpretability.},
author = {Batet, Montserrat},
isbn = {978-3639370607},
keywords = {SML-LIB-BIBLIO,lang:ENG},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
pages = {172},
publisher = {VDM Verlag Dr. M\"{u}ller},
title = {{A study on semantic similarity and its application to clustering}},
year = {2011}
}