Distributional Measures of Semantic Distance: A Survey

by Saif Mohammad, Graeme Hirst
Abstract:
The ability to mimic human notions of semantic distance has widespread applications. Some measures rely only on raw text (distributional measures) and some rely on knowledge sources such as WordNet. Although extensive studies have been performed to compare WordNet-based measures with human judgment, the use of distributional measures as proxies to estimate semantic distance has received little attention. Even though they have traditionally performed poorly when compared to WordNet-based measures, they lay claim to certain uniquely attractive features, such as their applicability in resource-poor languages and their ability to mimic both semantic similarity and semantic relatedness. Therefore, this paper presents a detailed study of distributional measures. Particular attention is paid to flesh out the strengths and limitations of both WordNet-based and distributional measures, and how distributional measures of distance can be brought more in line with human notions of semantic distance. We conclude with a brief discussion of recent work on hybrid measures.
Reference:
Distributional Measures of Semantic Distance: A Survey (Saif Mohammad, Graeme Hirst), In ArXiv, volume 1203.1858, 2012.
Bibtex Entry:
@article{Mohammad2012,
abstract = {The ability to mimic human notions of semantic distance has widespread applications. Some measures rely only on raw text (distributional measures) and some rely on knowledge sources such as WordNet. Although extensive studies have been performed to compare WordNet-based measures with human judgment, the use of distributional measures as proxies to estimate semantic distance has received little attention. Even though they have traditionally performed poorly when compared to WordNet-based measures, they lay claim to certain uniquely attractive features, such as their applicability in resource-poor languages and their ability to mimic both semantic similarity and semantic relatedness. Therefore, this paper presents a detailed study of distributional measures. Particular attention is paid to flesh out the strengths and limitations of both WordNet-based and distributional measures, and how distributional measures of distance can be brought more in line with human notions of semantic distance. We conclude with a brief discussion of recent work on hybrid measures.},
archivePrefix = {arXiv},
arxivId = {1203.1858},
author = {Mohammad, Saif and Hirst, Graeme},
eprint = {1203.1858},
journal = {ArXiv},
keywords = {SML-LIB-BIBLIO,lang:ENG},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
month = mar,
title = {{Distributional Measures of Semantic Distance: A Survey}},
url = {http://arxiv.org/abs/1203.1858},
volume = {1203.1858},
year = {2012}
}