A cluster-based approach for semantic similarity in the biomedical domain.

return to the website
by Hisham Al-Mubaid, Hoa A. Nguyen
Abstract:
We propose a new cluster-based semantic similarity/distance measure for the biomedical domain within the framework of UMLS. The proposed measure is based mainly on the cross-modified path length feature between the concept nodes, and two new features: (1) the common specificity of two concept nodes, and (2) the local granularity of the clusters. We also applied, for comparison purpose, five existing general English ontology-based similarity measures into the biomedical domain within UMLS. The proposed measure was evaluated relative to human experts' ratings, and compared with the existing techniques using two ontologies (MeSH and SNOMED-CT) in UMLS. The experimental results confirmed the efficiency of the proposed method, and showed that our similarity measure gives the best overall results of correlation with human ratings. We show, further, that using MeSH ontology produces better semantic correlations with human experts' scores than SNOMED-CT in all of the tested measures.
Reference:
A cluster-based approach for semantic similarity in the biomedical domain. (Hisham Al-Mubaid, Hoa A. Nguyen), In Annual International Conference of the IEEE Engineering in Medicine and Biology Society., volume 1, 2006.
Bibtex Entry:
@inproceedings{Al-Mubaid2006,
abstract = {We propose a new cluster-based semantic similarity/distance measure for the biomedical domain within the framework of UMLS. The proposed measure is based mainly on the cross-modified path length feature between the concept nodes, and two new features: (1) the common specificity of two concept nodes, and (2) the local granularity of the clusters. We also applied, for comparison purpose, five existing general English ontology-based similarity measures into the biomedical domain within UMLS. The proposed measure was evaluated relative to human experts' ratings, and compared with the existing techniques using two ontologies (MeSH and SNOMED-CT) in UMLS. The experimental results confirmed the efficiency of the proposed method, and showed that our similarity measure gives the best overall results of correlation with human ratings. We show, further, that using MeSH ontology produces better semantic correlations with human experts' scores than SNOMED-CT in all of the tested measures.},
author = {Al-Mubaid, Hisham and Nguyen, Hoa A.},
booktitle = {Annual International Conference of the IEEE Engineering in Medicine and Biology Society.},
doi = {10.1109/IEMBS.2006.259235},
issn = {1557-170X},
keywords = {Abstracting and Indexing as Topic,Abstracting and Indexing as Topic: methods,Artificial Intelligence,Automated,Automated: methods,Cluster Analysis,Controlled,Natural Language Processing,Pattern Recognition,SML-LIB-BIBLIO,Unified Medical Language System,Vocabulary,lang:ENG},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
month = jan,
pages = {2713--7},
pmid = {17946134},
title = {{A cluster-based approach for semantic similarity in the biomedical domain.}},
url = {http://www.ncbi.nlm.nih.gov/pubmed/17946134},
volume = {1},
year = {2006}
}
Powered by bibtexbrowser