Semantic similarity in biomedical ontologies.

return to the website
by Catia Pesquita, Daniel Faria, André O. Falcão, Phillip Lord, Francisco M. Couto
Abstract:
In recent years, ontologies have become a mainstream topic in biomedical research. When biological entities are described using a common schema, such as an ontology, they can be compared by means of their annotations. This type of comparison is called semantic similarity, since it assesses the degree of relatedness between two entities by the similarity in meaning of their annotations. The application of semantic similarity to biomedical ontologies is recent; nevertheless, several studies have been published in the last few years describing and evaluating diverse approaches. Semantic similarity has become a valuable tool for validating the results drawn from biomedical studies such as gene clustering, gene expression data analysis, prediction and validation of molecular interactions, and disease gene prioritization. We review semantic similarity measures applied to biomedical ontologies and propose their classification according to the strategies they employ: node-based versus edge-based and pairwise versus groupwise. We also present comparative assessment studies and discuss the implications of their results. We survey the existing implementations of semantic similarity measures, and we describe examples of applications to biomedical research. This will clarify how biomedical researchers can benefit from semantic similarity measures and help them choose the approach most suitable for their studies.Biomedical ontologies are evolving toward increased coverage, formality, and integration, and their use for annotation is increasingly becoming a focus of both effort by biomedical experts and application of automated annotation procedures to create corpora of higher quality and completeness than are currently available. Given that semantic similarity measures are directly dependent on these evolutions, we can expect to see them gaining more relevance and even becoming as essential as sequence similarity is today in biomedical research.
Reference:
Semantic similarity in biomedical ontologies. (Catia Pesquita, Daniel Faria, André O. Falcão, Phillip Lord, Francisco M. Couto), In PLoS Computational Biology, volume 5, 2009.
Bibtex Entry:
@article{Pesquita2009,
abstract = {In recent years, ontologies have become a mainstream topic in biomedical research. When biological entities are described using a common schema, such as an ontology, they can be compared by means of their annotations. This type of comparison is called semantic similarity, since it assesses the degree of relatedness between two entities by the similarity in meaning of their annotations. The application of semantic similarity to biomedical ontologies is recent; nevertheless, several studies have been published in the last few years describing and evaluating diverse approaches. Semantic similarity has become a valuable tool for validating the results drawn from biomedical studies such as gene clustering, gene expression data analysis, prediction and validation of molecular interactions, and disease gene prioritization. We review semantic similarity measures applied to biomedical ontologies and propose their classification according to the strategies they employ: node-based versus edge-based and pairwise versus groupwise. We also present comparative assessment studies and discuss the implications of their results. We survey the existing implementations of semantic similarity measures, and we describe examples of applications to biomedical research. This will clarify how biomedical researchers can benefit from semantic similarity measures and help them choose the approach most suitable for their studies.Biomedical ontologies are evolving toward increased coverage, formality, and integration, and their use for annotation is increasingly becoming a focus of both effort by biomedical experts and application of automated annotation procedures to create corpora of higher quality and completeness than are currently available. Given that semantic similarity measures are directly dependent on these evolutions, we can expect to see them gaining more relevance and even becoming as essential as sequence similarity is today in biomedical research.},
author = {Pesquita, Catia and Faria, Daniel and Falc\~{a}o, Andr\'{e} O. and Lord, Phillip and Couto, Francisco M.},
doi = {10.1371/journal.pcbi.1000443},
issn = {1553-7358},
journal = {PLoS Computational Biology},
keywords = {Algorithms,Biomedical Research,Biomedical Research: methods,Classification,Classification: methods,Computational Biology,Computational Biology: methods,GO sim,Natural Language Processing,SML-LIB-BIBLIO,Semantic Similarity,Semantics,Software,Terminology as Topic,lang:ENG,semantic similarity},
mendeley-tags = {GO sim,SML-LIB-BIBLIO,lang:ENG,semantic similarity},
month = jul,
number = {7},
pages = {12},
pmid = {19649320},
title = {{Semantic similarity in biomedical ontologies.}},
url = {http://www.ncbi.nlm.nih.gov/pubmed/19649320},
volume = {5},
year = {2009}
}
Powered by bibtexbrowser