Correlation between gene expression and GO semantic similarity.

return to the website
by José L Sevilla, Víctor Segura, Adam Podhorski, Elizabeth Guruceaga, José M Mato, Luis A Martínez-Cruz, Fernando J Corrales, Angel Rubio
Abstract:
This research analyzes some aspects of the relationship between gene expression, gene function, and gene annotation. Many recent studies are implicitly based on the assumption that gene products that are biologically and functionally related would maintain this similarity both in their expression profiles as well as in their Gene Ontology (GO) annotation. We analyze how accurate this assumption proves to be using real publicly available data. We also aim to validate a measure of semantic similarity for GO annotation. We use the Pearson correlation coefficient and its absolute value as a measure of similarity between expression profiles of gene products. We explore a number of semantic similarity measures (Resnik, Jiang, and Lin) and compute the similarity between gene products annotated using the GO. Finally, we compute correlation coefficients to compare gene expression similarity against GO semantic similarity. Our results suggest that the Resnik similarity measure outperforms the others and seems better suited for use in Gene Ontology. We also deduce that there seems to be correlation between semantic similarity in the GO annotation and gene expression for the three GO ontologies. We show that this correlation is negligible up to a certain semantic similarity value; then, for higher similarity values, the relationship trend becomes almost linear. These results can be used to augment the knowledge provided by clustering algorithms and in the development of bioinformatic tools for finding and characterizing gene products.
Reference:
Correlation between gene expression and GO semantic similarity. (José L Sevilla, Víctor Segura, Adam Podhorski, Elizabeth Guruceaga, José M Mato, Luis A Martínez-Cruz, Fernando J Corrales, Angel Rubio), In IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM, volume 2, 2005.
Bibtex Entry:
@article{Sevilla,
abstract = {This research analyzes some aspects of the relationship between gene expression, gene function, and gene annotation. Many recent studies are implicitly based on the assumption that gene products that are biologically and functionally related would maintain this similarity both in their expression profiles as well as in their Gene Ontology (GO) annotation. We analyze how accurate this assumption proves to be using real publicly available data. We also aim to validate a measure of semantic similarity for GO annotation. We use the Pearson correlation coefficient and its absolute value as a measure of similarity between expression profiles of gene products. We explore a number of semantic similarity measures (Resnik, Jiang, and Lin) and compute the similarity between gene products annotated using the GO. Finally, we compute correlation coefficients to compare gene expression similarity against GO semantic similarity. Our results suggest that the Resnik similarity measure outperforms the others and seems better suited for use in Gene Ontology. We also deduce that there seems to be correlation between semantic similarity in the GO annotation and gene expression for the three GO ontologies. We show that this correlation is negligible up to a certain semantic similarity value; then, for higher similarity values, the relationship trend becomes almost linear. These results can be used to augment the knowledge provided by clustering algorithms and in the development of bioinformatic tools for finding and characterizing gene products.},
author = {Sevilla, Jos\'{e} L and Segura, V\'{\i}ctor and Podhorski, Adam and Guruceaga, Elizabeth and Mato, Jos\'{e} M and Mart\'{\i}nez-Cruz, Luis A and Corrales, Fernando J and Rubio, Angel},
doi = {10.1109/TCBB.2005.50},
issn = {1545-5963},
journal = {IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM},
keywords = {Algorithms,Animals,Artificial Intelligence,Computational Biology,Computational Biology: methods,Controlled,Databases,Gene Expression,Genetic,Humans,Mice,SML-LIB-BIBLIO,Semantics,Statistics as Topic,Vocabulary,lang:ENG},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
number = {4},
pages = {330--8},
pmid = {17044170},
title = {{Correlation between gene expression and GO semantic similarity.}},
url = {http://www.ncbi.nlm.nih.gov/pubmed/17044170},
volume = {2},
year = {2005}
}
Powered by bibtexbrowser