A New Path Length Measure Based on GO for Gene Similarity with Evaluation using SGD Pathways

by Anurag Nagar, Hisham Al-Mubaid
Abstract:
We propose a new method for measuring the semantic similarity of genes based on the path length between their annotation terms in the Gene Ontology (GO). Our method applies an exponential transfer function to the average path length between two genes to compute their similarity. This non-linear measure ensures that semantic similarity decreases with distance, and it proves competitive with other measures. Its main advantage is simplicity and ease of implementation: it uses only one feature, path length, to compute the similarity between genes. For validation, we computed the similarity of genes from the Saccharomyces Genome Database (SGD) that take part in various cellular pathways. We analyzed 152 pathways from SGD and compared our similarity results with two of the leading measures. The proposed measure was competitive in all cases, and the clustering results showed that it can surpass the leading methods in certain cases.
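The idea described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the exact formula, so everything specific here is an assumption: the similarity is taken to be exp(-decay * average path length), the `decay` constant of 0.2 is a placeholder, the average is taken over all pairs of annotation terms, and path length is the number of edges on the shortest path in an undirected view of the ontology. The term IDs and edges in `TOY_EDGES` are invented for illustration and are not real GO terms.

```python
import math
from collections import deque
from itertools import product

# Hypothetical toy ontology: undirected edges between made-up term IDs.
TOY_EDGES = [("T0", "T1"), ("T0", "T2"), ("T1", "T3"), ("T1", "T4"), ("T2", "T5")]


def build_graph(edges):
    """Build an undirected adjacency map from a list of (term, term) edges."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def path_length(graph, source, target):
    """Return the number of edges on the shortest path, found by BFS."""
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        node, dist = queue.popleft()
        for neighbor in graph.get(node, ()):
            if neighbor == target:
                return dist + 1
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, dist + 1))
    raise ValueError(f"no path between {source} and {target}")


def average_path_length(graph, terms_a, terms_b):
    """Average the path length over all term pairs (assumed averaging scheme)."""
    pairs = list(product(terms_a, terms_b))
    return sum(path_length(graph, a, b) for a, b in pairs) / len(pairs)


def gene_similarity(graph, terms_a, terms_b, decay=0.2):
    """Assumed exponential transfer function: exp(-decay * average path length)."""
    return math.exp(-decay * average_path_length(graph, terms_a, terms_b))


if __name__ == "__main__":
    graph = build_graph(TOY_EDGES)
    # Two hypothetical genes, each annotated with a set of toy terms.
    print(gene_similarity(graph, ["T3"], ["T4"]))
    print(gene_similarity(graph, ["T3"], ["T5"]))
```

Under these assumptions, two genes with identical annotations get a similarity of 1, and the score decays smoothly toward 0 as the annotation terms move farther apart in the graph, which matches the behavior the abstract describes.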
Reference:
A New Path Length Measure Based on GO for Gene Similarity with Evaluation using SGD Pathways (Anurag Nagar, Hisham Al-Mubaid), In 2008 21st IEEE International Symposium on Computer-Based Medical Systems, IEEE, 2008.
Bibtex Entry:
@inproceedings{Nagar2008,
abstract = {We propose a new method for measuring the semantic similarity of genes based on path length between their annotation terms in the gene ontology. Our method applies an exponential transfer function to the average path length between two genes to compute their similarity. The non-linear measure ensures that the semantic similarity decreases with distance and proves to be quite competitive when compared to other measures. The advantage of the proposed measure is its simplicity and ease of implementation which gives it a great appeal in this domain. The measure uses only one feature (path length) for computing the similarity between genes. For validation purposes, we computed the similarity of genes from the Saccharomyces genome database (SGD) taking part in various cellular pathways. We analyzed 152 pathways from SGD and compared our similarity results with two of the leading measures. The proposed measure proved to be very competitive in all cases and the clustering results showed that our method is able to surpass the leading methods in certain cases.},
author = {Nagar, Anurag and Al-Mubaid, Hisham},
booktitle = {2008 21st IEEE International Symposium on Computer-Based Medical Systems},
doi = {10.1109/CBMS.2008.27},
isbn = {978-0-7695-3165-6},
keywords = {SML-LIB-BIBLIO,lang:ENG},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
month = jun,
pages = {590--595},
publisher = {IEEE},
title = {{A New Path Length Measure Based on GO for Gene Similarity with Evaluation using SGD Pathways}},
year = {2008}
}