A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method.

by Illhoi Yoo, Xiaohua Hu, Il-Yeol Song
Abstract:
BACKGROUND: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. To make the biomedical information in this free text easier to use, document clustering and text summarization are used together as a solution to the text information overload problem. In this paper, we introduce a coherent graph-based semantic clustering and summarization approach for biomedical literature. RESULTS: Our extensive experimental results show that the approach achieves a 45% cluster quality improvement and a 72% clustering reliability improvement, in terms of misclassification index, over Bisecting K-means, a leading document clustering approach. In addition, our approach provides a concise but rich text summary in key concepts and sentences. CONCLUSION: Our coherent biomedical literature clustering and summarization approach, which takes advantage of ontology-enriched graphical representations, significantly improves the quality of document clusters and the understandability of documents through summaries.
Reference:
A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method. (Illhoi Yoo, Xiaohua Hu, Il-Yeol Song), In BMC Bioinformatics, volume 8 Suppl 9, 2007.
Bibtex Entry:
@article{Yoo2007,
abstract = {BACKGROUND: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free text, document clustering and text summarization together are used as a solution for text information overload problem. In this paper, we introduce a coherent graph-based semantic clustering and summarization approach for biomedical literature. RESULTS: Our extensive experimental results show the approach shows 45\% cluster quality improvement and 72\% clustering reliability improvement, in terms of misclassification index, over Bisecting K-means as a leading document clustering approach. In addition, our approach provides concise but rich text summary in key concepts and sentences. CONCLUSION: Our coherent biomedical literature clustering and summarization approach that takes advantage of ontology-enriched graphical representations significantly improves the quality of document clusters and understandability of documents through summaries.},
author = {Yoo, Illhoi and Hu, Xiaohua and Song, Il-Yeol},
doi = {10.1186/1471-2105-8-S9-S4},
issn = {1471-2105},
journal = {BMC Bioinformatics},
keywords = {Algorithms,Artificial Intelligence,Automated,Automated: methods,Cluster Analysis,Database Management Systems,Information Storage and Retrieval,Information Storage and Retrieval: methods,MEDLINE,Natural Language Processing,Pattern Recognition,Periodicals as Topic,SML-LIB-BIBLIO,Semantics,User-Computer Interface,lang:ENG},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
month = jan,
pages = {S4},
pmid = {18047705},
title = {{A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method}},
url = {http://www.ncbi.nlm.nih.gov/pubmed/18047705},
volume = {8 Suppl 9},
year = {2007}
}