Deriving concept hierarchies from text

return to the website
by Mark Sanderson, Bruce Croft
Abstract:
This paper presents a means of automatically deriving a hierarchical organization of concepts from a set of documents without use of training data or standard clustering techniques. Instead, salient words and phrases extracted from the documents are organized hierarchically using a type of co-occurrence known as subsumption. The resulting structure is displayed as a series of hierarchical menus. When generated from a set of retrieved documents, a user browsing the menus is provided with a detailed overview of their content in a manner distinct from existing overview and summarization techniques. The methods used to build the structure are simple, but appear to be effective: a smallscale user study reveals that the generated hierarchy possesses properties expected of such a structure in that general terms are placed at the top levels leading to related and more specific terms below. The formation and presentation of the hierarchy is described along with the user study and some other inf...
Reference:
Deriving concept hierarchies from text (Mark Sanderson, Bruce Croft), In Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR'99, ACM Press, 1999.
Bibtex Entry:
@inproceedings{Sanderson1999,
abstract = {This paper presents a means of automatically deriving a hierarchical organization of concepts from a set of documents without use of training data or standard clustering techniques. Instead, salient words and phrases extracted from the documents are organized hierarchically using a type of co-occurrence known as subsumption. The resulting structure is displayed as a series of hierarchical menus. When generated from a set of retrieved documents, a user browsing the menus is provided with a detailed overview of their content in a manner distinct from existing overview and summarization techniques. The methods used to build the structure are simple, but appear to be effective: a smallscale user study reveals that the generated hierarchy possesses properties expected of such a structure in that general terms are placed at the top levels leading to related and more specific terms below. The formation and presentation of the hierarchy is described along with the user study and some other inf...},
address = {New York, USA},
author = {Sanderson, Mark and Croft, Bruce},
booktitle = {Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR'99},
doi = {10.1145/312624.312679},
isbn = {1581130961},
keywords = {Information Content,SML-LIB-BIBLIO,Semantic Similarity,concept hierarchy,information content,lang:ENG,multi-,subsumption,term co-occurrence},
mendeley-tags = {SML-LIB-BIBLIO,Semantic Similarity,information content,lang:ENG},
pages = {206--213},
publisher = {ACM Press},
title = {{Deriving concept hierarchies from text}},
url = {http://portal.acm.org/citation.cfm?doid=312624.312679},
year = {1999}
}
Powered by bibtexbrowser