Indexing with WordNet synsets can improve Text Retrieval

return to the website
by Julio Gonzalo, Felisa Verdejo, Irina Chugur, Juan Cigarrin
Abstract:
The classical, vector space model for text retrieval is shown to give better results (up to 29\% better in our experiments) if WordNet synsets are chosen as the indexing space, instead of word forms. This result is obtained for a manually disambiguated test collection (of queries and documents) derived from the Semcor semantic concordance. The sensitivity of retrieval performance to (automatic) disambiguation errors when indexing documents is also measured. Finally, it is observed that if queries are not disambiguated, indexing by synsets performs (at best) only as good as standard word indexing.
Reference:
Indexing with WordNet synsets can improve Text Retrieval (Julio Gonzalo, Felisa Verdejo, Irina Chugur, Juan Cigarrin), In COLING/ACL'98 Workshop on Usage of WordNet for NLP, 1998.
Bibtex Entry:
@article{gonzalo_indexing_1998,
abstract = {The classical, vector space model for text retrieval is shown to give better results (up to 29\% better in our experiments) if WordNet synsets are chosen as the indexing space, instead of word forms. This result is obtained for a manually disambiguated test collection (of queries and documents) derived from the Semcor semantic concordance. The sensitivity of retrieval performance to (automatic) disambiguation errors when indexing documents is also measured. Finally, it is observed that if queries are not disambiguated, indexing by synsets performs (at best) only as good as standard word indexing.},
address = {Montreal},
author = {Gonzalo, Julio and Verdejo, Felisa and Chugur, Irina and Cigarrin, Juan},
journal = {COLING/ACL'98 Workshop on Usage of WordNet for NLP},
keywords = {SML-LIB-BIBLIO,lang:ENG},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
pages = {38--44},
title = {{Indexing with WordNet synsets can improve Text Retrieval}},
year = {1998}
}
Powered by bibtexbrowser