Similarity measures in formal concept analysis

Alqadah, Faris; Bhatnagar, Raj

doi:10.1007/s10472-011-9257-7

by Faris Alqadah, Raj Bhatnagar

Abstract:

Formal concept analysis (FCA) has been applied successively in diverse fields such as data mining, conceptual modeling, social networks, software engineering, and the semantic web. One shortcoming of FCA, however, is the large number of concepts that typically arise in dense datasets hindering typical tasks such as rule generation and visualization. To overcome this shortcoming, it is important to develop formalisms and methods to segment, categorize and cluster formal concepts. The first step in achieving these aims is to define suitable similarity and dissimilarity measures of formal concepts. In this paper we propose three similarity measures based on existent set-based measures in addition to developing the completely novel zeros-induced measure. Moreover, we formally prove that all the measures proposed are indeed similarity measures and investigate the computational complexity of computing them. Finally, an extensive empirical evaluation on real-world data is presented in which the utility and character of each similarity measure is tested and evaluated.

View PDF

Reference:

Similarity measures in formal concept analysis (Faris Alqadah, Raj Bhatnagar), In Annals of Mathematics and Artificial Intelligence, volume 61, 2011.

Bibtex Entry:

@article{Alqadah2011,
abstract = {Formal concept analysis (FCA) has been applied successively in diverse fields such as data mining, conceptual modeling, social networks, software engineering, and the semantic web. One shortcoming of FCA, however, is the large number of concepts that typically arise in dense datasets hindering typical tasks such as rule generation and visualization. To overcome this shortcoming, it is important to develop formalisms and methods to segment, categorize and cluster formal concepts. The first step in achieving these aims is to define suitable similarity and dissimilarity measures of formal concepts. In this paper we propose three similarity measures based on existent set-based measures in addition to developing the completely novel zeros-induced measure. Moreover, we formally prove that all the measures proposed are indeed similarity measures and investigate the computational complexity of computing them. Finally, an extensive empirical evaluation on real-world data is presented in which the utility and character of each similarity measure is tested and evaluated.},
author = {Alqadah, Faris and Bhatnagar, Raj},
doi = {10.1007/s10472-011-9257-7},
isbn = {1047201192577},
issn = {1012-2443},
journal = {Annals of Mathematics and Artificial Intelligence},
keywords = {2010,62h30,68t10,SML-LIB-BIBLIO,cluster similarity,formal concept analysis,lang:ENG,mathematics subject classifications},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
month = aug,
number = {3},
pages = {245----256},
title = {{Similarity measures in formal concept analysis}},
url = {http://www.springerlink.com/index/10.1007/s10472-011-9257-7},
volume = {61},
year = {2011}
}