The large-scale structure of semantic networks: statistical analyses and a model of semantic growth.

return to the website
by Mark Steyvers, Joshua B Tenenbaum
Abstract:
We present statistical analyses of the large-scale structure of 3 types of semantic networks: word associations, WordNet, and Roget's Thesaurus. We show that they have a small-world structure, characterized by sparse connectivity, short average path lengths between words, and strong local clustering. In addition, the distributions of the number of connections follow power laws that indicate a scale-free pattern of connectivity, with most nodes having relatively few connections joined together through a small number of hubs with many connections. These regularities have also been found in certain other complex natural networks, such as the World Wide Web, but they are not consistent with many conventional models of semantic organization, based on inheritance hierarchies, arbitrarily structured networks, or high-dimensional vector spaces. We propose that these structures reflect the mechanisms by which semantic networks grow. We describe a simple model for semantic growth, in which each new word or concept is connected to an existing network by differentiating the connectivity pattern of an existing node. This model generates appropriate small-world statistics and power-law connectivity distributions, and it also suggests one possible mechanistic basis for the effects of learning history variables (age of acquisition, usage frequency) on behavioral performance in semantic processing tasks.
Reference:
The large-scale structure of semantic networks: statistical analyses and a model of semantic growth. (Mark Steyvers, Joshua B Tenenbaum), In Cognitive science, volume 29, 2005.
Bibtex Entry:
@article{Steyvers2005a,
abstract = {We present statistical analyses of the large-scale structure of 3 types of semantic networks: word associations, WordNet, and Roget's Thesaurus. We show that they have a small-world structure, characterized by sparse connectivity, short average path lengths between words, and strong local clustering. In addition, the distributions of the number of connections follow power laws that indicate a scale-free pattern of connectivity, with most nodes having relatively few connections joined together through a small number of hubs with many connections. These regularities have also been found in certain other complex natural networks, such as the World Wide Web, but they are not consistent with many conventional models of semantic organization, based on inheritance hierarchies, arbitrarily structured networks, or high-dimensional vector spaces. We propose that these structures reflect the mechanisms by which semantic networks grow. We describe a simple model for semantic growth, in which each new word or concept is connected to an existing network by differentiating the connectivity pattern of an existing node. This model generates appropriate small-world statistics and power-law connectivity distributions, and it also suggests one possible mechanistic basis for the effects of learning history variables (age of acquisition, usage frequency) on behavioral performance in semantic processing tasks.},
author = {Steyvers, Mark and Tenenbaum, Joshua B},
doi = {10.1207/s15516709cog2901\_3},
issn = {0364-0213},
journal = {Cognitive science},
keywords = {SML-LIB-BIBLIO,growing network models,lang:ENG,semantic networks,semantic representation,small worlds},
mendeley-tags = {SML-LIB-BIBLIO,lang:ENG},
month = jan,
number = {1},
pages = {41--78},
pmid = {21702767},
title = {{The large-scale structure of semantic networks: statistical analyses and a model of semantic growth.}},
url = {http://www.ncbi.nlm.nih.gov/pubmed/21702767},
volume = {29},
year = {2005}
}
Powered by bibtexbrowser