Watts–Strogatz model

The Watts–Strogatz model is a random graph generation model that produces graphs with small-world properties, including short average path lengths and high clustering. It was proposed by Duncan J. Watts and Steven Strogatz in their article published in 1998 in the Nature scientific journal.[1] The model also became known as the (Watts) beta model after Watts used to formulate it in his popular science book Six Degrees.

Watts–Strogatz small-world model
Watts–Strogatz small-world model generated by igraph and visualized by Cytoscape 2.5. 100 nodes.

Rationale for the model

edit

The formal study of random graphs dates back to the work of Paul Erdős and Alfréd Rényi.[2] The graphs they considered, now known as the classical or Erdős–Rényi (ER) graphs, offer a simple and powerful model with many applications.

However the ER graphs do not have two important properties observed in many real-world networks:

  1. They do not generate local clustering and triadic closures. Instead, because they have a constant, random, and independent probability of two nodes being connected, ER graphs have a low clustering coefficient.
  2. They do not account for the formation of hubs. Formally, the degree distribution of ER graphs converges to a Poisson distribution, rather than a power law observed in many real-world, scale-free networks.[3]

The Watts and Strogatz model was designed as the simplest possible model that addresses the first of the two limitations. It accounts for clustering while retaining the short average path lengths of the ER model. It does so by interpolating between a randomized structure close to ER graphs and a regular ring lattice. Consequently, the model is able to at least partially explain the "small-world" phenomena in a variety of networks, such as the power grid, neural network of C. elegans, networks of movie actors, or fat-metabolism communication in budding yeast.[4]

Algorithm

edit
 
Watts–Strogatz graph

Given the desired number of nodes  , the mean degree   (assumed to be an even integer), and a parameter  , all satisfying   and  , the model constructs an undirected graph with   nodes and   edges in the following way:

  1. Construct a regular ring lattice, a graph with   nodes each connected to   neighbors,   on each side. That is, if the nodes are labeled  , there is an edge   if and only if  
  2. For every node   take every edge connecting   to its   rightmost neighbors, that is every edge   such that  , and rewire it with probability  . Rewiring is done by replacing   with   where   is chosen uniformly at random from all possible nodes while avoiding self-loops ( ) and link duplication (there is no edge   with   at this point in the algorithm).

Properties

edit

The underlying lattice structure of the model produces a locally clustered network, while the randomly rewired links dramatically reduce the average path lengths. The algorithm introduces about   of such non-lattice edges. Varying   makes it possible to interpolate between a regular lattice ( ) and a structure close to an Erdős–Rényi random graph   with   at  . It does not approach the actual ER model since every node will be connected to at least   other nodes.

The three properties of interest are the average path length, the clustering coefficient, and the degree distribution.

Average path length

edit

For a ring lattice, the average path length[1] is   and scales linearly with the system size. In the limiting case of  , the graph approaches a random graph with  , while not actually converging to it. In the intermediate region  , the average path length falls very rapidly with increasing  , quickly approaching its limiting value.

Clustering coefficient

edit

For the ring lattice the clustering coefficient[5]  , and so tends to   as   grows, independently of the system size.[6] In the limiting case of   the clustering coefficient is of the same order as the clustering coefficient for classical random graphs,   and is thus inversely proportional to the system size. In the intermediate region the clustering coefficient remains quite close to its value for the regular lattice, and only falls at relatively high  . This results in a region where the average path length falls rapidly, but the clustering coefficient does not, explaining the "small-world" phenomenon.

If we use the Barrat and Weigt[6] measure for clustering   defined as the fraction between the average number of edges between the neighbors of a node and the average number of possible edges between these neighbors, or, alternatively,
 
then we get  

Degree distribution

edit

The degree distribution in the case of the ring lattice is just a Dirac delta function centered at  . The degree distribution for a large number of nodes and   can be written as,[6]

 

where   is the number of edges that the   node has or its degree. Here  , and  . The shape of the degree distribution is similar to that of a random graph and has a pronounced peak at   and decays exponentially for large  . The topology of the network is relatively homogeneous, meaning that all nodes are of similar degree.

Limitations

edit

The major limitation of the model is that it produces an unrealistic degree distribution. In contrast, real networks are often scale-free networks inhomogeneous in degree, having hubs and a scale-free degree distribution. Such networks are better described in that respect by the preferential attachment family of models, such as the Barabási–Albert (BA) model. (On the other hand, the Barabási–Albert model fails to produce the high levels of clustering seen in real networks, a shortcoming not shared by the Watts and Strogatz model. Thus, neither the Watts and Strogatz model nor the Barabási–Albert model should be viewed as fully realistic.)

The Watts and Strogatz model also implies a fixed number of nodes and thus cannot be used to model network growth.

See also

edit

References

edit
  1. ^ a b Watts, D. J.; Strogatz, S. H. (1998). "Collective dynamics of 'small-world' networks" (PDF). Nature. 393 (6684): 440–442. Bibcode:1998Natur.393..440W. doi:10.1038/30918. PMID 9623998. S2CID 4429113. Archived (PDF) from the original on 2020-10-26. Retrieved 2018-05-18.
  2. ^ Erdos, P. (1960). "Publications Mathematicae 6, 290 (1959); P. Erdos, A. Renyi". Publ. Math. Inst. Hung. Acad. Sci. 5: 17.
  3. ^ Ravasz, E. (30 August 2002). "Hierarchical Organization of Modularity in Metabolic Networks". Science. 297 (5586): 1551–1555. arXiv:cond-mat/0209244. Bibcode:2002Sci...297.1551R. doi:10.1126/science.1073374. PMID 12202830. S2CID 14452443.
  4. ^ Al-Anzi, Bader; Arpp, Patrick; Gerges, Sherif; Ormerod, Christopher; Olsman, Noah; Zinn, Kai (2015). "Experimental and Computational Analysis of a Large Protein Network That Controls Fat Storage Reveals the Design Principles of a Signaling Network". PLOS Computational Biology. 11 (5): e1004264. Bibcode:2015PLSCB..11E4264A. doi:10.1371/journal.pcbi.1004264. PMC 4447291. PMID 26020510.
  5. ^ Albert, R., Barabási, A.-L. (2002). "Statistical mechanics of complex networks". Reviews of Modern Physics. 74 (1): 47–97. arXiv:cond-mat/0106096. Bibcode:2002RvMP...74...47A. doi:10.1103/RevModPhys.74.47. S2CID 60545.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  6. ^ a b c Barrat, A.; Weigt, M. (2000). "On the properties of small-world network models". European Physical Journal B. 13 (3): 547–560. arXiv:cond-mat/9903411. doi:10.1007/s100510050067. S2CID 13483229.
  NODES
Note 1