Watts and Strogatz model

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

Lua error in package.lua at line 80: module 'strict' not found.

File:Watts-Strogatz small-world model 100nodes.png
Watts-Strogatz small-world model generated by igraph and visualized by Cytoscape 2.5. 100 nodes.

The Watts–Strogatz model is a random graph generation model that produces graphs with small-world properties, including short average path lengths and high clustering. It was proposed by Duncan J. Watts and Steven Strogatz in their joint 1998 Nature paper.[1] The model also became known as the (Watts) beta model after Watts used \beta to formulate it in his popular science book Six Degrees.

Rationale for the model

The formal study of random graphs dates back to the work of Paul Erdős and Alfréd Rényi.[2] The graphs they considered, now known as the classical or Erdős–Rényi (ER) graphs, offer a simple and powerful model with many applications.

However the ER graphs do not have two important properties observed in many real-world networks:

  1. They do not generate local clustering and triadic closures. Instead because they have a constant, random, and independent probability of two nodes being connected, ER graphs have a low clustering coefficient.
  2. They do not account for the formation of hubs. Formally, the degree distribution of ER graphs converges to a Poisson distribution, rather than a power law observed in many real-world, scale-free networks.[3]

The Watts and Strogatz model was designed as the simplest possible model that addresses the first of the two limitations. It accounts for clustering while retaining the short average path lengths of the ER model. It does so by interpolating between an ER graph and a regular ring lattice. Consequently, the model is able to at least partially explain the "small-world" phenomena in a variety of networks, such as the power grid, neural network of C. elegans, and a network of movie actors. In 2015, researchers from the California Institute of Technology and Princeton University found that the Watts and Strogatz model explains fat-metabolism communication models in budding yeast.[4]

Algorithm

File:Watts strogatz.svg
Watts–Strogatz graph

Given the desired number of nodes N, the mean degree K (assumed to be an even integer), and a special parameter \beta, satisfying 0 \le \beta \le 1 and N\gg K \gg \ln(N)\gg 1, the model constructs an undirected graph with N nodes and \frac{NK}{2} edges in the following way:

  1. Construct a regular ring lattice, a graph with N nodes each connected to K neighbors, K/2 on each side. That is, if the nodes are labeled n_0 \ldots n_{N-1}, there is an edge (n_i, n_j) if and only if  0 < |i - j|\mod {(}N-1-{\frac{K}{2}}{)} \leq \frac{K}{2}.
  2. For every node n_i=n_0,\dots, n_{N-1} take every edge (n_i, n_j) with i < j, and rewire it with probability \beta. Rewiring is done by replacing (n_i, n_j) with (n_i, n_k) where k is chosen with uniform probability from all possible values that avoid self-loops (k \ne i) and link duplication (there is no edge (n_i, n_{k'}) with k' = k at this point in the algorithm).

Properties

The underlying lattice structure of the model produces a locally clustered network, and the random links dramatically reduce the average path lengths. The algorithm introduces about \beta\frac{NK}{2} non-lattice edges. Varying \beta makes it possible to interpolate between a regular lattice (\beta=0) and a random graph (\beta=1) approaching the Erdős–Rényi random graph G(n, p) with n=N and p = \frac{NK}{2{N \choose 2}}.

The three properties of interest are the average path length, the clustering coefficient, and the degree distribution.

Average path length

For a ring lattice the average path length is l(0)=N/2K\gg 1 and scales linearly with the system size. In the limiting case of \beta \rightarrow 1 the graph converges to a classical random graph with l(1)=\frac{\ln{N}}{\ln{K}}. However, in the intermediate region 0<\beta<1 the average path length falls very rapidly with increasing \beta, quickly approaching its limiting value.

Clustering coefficient

For the ring lattice the clustering coefficient[5] C(0)=\frac{3(K-2)}{4(K-1)}, and so tends to 3/4 as K grows, independently of the system size.[6] In the limiting case of \beta \rightarrow 1 the clustering coefficient attains the value for classical random graphs, C(1)=K/N and is thus inversely proportional to the system size. In the intermediate region the clustering coefficient remains quite close to its value for the regular lattice, and only falls at relatively high \beta. This results in a region where the average path length falls rapidly, but the clustering coefficient does not, explaining the "small-world" phenomenon.

If we use the Barrat and Weigt[6] measure for clustering C'(\beta) defined as the fraction between the average number of edges between the neighbors of a node and the average number of possible edges between these neighbors, or, alternatively,
C'(\beta)\equiv\frac{3\times \mbox{number of triangles}}{\mbox{number of connected triples}}
then we get  C'(\beta)\sim C(0)\left(1-\beta\right)^3.

Degree distribution

The degree distribution in the case of the ring lattice is just a Dirac delta function centered at K. In the limiting case of \beta \rightarrow 1 it is Poisson distribution, as with classical graphs. The degree distribution for 0<\beta<1 can be written as,[6]

P(k) = \sum_{n=0}^{f\left(k,K\right)} C^n_{K/2} \left(1-\beta\right)^{n} \beta^{K/2-n} \frac{(\beta K/2)^{k-K/2-n}}{\left(k-K/2-n\right)!} e^{-\beta K/2}

where k_i is the number of edges that the i^{th} node has or its degree. Here k\geq K/2, and f(k,K)=\min(k-K/2,K/2). The shape of the degree distribution is similar to that of a random graph and has a pronounced peak at k=K and decays exponentially for large |k-K|. The topology of the network is relatively homogeneous, and all nodes have more or less the same degree.

Limitations

The major limitation of the model is that it produces an unrealistic degree distribution. In contrast, real networks are often scale-free networks inhomogeneous in degree, having hubs and a scale-free degree distribution. Such networks are better described in that respect by the preferential attachment family of models, such as the Barabási–Albert (BA) model. (On the other hand, the Barabási–Albert model fails to produce the high levels of clustering seen in real networks, a shortcoming not shared by the Watts and Strogatz model. Thus, neither the Watts and Strogatz model nor the Barabási–Albert model should be viewed as fully realistic.)

The Watts and Strogatz model also implies a fixed number of nodes and thus cannot be used to model network growth.

See also

References

  1. Lua error in package.lua at line 80: module 'strict' not found.
  2. Lua error in package.lua at line 80: module 'strict' not found.
  3. Lua error in package.lua at line 80: module 'strict' not found.
  4. http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004264
  5. Lua error in package.lua at line 80: module 'strict' not found.
  6. 6.0 6.1 6.2 Lua error in package.lua at line 80: module 'strict' not found.