Graph Motifs¶

Formula¶

\[ Z_i = \frac{N_i^{\mathrm{obs}} - \mu_i^{\mathrm{null}}}{\sigma_i^{\mathrm{null}}} \]

Parameters¶

\(N_i^{\mathrm{obs}}\): observed count of motif \(i\) in the graph
\(\mu_i^{\mathrm{null}}\): mean motif count under a chosen null model
\(\sigma_i^{\mathrm{null}}\): standard deviation under the null model
\(Z_i\): motif over/under-representation score

What it means¶

Graph motifs are small subgraph patterns (for example triangles, feed-forward loops, or wedges) that appear significantly more or less often than expected under a null model.

They are used as building blocks for characterizing local network structure.

What it's used for¶

Comparing structural patterns across networks.
Identifying functionally meaningful local wiring patterns.
Building motif-based signatures and significance profiles.

Key properties¶

Motifs are defined relative to a subgraph size (for example 3-node or 4-node motifs).
Significance depends on the null model, not just raw counts.
Directed and undirected motif sets differ.

Common gotchas¶

Raw motif counts are hard to compare across graphs of different sizes/densities.
Different null models can produce different significance conclusions.
Motif enumeration can become expensive for larger subgraph sizes.

Example¶

In a directed network, a feed-forward loop may occur much more often than in degree-preserving random graphs, giving it a large positive motif z-score.

How to Compute (Pseudocode)¶

Input: graph G, motif family, null-model generator, number of null samples R
Output: motif counts and z-scores Z_i

count motif occurrences in G to get N_obs[i]
for r from 1 to R:
  G_r <- sample a null graph under the chosen null model
  count motifs in G_r to get N_r[i]
compute null means mu_i and standard deviations sigma_i from {N_r[i]}
for each motif i:
  Z_i <- (N_obs[i] - mu_i) / sigma_i

return motif counts and z-scores

Complexity¶

Time: Dominated by motif counting plus repeated null-model sampling/counting; exact cost depends strongly on motif size, graph size/density, and counting algorithm
Space: Depends on graph representation and whether null samples/count tables are stored simultaneously; often \(O(|V|+|E|)\) plus motif-count summaries
Assumptions: \(R\) null samples; directed vs undirected motifs and exact vs sampled motif counting change complexity substantially