Appendix B: Causal Orderings and Adjacency Matrices

This appendix provides mathematical background on graph-theoretic concepts that are fundamental to causal modeling, particularly focusing on causal orderings and their representation through adjacency matrices.

B.1 Basic Graph Theory

Directed Acyclic Graphs (DAGs)

A directed acyclic graph G = (V, E) consists of:

A set of vertices (nodes) V = {X₁, X₂, ..., X_d}
A set of directed edges E ⊆ V × V

The graph is acyclic if it contains no directed cycles.

Causal Orderings

Definition B.1 (Topological ordering) A topological ordering of a DAG G is a linear ordering of its vertices such that for every directed edge (X_i, X_j), vertex X_i comes before X_j in the ordering.

Definition B.2 (Causal ordering) A causal ordering is a topological ordering that reflects the causal relationships: if X_i causally precedes X_j, then X_i appears before X_j in the ordering.

Properties of Causal Orderings

Existence: Every DAG has at least one topological ordering.
Non-uniqueness: A DAG may have multiple valid topological orderings.
Causal interpretation: In causal DAGs, a valid causal ordering ensures that causes precede their effects.

B.2 Adjacency Matrix Representation

Definition

For a directed graph G with vertices {X₁, ..., X_d}, the adjacency matrix A is a d × d binary matrix where:

$$ A_{ij} = \begin{cases} 1 & \text{if there is an edge } X_i \to X_j \ 0 & \text{otherwise} \end{cases} $$

Properties

Acyclicity condition: A directed graph is acyclic if and only if its adjacency matrix can be permuted to upper triangular form.
Causal relationships: A_{ij} = 1 indicates that X_i is a direct cause of X_j.
Path relationships: The (i,j)-th entry of A^k counts the number of directed paths of length k from X_i to X_j.

B.3 Algorithms for Causal Ordering

Kahn's Algorithm

A standard algorithm for finding topological orderings:

Start with all vertices that have no incoming edges
Remove a vertex with no incoming edges and add it to the ordering
Remove all outgoing edges from this vertex
Repeat until all vertices are ordered

Application to Causal Discovery

In causal structure learning:

Constraint-based methods use conditional independence tests to determine edge orientations
Score-based methods search over possible orderings and structures
Hybrid methods combine both approaches

B.4 Markov Equivalence

Definition

Two DAGs are Markov equivalent if they entail the same set of conditional independence relationships.

Completed Partially Directed Acyclic Graph (CPDAG)

A CPDAG represents an equivalence class of Markov equivalent DAGs by:

Using directed edges for orientations that are the same in all equivalent DAGs
Using undirected edges for orientations that differ across equivalent DAGs

Implications for Causal Learning

Identifiability limitations: Some causal relationships cannot be determined from observational data alone
Intervention design: Strategic interventions can help orient undirected edges
Equivalence class representation: CPDAGs provide a compact representation of the set of possible causal structures

B.5 Special Cases and Extensions

Trees and Forests

Tree: Connected acyclic undirected graph
Forest: Collection of disjoint trees
Directed tree: DAG where each node has at most one parent

Trees have unique causal orderings up to the choice of root node.

Partially Directed Graphs

Mixed graphs: Contain both directed and undirected edges
Chain graphs: Sequence of chain components connected by directed edges
Ancestral graphs: Allow bidirected edges representing confounding

Cyclic Graphs

While structural causal models typically assume acyclicity:

Cyclic SCMs: Can represent feedback loops but require special treatment
Equilibrium analysis: Focus on steady-state solutions
Dynamic models: Unfold cycles over time

B.6 Computational Complexity

Counting Problems

Counting topological orderings: #P-complete for general DAGs
Counting Markov equivalent DAGs: Exponential in worst case
Structure learning complexity: Generally NP-hard for exact methods

Approximation Approaches

Greedy algorithms: Polynomial-time approximations for structure learning
MCMC methods: Sample from the space of possible structures
Constraint propagation: Efficiently enforce consistency conditions

B.7 Connections to Linear Algebra

Matrix Properties

For adjacency matrix A of a DAG:

Nilpotency: A^d = 0 for some finite d
Spectral properties: All eigenvalues have zero real part
Determinant: det(I - A) ≠ 0

Structural Equation Perspective

The linear SEM Y = BY + ε corresponds to:

(I - B)Y = ε
Solution: Y = (I - B)⁻¹ε when I - B is invertible
Causal interpretation: B represents direct causal effects

Path Analysis

Wright's path analysis uses the adjacency matrix to:

Compute path coefficients: Products along directed paths
Calculate correlations: Sum over all connecting paths
Decompose effects: Separate direct and indirect causal effects

This mathematical framework provides the foundation for understanding how causal structures can be represented, manipulated, and learned from data.