While standard Vector Autoregression (VAR) models excel at capturing temporal dependencies and forecasting multiple time series simultaneously, and Granger causality tests assess predictive influence, they often fall short of identifying structural causal relationships. As discussed in the previous section, the reduced-form errors (ut) in a VAR model, Yt=c+∑i=1pAiYt−i+ut, are typically contemporaneously correlated (E[utut′]=Σu, where Σu is non-diagonal). This correlation obscures the immediate, structural impact one variable has on another within the same time period. Attributing an observed correlation in errors to a specific causal link requires imposing additional structure.
Structural Vector Autoregression (SVAR) extends the VAR framework precisely to address this challenge. It allows us to move beyond mere forecasting or predictive causality to estimate the contemporaneous causal effects among variables, by leveraging theoretical assumptions or domain knowledge about the underlying data generating process.
The core idea of SVAR is to posit a structural model related to the reduced-form VAR. Consider a system with k endogenous variables represented by the vector Yt. The SVAR model can be written as:
B0Yt=k+i=1∑pBiYt−i+ϵtHere:
We can relate the SVAR to the standard VAR by multiplying by B0−1 (assuming B0 is invertible):
Yt=B0−1k+i=1∑p(B0−1Bi)Yt−i+B0−1ϵtComparing this to the reduced-form VAR, Yt=c+∑i=1pAiYt−i+ut, we see the mapping:
The covariance matrix of the reduced-form residuals Σu is related to the structural shocks ϵt and the contemporaneous effects matrix B0:
Σu=E[utut′]=E[(B0−1ϵt)(ϵt′(B0−1)′)]=B0−1E[ϵtϵt′](B0−1)′=B0−1Σϵ(B0−1)′If we assume Σϵ=I (uncorrelated structural shocks with unit variance), then:
Σu=B0−1(B0−1)′=(B0′B0)−1or equivalently,
B0′B0=Σu−1The standard VAR estimation yields estimates for Ai and Σu. The challenge in SVAR is to recover the structural parameters, particularly the contemporaneous effects matrix B0 and the properties of the structural shocks ϵt (often just their variances if assumed diagonal), from the estimated Σu.
The equation B0′B0=Σu−1 (assuming Σϵ=I) involves k2 unknown elements in B0. However, since Σu−1 is symmetric, this equation only provides k(k+1)/2 unique restrictions. To uniquely determine the k2 elements of B0, we need an additional k2−k(k+1)/2=k(k−1)/2 restrictions. These restrictions must come from outside the model, typically based on economic theory, domain knowledge, or specific assumptions about the causal structure. This is known as the identification problem in SVAR.
Several strategies exist to impose the necessary restrictions to identify B0:
Short-Run Restrictions (Recursive Ordering): This is perhaps the most common approach, pioneered by Christopher Sims. It assumes a recursive structure for the contemporaneous effects. Specifically, it imposes zero restrictions on certain elements of the B0 matrix, making it lower (or upper) triangular. For a lower triangular B0, the assumption is that the first variable in the system (Y1t) is only affected contemporaneously by its own shock (ϵ1t), the second variable (Y2t) is affected by ϵ1t and ϵ2t, and so on. The last variable (Ykt) can be affected by all structural shocks contemporaneously.
A recursive (lower triangular B0) structure for k=3. Variable Y1 only responds contemporaneously to shock ϵ1. Y2 responds to ϵ1,ϵ2 and Y1. Y3 responds to all shocks and variables. Dashed lines indicate potential non-zero off-diagonal elements in B0.
Long-Run Restrictions (Blanchard-Quah): Instead of restricting immediate effects (B0), this approach imposes restrictions on the long-run effects of certain shocks. The long-run impact matrix is given by ∑i=0∞Ci, where Ci are the moving average (MA) coefficient matrices representing the impulse responses. Restrictions are placed such that certain shocks (e.g., supply shocks) have a permanent effect on some variables (e.g., output), while others (e.g., demand shocks) only have temporary effects. This requires assumptions about the long-run behavior of the system.
Sign Restrictions: This method uses qualitative theoretical knowledge. Instead of setting specific coefficients in B0 or the long-run matrix to zero, it restricts the sign of the impulse responses of certain variables to specific shocks over a given horizon (e.g., a positive monetary policy shock should decrease inflation and output for some periods). This approach doesn't yield a unique B0 but rather a set of possible models consistent with the sign restrictions. Inference is often done by analyzing the distribution of results across this set (e.g., using median responses and confidence bands). Computationally more demanding as it involves searching or sampling over possible rotation matrices.
Proxy SVAR / External Instruments: This modern approach leverages external variables (proxies) that are correlated with a specific structural shock of interest but are uncorrelated with other structural shocks and do not directly affect the endogenous variables except through that specific shock. This is analogous to using instrumental variables (as discussed in Chapter 4) in a time-series context. Finding valid and strong proxies is the main challenge, but when available, this method can be very powerful for identifying specific shocks (e.g., using high-frequency financial data to proxy for monetary policy shocks).
Once the SVAR model is identified (i.e., B0 is estimated), the primary tools for causal interpretation are:
Structural Impulse Response Functions (IRFs): These trace out the dynamic effect of a one-unit (or typically, one-standard-deviation) structural shock (ϵj,t) on the future path of all endogenous variables (Yk,t+h for h=0,1,2,…). Unlike reduced-form IRFs, which show responses to correlated residuals ut, structural IRFs isolate the causal impact of distinct, uncorrelated underlying shocks. This allows for more meaningful causal narratives (e.g., "What is the effect of an unexpected 1% increase in the policy interest rate on inflation and unemployment over the next 24 months?").
Example impulse responses showing the dynamic effect of a one standard deviation structural shock originating in variable Y2 on both Y1 and Y2 over 10 periods.
Forecast Error Variance Decomposition (FEVD): FEVD decomposes the variance of the forecast error for each variable (Yk) at different horizons (h) into proportions attributable to each of the structural shocks (ϵj). It helps quantify the relative importance of different sources of variation or causal influences driving the fluctuations in each variable over time. For example, FEVD might show that 60% of the 12-month forecast error variance in inflation is due to supply shocks, 30% to demand shocks, and 10% to monetary policy shocks.
While powerful, SVAR analysis rests on several assumptions and faces limitations:
SVAR provides a framework for imposing causal structure onto multivariate time series models, enabling the estimation of contemporaneous effects and the analysis of dynamic causal impacts through structural shocks. Its identification relies heavily on external assumptions, making careful justification and sensitivity analysis essential components of any applied SVAR study. The insights gained, particularly through IRFs, can be valuable for understanding dynamic systems and informing policy or business decisions in contexts where temporal dynamics and feedback are present.
© 2025 ApX Machine Learning