Main Page > Articles > Correlation Factor > Granger Causality vs. Correlation: A Practical Guide for Discretionary Traders

Granger Causality vs. Correlation: A Practical Guide for Discretionary Traders

From TradingHabits, the trading encyclopedia · 7 min read · February 28, 2026
The Black Book of Day Trading Strategies
Free Book

The Black Book of Day Trading Strategies

1,000 complete strategies · 31 chapters · Full trade plans

Experienced traders understand that identifying relationships between assets is fundamental to profitable strategies. For decades, correlation has been the primary tool for this purpose, measuring the degree to which two assets move in tandem. However, correlation is a symmetric, non-causal measure; it tells us nothing about whether one asset's price movement precedes and influences another. A high correlation between crude oil prices and an airline's stock price, for instance, does not reveal if oil price changes are a predictive signal for the stock. This is where the concept of Granger causality provides a more nuanced and potentially more effective tool for the professional trader.

Developed by Nobel laureate Clive Granger, Granger causality is a statistical hypothesis test for determining whether one time series is useful in forecasting another. A time series X is said to Granger-cause Y if it can be shown, through a series of F-tests on lagged values of X, that those lagged values provide statistically significant information about future values of Y. This is a important distinction from correlation: it introduces directionality and a temporal, predictive element that simple correlation lacks. It moves the analysis from "do these assets move together?" to "does the past of asset X help predict the future of asset Y?".

The Granger Causality Framework

The mathematical foundation of Granger causality rests on vector autoregression (VAR) models. In simple terms, to test if X Granger-causes Y, we perform two regressions. First, we model Y using only its own past (lagged) values. This is the restricted model. Second, we model Y using both its own past values and the past values of X. This is the unrestricted model.

We then use an F-test to compare the residual sum of squares (RSS) of the two models. If the unrestricted model, which includes X's past values, has a significantly smaller RSS than the restricted model, we can reject the null hypothesis that X does not Granger-cause Y. The F-statistic is calculated as:

F = ((RSS_restricted - RSS_unrestricted) / p) / (RSS_unrestricted / (n - 2p - 1))

Where 'p' is the number of lags and 'n' is the number of observations. A p-value below a chosen significance level (e.g., 0.05) indicates evidence of Granger causality.

A Practical Example: Oil and Airline Stocks

Consider the relationship between the price of West Texas Intermediate (WTI) crude oil and the stock price of a major carrier like American Airlines (AAL). A simple correlation analysis might show a strong negative correlation, which is intuitive—higher fuel costs hurt airline profitability. A discretionary trader might use this to short AAL when oil prices rise.

However, a Granger causality test could provide a more refined signal. Let's hypothesize we run the test and find that WTI Granger-causes AAL, but AAL does not Granger-cause WTI. This directional insight is invaluable. It suggests that past changes in oil prices have predictive power for AAL's stock price, but not the other way around. The information flow is primarily from oil to the airline. This allows a trader to build a more robust strategy, focusing on oil price movements as a leading indicator for AAL, rather than treating the relationship as symmetric.

Implementation and Testing

Fortunately, traders do not need to perform these calculations by hand. Statistical software packages and programming libraries have made Granger causality testing accessible. In Python, the statsmodels library is the standard tool. The statsmodels.tsa.stattools.grangercausalitytests function takes a two-dimensional array containing the two time series and the maximum number of lags to test.

Before running the test, it is important to ensure the time series are stationary. Non-stationary data, which have trends or other time-dependent structures, can produce spurious results. Traders must first test for and correct for non-stationarity using techniques like differencing. The Augmented Dickey-Fuller (ADF) test is a common method for checking stationarity.

Limitations and Important Considerations

While effective, Granger causality is not a magic bullet for uncovering true economic causality. Its primary limitation is the "omitted variable bias." A third, unobserved factor could be driving both time series, creating a statistical artifact that appears as a causal link. For example, global economic growth expectations could be driving both oil demand (and price) and travel demand (and airline stock prices). In this case, oil is not directly causing the stock's movement; both are reacting to a common, external factor.

Furthermore, Granger causality only captures linear relationships. Complex, non-linear dynamics between assets will not be detected. It is also sensitive to the choice of the number of lags. Too few lags may miss the relationship, while too many can introduce noise. Therefore, results should always be interpreted with a healthy dose of skepticism and confirmed with sound economic reasoning and further analysis. For the discretionary trader, Granger causality is not a replacement for fundamental analysis but a significant enhancement to the quantitative toolkit, offering a way to move beyond simple correlations and identify potentially predictive, directional relationships in the markets.

Categories: Granger Causality