Statistical Arbitrage#
Statistical arbitrage encompasses a broad family of market-neutral strategies that exploit temporary mispricings between related assets. The classic implementation is pairs trading, where two historically correlated assets are traded when their price spread diverges beyond a statistical threshold, with the expectation of mean reversion. More advanced implementations extend this to baskets of assets, factor residuals, and machine learning-driven signal extraction. Statistical arbitrage differs from simple mean reversion in that it explicitly exploits cross-asset relationships and aims for market neutrality.
This collection covers the full spectrum of statistical arbitrage research: the seminal distance-method papers that established the field, cointegration-based approaches using Engle-Granger and Johansen tests, deep learning frameworks that extract arbitrage signals from factor residuals, reinforcement learning for optimal execution timing, clustering methods for portfolio construction, and applications to cryptocurrency and international markets. The coverage spans from theoretical foundations to practical implementation details including transaction cost modeling and execution mechanics.
Related topics include Mean Reversion for single-asset reversion strategies, Order Flow for microstructure-driven signals used alongside stat arb, and Cryptocurrency for digital asset applications.
Pairs Trading: Performance of a Relative-Value Arbitrage Rule#
The paper that established the distance method for pairs trading on U.S. equities. Using data from 1962 to 2002, the authors show that pairs selected by minimum distance between normalized price series deliver approximately 11% annualized excess returns with a Sharpe ratio around 1.5 and near-zero market beta. This is the most-cited empirical foundation for statistical arbitrage and remains a standard benchmark for all subsequent methods.
By Evan Gatev, William N. Goetzmann, and K. Geert Rouwenhorst.
The methodology operates in two stages: a 12-month formation period matching all liquid stocks by minimizing the sum of squared deviations between normalized cumulative total return indices, followed by a 6-month trading period where positions open when prices diverge by more than two historical standard deviations and close upon convergence. The top 5 pairs portfolio earns an average 6-month excess return of 5.98% on committed capital. Pairs portfolio returns have low volatility (4-6% annualized), are uncorrelated with the S&P 500, and exhibit high Sharpe ratios. Fama-French three-factor regressions show intercepts remain significantly positive, indicating returns are not explained by standard risk factors.
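The two-stage procedure can be sketched in a few lines. This is a minimal illustration rather than the authors' code: the function names, the toy prices, and the rule of closing when the spread crosses zero are assumptions made for the example.

```python
from itertools import combinations
from statistics import pstdev

def normalize(prices):
    """Rescale a price series so it starts at 1.0 (a cumulative return index)."""
    return [p / prices[0] for p in prices]

def spread(a, b):
    """Pointwise difference between two normalized series."""
    return [x - y for x, y in zip(normalize(a), normalize(b))]

def ssd(a, b):
    """Sum of squared deviations between two normalized series."""
    return sum(d * d for d in spread(a, b))

def select_pairs(formation, top_n):
    """Rank all pairs by SSD over the formation window; keep the closest top_n."""
    ranked = sorted(combinations(sorted(formation), 2),
                    key=lambda pair: ssd(formation[pair[0]], formation[pair[1]]))
    return ranked[:top_n]

def formation_sigma(a, b):
    """Standard deviation of the formation-period spread (the entry yardstick)."""
    return pstdev(spread(a, b))

def trade_signals(trading_spread, sigma, k=2.0):
    """Open when |spread| > k * sigma, close when the spread crosses zero.

    Returns one position per bar: +1 long the spread, -1 short, 0 flat.
    """
    position, out = 0, []
    for s in trading_spread:
        if position == 0:
            if s > k * sigma:
                position = -1   # first leg is rich: short it, long the second
            elif s < -k * sigma:
                position = 1
        elif (position == -1 and s <= 0) or (position == 1 and s >= 0):
            position = 0        # prices have converged
        out.append(position)
    return out

# Toy formation-period prices (hypothetical data).
formation = {
    "A": [10.0, 11.0, 12.0, 11.0, 10.0],
    "B": [20.0, 22.0, 24.0, 22.0, 20.0],   # same path as A after normalization
    "C": [5.0, 5.0, 9.0, 4.0, 5.0],
}
best = select_pairs(formation, top_n=1)
positions = trade_signals([0.0, 0.1, 0.3, 0.2, -0.01, 0.0], sigma=0.1)
```

Here `best` holds the single closest pair and `positions` shows one short-spread trade that opens on the two-sigma breach and closes on convergence.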
Statistical Arbitrage in the U.S. Equities Market#
The foundational paper on modern PCA-based statistical arbitrage. The authors study model-driven statistical arbitrage strategies in U.S. equities, generating trading signals using Principal Component Analysis and sector ETFs. Residual (idiosyncratic) stock returns are modeled as a mean-reverting process, leading naturally to contrarian trading signals. Back-testing over 1997-2007 shows PCA-based strategies generally outperform ETF-based strategies, yielding Sharpe ratios of 1.1-1.5 after transaction costs. This paper established the framework that most subsequent stat arb research builds upon.
By Marco Avellaneda and Jeong-Hyun Lee (NYU).
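To make the mean-reverting residual idea concrete, the sketch below fits a discrete AR(1) model to the cumulative residual and converts the latest value into a standardized distance from equilibrium. It is a simplified illustration under stated assumptions: the helper names, the simulated residuals, and the use of the last cumulative value in the numerator are choices made for this example, not a reproduction of the paper's estimation procedure.

```python
import math
import random

def fit_ar1(x):
    """OLS fit of x[t+1] = a + b * x[t] + noise; returns (a, b, residual variance)."""
    x0, x1 = x[:-1], x[1:]
    n = len(x0)
    mean0, mean1 = sum(x0) / n, sum(x1) / n
    cov = sum((u - mean0) * (v - mean1) for u, v in zip(x0, x1))
    var = sum((u - mean0) ** 2 for u in x0)
    b = cov / var
    a = mean1 - b * mean0
    resid_var = sum((v - a - b * u) ** 2 for u, v in zip(x0, x1)) / (n - 2)
    return a, b, resid_var

def s_score(residual_returns):
    """Standardized distance of the cumulative residual from its equilibrium level."""
    x, total = [], 0.0
    for r in residual_returns:
        total += r
        x.append(total)
    a, b, resid_var = fit_ar1(x)
    if not 0.0 < b < 1.0:
        return None                       # no mean reversion detected
    m = a / (1.0 - b)                     # equilibrium level
    sigma_eq = math.sqrt(resid_var / (1.0 - b * b))
    return (x[-1] - m) / sigma_eq

# Simulated idiosyncratic residuals (hypothetical data, not market data).
random.seed(0)
level, levels = 0.0, []
for _ in range(500):
    level = 0.9 * level + random.gauss(0.0, 0.01)
    levels.append(level)
residual_returns = [levels[0]] + [levels[t] - levels[t - 1] for t in range(1, len(levels))]
score = s_score(residual_returns)
```

A large positive `score` suggests the stock is rich relative to its factor exposure (a contrarian short), and a large negative one the opposite; the thresholds to act on are left to the reader.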
Review of Statistical Arbitrage, Cointegration, and Multivariate Ornstein-Uhlenbeck#
A classic tutorial introducing the multivariate Ornstein-Uhlenbeck process and its relationship to cointegration and statistical arbitrage. The paper provides a geometrical interpretation of the dynamics, illustrates applications to swap contract strategies, and includes fully documented MATLAB code. An essential reference for understanding the mathematical foundations of cointegration-based stat arb.
By Attilio Meucci (ARPM).
Statistical Arbitrage Pairs Trading Strategies: Review and Outlook#
A comprehensive survey covering more than 90 papers across five categories: distance, cointegration, time-series, stochastic control, and other approaches. The review concludes that cointegration and stochastic control methods often excel in out-of-sample tests. An essential starting point for anyone entering the field, providing a structured map of the entire statistical arbitrage literature up to 2017.
By Christopher Krauss.
A Survey of Statistical Arbitrage Pairs Trading: Non-ML Methods, 2016-2023#
An updated survey reviewing over 100 non-ML statistical arbitrage papers published between 2016 and 2023, categorizing them into distance, cointegration, stochastic control, and other approaches. The review highlights strengths like simplicity (distance methods) versus adaptability (stochastic control), while noting weaknesses such as spurious pairs or parameter sensitivity.
By Sun (University of Warsaw Working Paper 2025-19).
Deep Learning Statistical Arbitrage#
A comprehensive framework for statistical arbitrage using deep learning. The authors construct arbitrage portfolios of similar assets as residual portfolios from conditional latent asset pricing factors, extract time series signals with a CNN+Transformer architecture, and form an optimal trading policy that maximizes risk-adjusted returns under constraints. On top-500 U.S. stocks over 19 years, the method achieves approximately 20% annualized returns with volatility below 6%, Sharpe ratios up to 4 (gross), and near-zero Fama-French exposure, robust to transaction costs. Published in Management Science (2025).
By Jorge Guijarro-Ordonez (BlackRock), Markus Pelger, and Greg Zanotti (Stanford).
Mentioned by Ivan Blanco in this discussion.
Attention Factors for Statistical Arbitrage#
A framework that jointly identifies similar assets through factors, identifies mispricing, and forms a trading policy maximizing risk-adjusted performance after trading costs. The Attention Factors are conditional latent factors learned from firm characteristic embeddings that allow for complex interactions. Time-series signals are extracted from residual portfolios using a general sequence model. On top-500 U.S. stocks (1998-2021), the model achieves gross Sharpe ratios of 3.97 to above 4, net Sharpe ratios of 2.28-2.3 after 5bp costs, annualized returns of 9.5-16%, and low market beta.
By Elliot L. Epstein, Rose Wang, Jaewon Choi, and Markus Pelger.
Mentioned by @quantbeckman in this discussion.
A Deep Learning Approach for Trading Factor Residuals#
A replication and extension of the Deep Learning Statistical Arbitrage methodology, applied to a more recent out-of-sample period from 2016 to 2024. The authors adhere strictly to point-in-time principles and ensure no information leakage, following the same data pre-processing, factor modeling, and deep learning architectures (CNN, LSTM, Transformer). The study examines whether the DLSA framework’s strong historical performance persists in more recent market conditions.
By Wo Long and Victor Xiao.
End-to-End Policy Learning of a Statistical Arbitrage Autoencoder Architecture#
This paper studies Autoencoder architectures as a generalization of classical StatArb, first using a standard Autoencoder trained on U.S. stock returns to derive strategies based on the Ornstein-Uhlenbeck process, then introducing a novel end-to-end architecture that jointly learns the latent factor structure and the trading policy. On CRSP daily U.S. equities (2000-2022), the end-to-end AE Policy achieves a best Sharpe ratio of 1.81 with 6.24% mean return and 3.46% volatility (pre-cost).
By Fabian Krause and Jan-Peter Calliess.
Advanced Statistical Arbitrage with Reinforcement Learning#
An innovative model-free reinforcement learning framework for statistical arbitrage. For spread construction, the authors establish an empirical reversion time metric and optimize asset coefficients by minimizing mean reversion time. In the trading phase, a reinforcement learning framework identifies optimal mean reversion trading strategies. On 10 S&P 500 pairs (2022-2023), the RL method achieves cumulative P&L of 37.76% on MSFT-GOOGL and 48.82% on OXY-XOM.
By Boming Ning and Kiseop Lee.
Finding Moving-Band Statistical Arbitrages via Convex-Concave Optimization#
A new method for finding statistical arbitrages that can contain more assets than just the traditional pair. The problem is formulated as seeking a portfolio with the highest volatility, subject to its price remaining in a band and a leverage limit. On CRSP US stocks (2010-2023), moving-band stat-arbs yield 70% profitable portfolios with average annualized return 15% and Sharpe 0.84.
By Kasper Johansson, Thomas Schmelzer, and Stephen Boyd (Stanford).
A Markowitz Approach to Managing a Dynamic Basket of Moving-Band Statistical Arbitrages#
Building on the moving-band statistical arbitrage framework, this paper considers the problem of managing a portfolio of MBSAs using Markowitz-inspired optimization. On CRSP US stocks (2010-2023) with realistic costs, the portfolio achieves 19% annualized return, 12% volatility, Sharpe ratio 1.61, maximum drawdown 15%, and very low S&P 500 correlation (15%).
By Kasper Johansson, Thomas Schmelzer, and Stephen Boyd (Stanford).
Optimal Entry and Exit with Signature in Statistical Arbitrage#
This paper explores optimal timing strategies for trading price spreads exhibiting mean-reverting characteristics using a sequential optimal stopping framework and a refined signature optimal stopping method. On simulated OU paths and real U.S. pairs (e.g., UAL-DAL 2022), it achieves higher cumulative returns (e.g., 42% vs. 9.6% baseline) and better Sharpe ratios with fewer but higher-quality trades.
By Boming Ning, Prakash Chakraborty, and Kiseop Lee.
On Statistical Arbitrage Under a Conditional Factor Model of Equity Returns#
A conditional factor model for a multivariate portfolio of U.S. equities in the context of statistical arbitrage trading. A state space framework underlies the factor model whereby asset returns are assumed to be a noisy observation of a linear combination of factor values and latent factor risk premia. Filter and state prediction estimates are retrieved online, and large deviations represent candidate mean reversion trades.
By Trent Spears, Stefan Zohren, and Stephen Roberts (Oxford).
Statistical Arbitrage in Rank Space#
Equity market dynamics are investigated in rank space, where stocks are indexed by their capitalization ranks rather than company names. Trading signals are built from PCA residuals using either a parametric OU model or a CNN-Transformer neural network. The rank-space neural network produced 206% gross annualized returns and a Sharpe ratio of 9.04 (2007-2022); net of 2bp costs, this fell to 35.68% returns and a Sharpe ratio of 3.28, still vastly outperforming name-space equivalents.
By Y.-F. Li and G. Papanicolaou (Stanford).
Mentioned by Vivek V Rao in this discussion.
Correlation Matrix Clustering for Statistical Arbitrage Portfolios#
A framework to construct statistical arbitrage portfolios with graph clustering algorithms. Various clustering methods partition the correlation matrix of market residual returns into clusters, then mean-reverting stat arb portfolios are constructed and evaluated within each cluster. Five clustering algorithms are explored, generating profitable trading strategies with over 10% annualized returns and statistically significant Sharpe ratios above one.
By Alvaro Cartea, Mihai Cucuringu, and Qi Jin (Oxford).
Statistical Arbitrage in Multi-Pair Trading Strategy Based on Graph Clustering Algorithms in US Equities Market#
A framework combining graph clustering (SPONGE on residuals) with quantitative and machine learning methods, including the Kelly criterion and an ensemble of ML classifiers. On S&P 500 constituents (2000-2022), the approach delivers 49% annualized returns and information ratio 1.30 out-of-sample after 5bp costs.
By Adam Korniejczuk and Robert Slepaczuk.
Diversified Statistical Arbitrage: Dynamically Combining Mean Reversion and Momentum Strategies#
A quantitative investment strategy combining mean reversion and momentum strategies to construct a diversified statistical arbitrage approach. The mean reversion strategy decomposes stock returns into market and idiosyncratic components using PCA. The momentum strategy uses technical trading rules to trade at the industry sector level. Dynamic portfolio optimization rebalances exposures as the market environment changes.
By James Velissaris.
Statistical Arbitrage in Options Markets by Graph Learning and Synthetic Long Positions#
This paper targets options markets using graph learning (RNConv architecture incorporating tree structures on tabular features) and synthetic long-short arbitrage (SLSA) positions neutral to Black-Scholes risks. On KOSPI 200 index options, RNConv outperforms GCN/SAGE/GAT baselines; SLSA delivers consistent positive P&L with average information ratio 0.1627 and hit rates above 57%.
By Hong and Klabjan.
Graph Learning for Foreign Exchange Rate Prediction and Statistical Arbitrage#
Applies graph learning methods to foreign exchange rate prediction in the context of statistical arbitrage. The graph structure captures dependencies between currency pairs, enabling identification of mispricing opportunities in FX markets. On ten of the most traded currencies, the FXSA method achieves a 61.89% higher information ratio and 45.51% higher Sortino ratio compared to benchmark.
LLMs for Time Series: an Application for Single Stocks and Statistical Arbitrage#
Tests the Chronos LLM for single-stock and long/short stat-arb portfolios on large U.S. equities (2002-2016). Pretrained and fine-tuned versions generate gross Sharpe ratios up to 3.97 on PCA residuals, outperforming autoARIMA but trailing specialized CNN-Transformer benchmarks. Alpha is strongest pre-2008, which points to markets becoming more efficient over time, yet the results confirm that LLMs can extract signals from near-random time series.
By Valeyre and Aboura.
Statistical Arbitrage in Polish Equities Market Using Deep Learning Techniques#
Adapts the Avellaneda-Lee framework to Polish equities (WIG20/mWIG40, 2017-2020) using PCA eigenportfolios, sector ETFs, or stacked LSTMs for replication. PCA delivers approximately 20% cumulative returns and annualized Sharpe up to 2.63 (2017-2019); ETFs alone remain profitable (approximately 5% annual) during the 2020 COVID recession while PCA/LSTM incur losses.
By Adamczyk and Dabrowski.
Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading#
Applies Deep Q-Learning to statistical arbitrage in high-frequency trading settings. The reinforcement learning agent learns to exploit short-lived mean-reversion opportunities at high frequency, adapting its trading policy dynamically to changing market microstructure conditions.
Statistical Arbitrage in Cryptocurrency Markets#
A landmark high-frequency study applying random forest classifiers to minute-binned cryptocurrency data. Using 40 liquid coins (Jan-Sep 2018), the strategy delivers 7.1 basis points per day net of 15 bps round-trip costs, which annualizes to approximately 29%, with a Sharpe ratio of 2.55 and a maximum drawdown of 2.4%. Past 60-minute returns emerge as the strongest predictors.
By Thomas Günter Fischer, Christopher Krauss, and Alexander Deinert.
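As a quick sanity check on how a daily figure of this size maps to an annual one, the snippet below compounds a daily return over a year. The 365-day year (crypto markets trade every day) and the use of compounding are assumptions for this illustration; the paper's own annualization convention may differ.

```python
def annualize_daily(daily_return, periods_per_year=365):
    """Compound a constant per-period return over one year."""
    return (1.0 + daily_return) ** periods_per_year - 1.0

daily = 7.1e-4                      # 7.1 basis points per day
annual = annualize_daily(daily)     # compounded over 365 days
simple = daily * 365                # non-compounded comparison
```

Compounding gives a figure between 29% and 30%, while the simple product is closer to 26%, so the quoted annual number is consistent with daily compounding.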
Constructing Cointegrated Cryptocurrency Portfolios for Statistical Arbitrage#
Foundational work applying cointegration to cryptocurrency markets using Engle-Granger and Johansen tests to form multi-coin portfolios. On daily Coinbase prices (Dec 2017-Jun 2018) for BTC, ETH, LTC, and BCH, the basic strategy at 1.5-sigma threshold produces an annualized Sharpe of 109 (with zero transaction cost assumption).
By Tim Leung and Hung Nguyen.
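The two-step Engle-Granger idea behind this kind of spread construction can be sketched with plain least squares. The code below is a hedged illustration on simulated data: the function names and the synthetic series are assumptions, only two assets are used, and the Dickey-Fuller statistic omits lag augmentation, so it is not a substitute for a full test implementation.

```python
import math
import random

def ols(x, y):
    """Fit y = alpha + beta * x by least squares; returns (alpha, beta)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    beta = (sum((a - mx) * (b - my) for a, b in zip(x, y))
            / sum((a - mx) ** 2 for a in x))
    return my - beta * mx, beta

def engle_granger_stat(x, y):
    """Step 1: regress y on x. Step 2: Dickey-Fuller t-statistic on the residuals.

    Returns (beta, residuals, t_stat). A strongly negative t_stat is evidence of
    cointegration; it should be compared with Engle-Granger critical values,
    not with standard normal quantiles.
    """
    alpha, beta = ols(x, y)
    e = [b - alpha - beta * a for a, b in zip(x, y)]
    lag = e[:-1]
    diff = [e[t + 1] - e[t] for t in range(len(e) - 1)]
    lag_ss = sum(l * l for l in lag)
    rho = sum(l * d for l, d in zip(lag, diff)) / lag_ss
    sse = sum((d - rho * l) ** 2 for l, d in zip(lag, diff))
    se = math.sqrt(sse / (len(lag) - 1) / lag_ss)
    return beta, e, rho / se

def zscore_signal(residuals, threshold=1.5):
    """+1 (long spread), -1 (short spread) or 0, from the latest residual's z-score."""
    n = len(residuals)
    mean = sum(residuals) / n
    sd = math.sqrt(sum((r - mean) ** 2 for r in residuals) / (n - 1))
    z = (residuals[-1] - mean) / sd
    return -1 if z > threshold else 1 if z < -threshold else 0

# Simulated cointegrated pair (hypothetical data): y tracks 2 * x plus noise.
random.seed(1)
x, level = [], 100.0
for _ in range(500):
    level += random.gauss(0.0, 1.0)
    x.append(level)
y = [1.0 + 2.0 * v + random.gauss(0.0, 0.5) for v in x]
beta, residuals, t_stat = engle_granger_stat(x, y)
signal = zscore_signal(residuals)
```

The recovered `beta` plays the role of the hedge ratio, and the 1.5-sigma default in `zscore_signal` mirrors the entry threshold quoted above.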
Evaluation of Dynamic Cointegration-Based Pairs Trading Strategy in the Cryptocurrency Market#
Advances high-frequency crypto pairs trading with minute-binned BitMEX data (Sep 2018-Oct 2019). The authors calibrate Ornstein-Uhlenbeck half-lives for optimal look-back windows and test three scenarios: single pairs (Engle-Granger/KSS), multi-coin baskets (Johansen), and all-pairs. Basket trading yields the strongest results: total P&L 1.44 XBT, Sharpe 7.94.
By Masood Tadi and Irina Kortchemski.
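The link between an Ornstein-Uhlenbeck fit and a look-back window comes from the half-life of mean reversion. A minimal sketch, assuming a discrete AR(1) approximation of the OU process and a hypothetical helper name, looks like this:

```python
import math

def ou_half_life(spread):
    """Half-life in bars implied by an AR(1) fit spread[t+1] = a + b * spread[t].

    For an OU process sampled at unit intervals, b = exp(-kappa), so the
    half-life ln(2) / kappa equals -ln(2) / ln(b).
    """
    x0, x1 = spread[:-1], spread[1:]
    n = len(x0)
    m0, m1 = sum(x0) / n, sum(x1) / n
    b = (sum((u - m0) * (v - m1) for u, v in zip(x0, x1))
         / sum((u - m0) ** 2 for u in x0))
    if not 0.0 < b < 1.0:
        return None                  # not mean-reverting under this fit
    return -math.log(2.0) / math.log(b)

# Toy spread whose distance to zero halves every bar (hypothetical data).
toy_spread = [1.0, 0.5, 0.25, 0.125, 0.0625, 0.03125]
half_life = ou_half_life(toy_spread)
```

How many half-lives to use as the look-back window is a tuning choice that this sketch leaves open.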
Copula-Based Trading of Cointegrated Cryptocurrency Pairs#
Introduces non-parametric dependence modeling to crypto stat arb using hourly Binance USDT-margined futures (2021-2023, 20 pairs, 104 monthly cycles). After Engle-Granger/KSS cointegration screening and copula fitting, annualized returns reach 35-37% with Sharpe approximately 0.95.
By Masood Tadi and Jiri Witzany.
High-Frequency Trading of Cryptocurrencies Through Short-Term Cointegration Pairs-Trading Strategies#
A dedicated high-frequency study on Binance data sampled at 1-minute intervals (May-Sep 2018, 17 USDT pairs). The Johansen-assisted version averages 6.81% weekly returns versus 5.97% for pure Engle-Granger.
By Lennart de Bruijn (Erasmus University, in collaboration with Blocktraders HFT firm).
Reinforcement Learning Pair Trading: A Dynamic Scaling Approach#
Applies reinforcement learning agents (A2C, PPO, SAC, DQN) to 1-minute Binance BTC-GBP/EUR pairs, learning timing, direction, and position sizing. Traditional Gatev-style pairs trading yields 8.33% annualized; RL variants reach 9.94-31.53%.
By Hongshen Yang and Avinash Malik.
Altcoin-Bitcoin Arbitrage#
Focuses on cross-sectional mean-reversion driven by prior-day momentum using daily open-to-close returns. Long low-momentum altcoins versus short Bitcoin produces Sharpe ratios up to 14.65 in illiquid tiers, though capacity is constrained by market impact.
By Zura Kakushadze and Willie Yu.
Trading and Arbitrage in Cryptocurrency Markets#
This paper studies cross-exchange arbitrage opportunities in cryptocurrency markets, documenting persistent pricing discrepancies between exchanges and quantifying the returns available from exploiting them. The authors analyze the structural reasons for arbitrage persistence including capital transfer frictions, withdrawal limits, and timing risks.
Pairs Trading - Selection via Scoring Systems#
Lukas Reichmann develops a cointegration-based pairs trading selection framework that replaces the usual single-factor sum-of-squared-distances (SSD) ranking with an aggregated score built from statistically significant panel-regression coefficients. The paper estimates which inter-pair characteristics predict future mean-reversion activity, then uses those coefficients to rank candidate stock pairs. Published in Finance Research Letters, Vol. 93, March 2026.
Our summary: the paper is useful because it treats pair selection as a forecasting problem rather than a purely descriptive closeness screen. The score combines training-period mean reversion, squared spread distance, spread volatility, shared industry classification, momentum-rank distance, and volume correlation. The regression score improves returns and risk-adjusted performance versus the SSD benchmark.
Data and code: the study uses Refinitiv Eikon stock-level data for all firms that were S&P 500 constituents at any point between 2010 and 2023.
Key metrics: the best-reported configuration uses lambda = 2 with six-month training and six-month trading periods. Including the COVID period, the updated 20-pair scoring strategy earns a 20.59% cumulative gross return and 11.33% net return after transaction costs, versus 4.06% gross and -5.49% net for the SSD 20-pair benchmark.
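The scoring idea can be illustrated as a linear combination of pair characteristics. In the sketch below every coefficient, feature value, and ticker is made up for the example; the paper's estimated coefficients and any feature scaling are not reproduced here.

```python
# Hypothetical coefficients; the signs only indicate a plausible direction.
COEFS = {
    "mean_reversions": 1.0,          # training-period mean-reversion count
    "ssd": -2.0,                     # squared spread distance
    "spread_vol": -1.0,              # spread volatility
    "same_industry": 0.5,            # 1 if both stocks share an industry
    "momentum_rank_distance": -0.1,
    "volume_corr": 1.0,
}

def score(features):
    """Aggregate score: sum of coefficient times characteristic."""
    return sum(COEFS[name] * features[name] for name in COEFS)

def rank_pairs(candidates, top_n=20):
    """Return the top_n pairs by descending score."""
    return sorted(candidates, key=lambda pair: score(candidates[pair]), reverse=True)[:top_n]

# Two placeholder pairs with invented characteristics.
candidates = {
    ("AAA", "BBB"): {"mean_reversions": 4, "ssd": 0.5, "spread_vol": 0.2,
                     "same_industry": 1, "momentum_rank_distance": 3, "volume_corr": 0.6},
    ("CCC", "DDD"): {"mean_reversions": 2, "ssd": 0.1, "spread_vol": 0.1,
                     "same_industry": 0, "momentum_rank_distance": 10, "volume_corr": 0.2},
}
ranked = rank_pairs(candidates)
```

Note that the pair with the smaller SSD ranks second here, which is the kind of reordering a multi-factor score allows and a pure SSD screen cannot.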
Short Scamtrash / Long Distinguished Crypto#
A practical investigation into the “quality factor” for cryptocurrency perpetuals trading. The core strategy is simple: short scammy, low-quality crypto perps while going long large-cap crypto against them. This works better in crypto than traditional equities because markets are less efficient and there are abundant opportunities to identify low-quality assets.
The post demonstrates how to build a simple “trashmetric” derived from trading volume to sort perpetuals into buckets by quality. Backtests show that trashier assets have worse average returns while more distinguished assets perform better. The strategy trades once a week, going long the least trashy third of the universe and short the most trashy third, with volatility-targeted position sizing.
By Robot James.
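The weekly sort can be sketched as a tercile long-short portfolio. The score values, volatilities, tickers, and the simple inverse-volatility sizing below are all assumptions for illustration; the post's actual trashmetric and sizing rule are not reproduced.

```python
def long_short_weights(trash_score, vol, target_vol=0.10):
    """Long the least trashy third, short the most trashy third.

    Each position is sized as target_vol / vol / names_per_side, a simple
    inverse-volatility rule that ignores correlations between assets.
    """
    ranked = sorted(trash_score, key=trash_score.get)    # least trashy first
    third = len(ranked) // 3
    if third == 0:
        return {}                                        # universe too small to sort
    weights = {}
    for name in ranked[:third]:
        weights[name] = target_vol / vol[name] / third
    for name in ranked[-third:]:
        weights[name] = -target_vol / vol[name] / third
    return weights

# Placeholder universe with invented scores and annualized volatilities.
trash_score = {"BTC": 0.1, "ETH": 0.2, "SOL": 0.5, "DOGE": 0.7, "PEPE": 0.9, "SCAM": 1.0}
vol = {"BTC": 0.5, "ETH": 0.6, "SOL": 0.9, "DOGE": 1.0, "PEPE": 1.5, "SCAM": 2.0}
weights = long_short_weights(trash_score, vol)
```

Rerunning the function once a week on refreshed scores and volatilities would give the rebalance targets for that week.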
Pairs Trading and Statistical Arbitrage Across Top Cryptocurrencies#
A comprehensive article on implementing pairs trading and statistical arbitrage strategies across the top 23 cryptocurrencies by market capitalization. The article covers the full implementation including data normalization, the Sum of Squared Differences (SSD) metric for measuring price divergence, and the formation and trading period methodology.
The article addresses practical challenges including the computational complexity of 253 possible pair combinations, the importance of proper price normalization for fair comparison, and the trade-off between data granularity and trading costs.
By Pedma.
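The pair count follows directly from choosing 2 assets out of 23, and it grows quadratically with the universe size. A short check, using placeholder ticker names that are not from the article:

```python
from itertools import combinations
from math import comb

def pair_count(n_assets):
    """Number of unordered pairs: n * (n - 1) / 2."""
    return comb(n_assets, 2)

universe = [f"COIN{i:02d}" for i in range(1, 24)]    # 23 placeholder tickers
pairs = list(combinations(universe, 2))
```

Doubling the universe to 46 assets would roughly quadruple the work to 1,035 pairs, which is why the pair search dominates the cost as the universe grows.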
BTC-ETH Pairs Trading: Correlation Analysis and Statistical Arbitrage#
An article exploring statistical arbitrage opportunities between Bitcoin and Ethereum through correlation analysis and linear regression. The study uses Pearson correlation coefficients on 4-hour interval data to understand the dynamic relationship between the two largest cryptocurrencies.
The article demonstrates how the BTC-ETH correlation fluctuates with market conditions, creating trading opportunities when the pair deviates from its historical relationship.
By Pedma.
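A rolling Pearson correlation is one way to see how such a relationship drifts over time. The sketch below is an assumption-laden illustration: the function names and the toy series are invented, and the window length is left to the caller (on 4-hour bars, 42 bars would cover one week).

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)

def simple_returns(prices):
    """Bar-over-bar percentage returns."""
    return [prices[i] / prices[i - 1] - 1.0 for i in range(1, len(prices))]

def rolling_corr(x, y, window):
    """Correlation over each trailing window; one value per complete window."""
    return [pearson(x[i - window:i], y[i - window:i])
            for i in range(window, len(x) + 1)]

# Toy inputs standing in for BTC and ETH return series (hypothetical data).
series_a = [1.0, 2.0, 3.0, 4.0, 5.0]
series_b = [2.0, 4.0, 6.0, 8.0, 7.0]
corr_path = rolling_corr(series_a, series_b, window=3)
```

In practice the inputs would be `simple_returns` of the two price series rather than raw levels, since correlations of trending price levels tend to be inflated.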