Market Microstructure#

Market microstructure is the branch of financial economics that studies how asset prices are determined through the trading process — how buyers and sellers interact, how information is incorporated into prices, and how market structure affects the cost and quality of price discovery. Key concepts include the bid-ask spread and its components (order processing, inventory, and adverse selection costs), price impact of trades, the limit order book as the primary mechanism for price formation in modern electronic markets, and the role of market makers as liquidity providers.

The research here spans foundational theory — the Glosten-Milgrom and Kyle models of informed trading, inventory models for optimal market maker quoting, the structure of strategic trading — and empirical applications: how market makers navigate the fill probability versus post-fill returns tradeoff, optimal exchange fee design, Bitcoin price discovery across fragmented venues, and the mechanics of the two-tiered funding rate market. Practical implementations range from the Avellaneda-Stoikov framework for high-frequency market making to order type taxonomy and execution strategy design.

Related topics include Order Flow for OFI-based signals and flow toxicity, Statistical Arbitrage for cross-venue relative-value strategies, and Algorithmic Trading for execution and live deployment topics.

The Market Maker’s Dilemma: Navigating the Fill Probability vs. Post-Fill Returns Trade-Off#

This paper presents results from a live trading experiment on the Binance Bitcoin perpetual futures market, the most liquid crypto market globally. The authors submitted 232,897 minimum-sized maker orders over one week in a continuous quoting mode (always maintaining orders at top-of-book on both sides), of which 127,051 filled and 105,846 were cancelled unfilled. The experiment reveals a fundamental negative correlation between an order’s fill probability and its post-fill returns: conditions that make fills more likely (such as adverse order book imbalance where the opposite queue is large and the near-side queue is small) are precisely the conditions that predict poor post-fill returns due to price persistence.

The paper demonstrates the consequences with a naive market making strategy that continuously quotes at the touch and rebalances to zero inventory after each fill. This strategy lost nearly 60% in three days, producing an annualised Sharpe ratio of −109 with an average holding time of 11 seconds and an average loss of 0.44 basis points per roundtrip. The authors then model “reversals” using logistic regression and random forest classifiers. A balanced-inventory strategy that only trades when the model predicts a reversal with probability exceeding the 0.24 threshold improves the Sharpe ratio to 11.97 (logistic regression) while maintaining 327 roundtrips per day.

Read the paper

Optimal Make-Take Fees in a Multi Market Maker Environment#

This paper develops the optimal exchange fee structure when multiple market makers compete to provide liquidity. The analysis shows how maker rebates and taker fees should be set to maximize market quality, volume, and exchange revenue simultaneously, with implications for how exchange design affects market maker behavior and trading costs.

Read the paper

Where Is the Price of Bitcoin Determined? Price Discovery in a Fragmented Market#

Riccardo Cosenza and Simon Stalder investigate price discovery for Bitcoin across regulated and unregulated spot and perpetual-futures venues using high-frequency data. They apply Hasbrouck information shares, Gonzalo–Granger component shares, and Putniņš information-leadership shares.

Our summary: this is the reference paper for the question “which venue sets the Bitcoin price?” The answer is blunt: unregulated crypto-native venues (primarily Binance spot and perpetual futures) dominate price discovery during the vast majority of trading hours. Regulated venues like Coinbase gain relative importance around specific fixing windows, notably the NY 4pm fixing used as an ETF NAV reference.

Key metrics: Binance’s information share dominates across most hours of the trading day; Coinbase’s share rises around the NY 4pm fixing.

Read the paper

The Two-Tiered Structure of Cryptocurrency Funding Rate Markets#

This paper examines the price discovery dynamics for perpetual futures funding rates across CEX and DEX venues, documenting a two-tiered structure where CEX funding rates lead DEX rates. The analysis quantifies the lead-lag relationship and identifies the structural factors — trading volume, liquidity, and participant composition — that determine which venue plays the price discovery role.

Read the paper

Market Microstructure Foundations#

A comprehensive introduction to market microstructure, the branch of financial economics that studies how investors’ latent demands are translated into executed trades. The article covers the core building blocks including order-driven versus quote-driven markets, the role of market makers, price discovery mechanisms, and the concept of information asymmetry between market participants.

The post explores foundational theoretical models and their practical implications for trading decisions, risk management, and understanding the forces that shape asset prices. Topics include bid-ask spreads, transaction costs, market efficiency at the microstructure level, and the impact of market structure on price formation.

By Simone Menaldo.

Read the article.

Strategic Trading Models#

An exploration of strategic trading models that provide rigorous frameworks for executing large orders while minimizing market impact and preserving informational advantages. The article covers foundational concepts including Kyle’s Lambda estimation, the mechanics of price impact, and advanced execution tactics such as trade splitting, stealth trading, and the logic behind parent order strategies.

The post delves into how institutional traders balance the trade-off between execution speed and market impact, drawing on Kyle’s 1985 seminal model and its extensions. It examines how informed traders optimally reveal their information through trading, and how these theoretical insights translate into practical algorithmic execution strategies used by quantitative trading firms.

By Simone Menaldo.

Read the article.

Information-Based Models in Market Microstructure#

A comprehensive overview of information-based models in market microstructure, providing a framework for understanding how private information is incorporated into asset prices. The article traces the development of these models from Bagehot’s foundational insights through to modern metrics like the Probability of Informed Trading (PIN) and Volume-Synchronized PIN (VPIN).

The post explores the theoretical underpinnings of information asymmetry, adverse selection, and the strategic behavior of informed and uninformed traders. It covers the Glosten-Milgrom sequential trade model, the Kyle continuous auction model, and how these frameworks explain the mechanics of price discovery, liquidity provision, and trading costs in modern markets.

By Simone Menaldo.

Read the article.

Inventory Models in Market Microstructure#

A comprehensive overview of inventory models in market microstructure, which provide a systematic framework for understanding how market makers quote bid-ask spreads to control inventory risk and maximize profitability. The article covers foundational models by Garman (1976), Stoll (1978), and Ho and Stoll (1981), as well as advanced topics such as optimal inventory control and the Avellaneda-Stoikov framework.

The post explores how uncontrolled inventory exposes market makers to adverse price movements, and how optimal quoting strategies dynamically adjust spreads based on current inventory levels, risk aversion, and market volatility. The Avellaneda-Stoikov model is presented as the modern standard for high-frequency market making.

By Simone Menaldo.

Read the article.

Bid-Ask Spread Decomposition#

A detailed guide to decomposing the bid-ask spread into its fundamental components: order processing costs, inventory holding costs, and adverse selection costs. The article covers the theoretical models behind each component, methods for empirical estimation including the Roll (1984) and Madhavan-Richardson-Roomans (1997) approaches, and their behavior across different market conditions.

The post also examines time-varying spread dynamics, cross-sectional variation in spread components across different assets, intraday U-shaped spread patterns, and the impact of electronic versus floor trading on spread composition.

By Simone Menaldo.

Read the article.

Market Liquidity#

An overview of essential measures of market liquidity, from classic spread-based measures to more advanced model-based indicators. The article covers effective spread, realized spread, quoted depth, Amihud illiquidity ratio, Kyle’s Lambda, and other key metrics used to quantify the ease with which assets can be traded without significantly affecting their price.

The post explores the theoretical underpinnings and practical applications of each liquidity measure, explaining how they capture different dimensions of liquidity including tightness, depth, resiliency, and immediacy.

By Simone Menaldo.

Read the article.

Limit Order Book Structure#

A comprehensive overview of the limit order book (LOB), the fundamental data structure representing real-time supply and demand for financial instruments. The article covers key components including price-time priority, pro-rata allocation, order book depth, Level I/II/III market data, order book imbalance as a predictor of short-term price movements, queue position, and visible versus hidden depth.

The post explains how understanding LOB structure provides insights into market sentiment, liquidity, and short-term price dynamics. For traders and portfolio managers, this knowledge informs trading strategies, improves execution quality, and enhances risk management.

By Simone Menaldo.

Read the article.

Understanding Order Types#

A detailed guide to the order types that translate trading strategies into executed trades in modern financial markets. The article covers market orders, limit orders, stop orders, stop-limit orders, iceberg orders, and more complex conditional and algorithmic order variants, examining their mechanics and strategic applications within the limit order book framework.

The post emphasizes that understanding order types is fundamental to managing execution costs, controlling market impact, and optimizing portfolio performance.

By Simone Menaldo.

Read the article.

Why Cumulative Volume Delta Fails Traders#

An examination of why Cumulative Volume Delta (CVD), despite its apparent objectivity, consistently misleads traders. The post argues that CVD wins traders over in hindsight — after a session ends, the CVD chart appears to confirm moves cleanly — but in real time, the signal breaks down because CVD measures aggression totals, not who is actually in control of the auction.

The core issue is that CVD assumes aggressive buying should push price up and aggressive selling should push it down, but this ignores the role of passive absorption by limit orders. Instead of relying on CVD as a directional signal, the author advocates examining delta on lower timeframes to observe how aggression is being absorbed or accepted at specific price levels.

By Sierra Trading.

Read the blog post.

TradeFM: A Generative Foundation Model for Trade-flow and Market Microstructure#

TradeFM is a 524M-parameter generative Transformer that learns unified trade-flow dynamics directly from billions of trade events across more than 9,000 US equities. The model introduces scale-invariant features and a universal tokenization scheme that map the heterogeneous, multi-modal event stream of order flow into a unified discrete sequence, eliminating asset-specific calibration. Integrated with a deterministic market simulator, TradeFM-generated rollouts reproduce key stylized facts of financial returns including heavy tails, volatility clustering, and absence of return autocorrelation. The model generalizes zero-shot to geographically out-of-distribution APAC markets (Japan and China) with moderate perplexity degradation.

Our summary: the paper’s core contribution is demonstrating that the foundation model paradigm — pre-train a large model on massive heterogeneous data, then transfer — works for market microstructure. Rather than calibrating separate models per asset, TradeFM learns a single model across all liquidity regimes and asset types. Practical applications include synthetic data generation for illiquid assets, stress testing under counterfactual scenarios, and training reinforcement-learning-based trading agents.

Key metrics: TradeFM achieves 2-3x lower distributional error than Compound Hawkes baselines across all return intervals.

Mentioned by Iqbal Zainal in this discussion.

Read the paper

Trading Costs#

Frazzini, Israel, and Moskowitz (AQR / Yale, working paper, 2018) measure real-world trading costs and the price impact function of a large institutional money manager using a singular dataset: $1.7 trillion of live executed trades across 21 developed equity markets from August 1998 to June 2016. Where previous academic work largely relied on TAQ (trade-and-quote) data aggregated across all traders or on theoretical models calibrated to daily price-and-spread series, this paper uses actual orders and live executions from a single large manager — including the market price at trade initiation, the amount traded, and the per-share execution price. This lets the authors compute a precise implementation-shortfall measure of price impact and decompose it across trade type (initiate vs. cover, long vs. short), stock characteristics, trade size as a fraction of daily volume, time, and exchange.

The headline empirical finding is that actual trading costs for a patient institutional trader are roughly an order of magnitude smaller than the cost estimates produced by TAQ-calibrated models in the academic literature (e.g., Korajczyk and Sadka, Lesmond–Schill–Zhou, Novy-Marx and Velikov). The price-impact function is non-linear in trade size, well described by a square-root law with exponent ≈ 0.5 — consistent with Kyle (1985), Almgren et al. (2005), and the BARRA models. Price impact rises with market volatility, declines over time across markets, and is larger for smaller stocks and stocks with higher idiosyncratic risk. The authors validate the model out-of-sample by predicting the realised costs of large passive index funds (Vanguard’s S&P 500 fund and BlackRock’s iShares Russell 2000 ETF) accurately, while TAQ-calibrated models overshoot index-fund costs by several multiples.

Our summary: this is the most important practical reference for whether systematic strategies — momentum, trend following, value, low volatility — survive real-world implementation. The implication for the Hurst–Ooi–Pedersen “Demystifying Managed Futures” framework, and for cross-sectional momentum more broadly, is that the gap between gross and net Sharpe is materially smaller for a patient, sophisticated trader than naive TAQ-based academic backtests suggest. Conversely, the paper is a caution against drawing strong conclusions from any single cost number: the authors explicitly note their estimates apply to a patient trader who provides rather than demands liquidity, executes via proprietary algorithms that decide when to trade but not what to trade, and operates at institutional scale. Retail traders, intra-day strategies, and impatient liquidity demanders face costs much closer to the TAQ estimates. The paper is also notable for separating temporary from permanent price impact and for being one of the few papers to verify cost models against independent broker data (ITG, Deutsche Bank, JP Morgan) and the ANcerno database, which collectively cover 2,000+ institutions and 2,000+ brokers globally.

Data: $1.7 trillion of live executed trades from a single large money manager (AQR) across 21 developed equity markets, August 1998 to June 2016. Out-of-sample validation uses broker-reported costs (ITG, Deutsche Bank, JP Morgan), the ANcerno institutional database, and live cost data from Vanguard’s S&P 500 fund and BlackRock’s iShares Russell 2000 ETF. The live trade dataset is proprietary and not released. The fitted cost model — a square-root price impact function in trade size with calibrated coefficients for market conditions and stock characteristics — is fully specified in the paper and can be reimplemented by practitioners, but exact replication of the calibration requires the AQR execution database.

Key metrics: average market impact across all trades is 9.97 basis points, mean implementation shortfall is 11.02 basis points (a 1.05 bp gap reflecting timing slippage between order arrival and execution). Median costs are lower at 6.18 and 8.63 basis points respectively; value-weighted means are higher at 15.14 and 16.06 basis points, indicating that the largest trades carry the highest costs. Effective bid-ask spread averages less than 0.015% per year across all trades, because most orders are executed passively as limit orders providing liquidity. The average bid-ask spread at order arrival is 21.33 bps, so the live trader pays well under half the quoted spread by patient execution. Most of the price impact is permanent: only 1.26 bps of the 9 bps average is reversed within the next 24 hours, and only 2.08 bps of total price impact reverses the next day. Large-cap stock trades generate 8.90 bps of market impact versus 18.95 bps for small-cap stocks. Cost as percentage of VWAP is just 4.50 bps on average. The fitted price-impact function has a power-law coefficient close to 0.5 in trade size as a fraction of daily volume. 99.9% of intended trades complete, with average realised completion horizon under one day.

By Andrea Frazzini, Ronen Israel, and Tobias J. Moskowitz.

Read the paper

High-Frequency Market-Making with Inventory Constraints and Directional Bets#

This paper by Pietro Fodra and Mauricio Labadie extends the classical Avellaneda-Stoikov market-making framework (“High-frequency trading in a limit-order book”, Quantitative Finance 2008) and the Gueant-Lehalle-Fernandez-Tapia model (“Dealing with inventory risk”, 2011) to a general class of non-martingale mid-price processes. The authors solve for optimal bid and ask quotes under both linear and exponential PNL utility functions, using stochastic control and Hamilton-Jacobi-Bellman equations. A key innovation is an inventory-risk-aversion parameter that penalises the market-maker for ending the day with non-zero inventory, giving her direct control over inventory risk and indirect control over the moments of her PNL distribution (variance, skewness, kurtosis, VaR).

Our summary: the paper’s central contribution is enabling a market-maker to place directional bets on price trends while mathematically controlling inventory risk. When the mid-price follows a mean-reverting Ornstein-Uhlenbeck process rather than a martingale, the market-maker places non-symmetric limit orders – tightening her bid and widening her ask when she expects prices to fall, and vice versa. The framework derives closed-form solutions for optimal quotes under both utility functions and provides explicit formulas for the optimal spread and indifference price as functions of the drift, volatility, mean-reversion speed, inventory risk aversion, and current inventory level. The paper also generalises to arbitrary penalty functions on terminal inventory.

The authors use simulated data with an Ornstein-Uhlenbeck mid-price process (parameters: k=100, A=1500, n=1000 limit orders per day, daily volatility 5%, mean-reversion speed a=1). 100,000 Monte Carlo simulations were run across three directional-bet scenarios (price down 2%, flat, up 2%) and three inventory-penalty levels. No real market data or code is provided; the framework is fully analytical with closed-form solutions that can be implemented by practitioners.

Key metrics: under a mean-reverting mid-price with a correct 2% directional bet, the mean-reverting strategy without inventory penalty achieves a mean PNL of 14.29 versus the martingale benchmark’s 11.04 (a 29% increase), but at the cost of enormous inventory risk (inventory std dev 418 vs 33). Adding a small inventory penalty (eta=0.0001) brings inventory std dev down to 1.54 while maintaining a mean PNL of 11.58 and improving the Sharpe ratio to 7.51 (vs 1.04 without penalty). Under the exponential utility with the same penalty, the Sharpe ratio reaches 24.65 with tight inventory control. With higher inventory penalty (eta=0.001), the linear ABM benchmark achieves Sharpe 31.06 and the exponential ABM reaches 27.47, all with inventory 90% quantile of [-3,3]. The paper demonstrates that the market-maker can increase average PNL by more than 15% versus the martingale benchmark while carrying higher risk, or sacrifice 5% of benchmark PNL to more than double her Sharpe ratio.

Mentioned by Ruuj in this discussion.

Read the paper

Why Hyperliquid Lags Binance: Perp DEX Lead-Lag Analysis#

Arrakis Finance ran a modified Hayashi-Yoshida lead-lag estimator across three crypto perpetual swap venues — Hyperliquid, Binance Futures USDT-M, and Lighter — covering the top 29 crypto assets by market cap over a 16-day window ending 26 February 2026. The method takes two asynchronous trade tapes, shifts one in time across a grid from −2,000 ms to +2,000 ms in 100 ms steps, and reads off the shift where the cross-correlation peaks. Fills are split into bid-side and ask-side streams to suppress bid-ask bounce noise at sub-second resolution.

The results are unambiguous: Binance led Hyperliquid for all 29 assets (median lag ~700 ms), Lighter led Hyperliquid for 27 of 29 assets, and Binance led Lighter for 23 of 29 assets (median lag ~100 ms). The three pairwise lags satisfy transitivity with a median residual of only −33 ms, confirming that the lag is structural rather than an artefact of any single pair. The article traces Hyperliquid’s ~700 ms lag to its on-chain matching architecture: every fill waits for HyperBFT consensus finality (~200 ms per block), and a maker-taker round-trip spanning two consecutive blocks adds another ~500 ms. Lighter avoids this by matching off-chain in a Sequencer and enforcing decentralisation at the settlement layer via zk-proofs to Ethereum, buying speed at the cost of relocating the trust boundary from matching to settlement. The article discusses three potential architectural changes Hyperliquid could adopt — tighter BFT pipelining, a pre-confirmation layer, and decoupling matching from consensus — each with its own trust-assumption trade-offs.

Discussed by ltrd (@ltrd_) in this thread, who cautions that the 100 ms analysis window is too coarse for HFT-grade conclusions: crypto venue latency is already sophisticated enough that 10/50/100 ms bins measure exchange clock drift more than genuine price discovery. ltrd argues that Lighter does not truly “lead” Hyperliquid in any trading sense — the apparent lead is an artefact of faster matching engine execution rather than independent information arrival — and that Binance’s leadership over Lighter on assets like ETH is masked by the coarse binning.

By Arrakis Finance (@web3_pastel).

Read the blog post