How Much Signal Is in L1? — Tick Data Intelligence

The Question

Twenty rows of book. How many of them carry signal?

A trader looking at a quote terminal sees ten levels of book on each side. Twenty rows of bids and offers. The unspoken assumption is that all twenty rows carry information — that's why exchanges sell deeper book feeds, why infrastructure teams subscribe to L10, and why analytics tools display the full ladder.

But how much of that information is actually predictive? If we built a model of the next mid-price move using only L1, vs L1 + L2, vs the full L1–L10, how much extra predictive power would the deeper levels contribute?

This chapter answers that question on SET data — and the answer is surprising.

What the Research Says

L1 dominance is the consensus — on US and European data.

The relationship between order book depth and short-horizon price prediction has been studied for two decades. Three findings are well-established in the literature.

The information content of deeper levels has been measured in early empirical work and consistently found to be modest beyond the first few levels of the book. Most price-relevant information sits at the top, with deeper levels contributing diminishing marginal value.
Order flow imbalance at L1 has been shown to explain the majority of short-horizon mid-price variation in liquid markets. Linear models on imbalance achieve R² in the 1–25% range for sub-minute horizons, depending on stock liquidity and modeling choices.
Modern deep-learning studies on full LOB data confirm the same hierarchy — L1 dominates, L2–L5 add marginal value, deeper levels contribute little for short-horizon forecasting.

These findings are consistent across US equities, European equities, and major futures contracts. What hasn't been measured is whether the same hierarchy holds on SET, with its binding tick grid (Chapter 4) and L1-concentrated depth (Chapter 5). The empirical contribution of this chapter is to settle that question for SET data — and the result is sharper than expected.

The Method

750 regressions, every cell measured out-of-sample.

For each (stock × horizon × depth-config) cell:

Predict: future mid-price change in basis points at horizon H seconds.
Predictors: order flow imbalance per level (bid_qty[k] − ask_qty[k]), included for k = 1…depth.
Model: multivariate OLS with HAC (Newey-West) standard errors.
Train: first 15 trading days. Test: last 7 trading days; report OOS R².
Confidence: 95% bootstrap CI on OOS R² (300 block-bootstrap reps).

5 depth configs (L1, L1–L2, L1–L3, L1–L5, L1–L10) × 6 horizons (5s, 10s, 30s, 60s, 5min, 15min) × 25 stocks = 750 regressions. Each cell evaluated on its own out-of-sample data.

Sessions filtered to continuous trading only (10:00–12:30 and 14:00–16:30 Bangkok). Predictions whose horizon spans the lunch break or end of day are dropped.

The Headline

L1 alone wins at every horizon.

Mean OOS R² across the 25 stocks, in percentage points. The L1-only row is best at every column.

OOS R² by Depth and Horizon

Heatmap of mean out-of-sample R-squared across 25 stocks for 5 depth configurations and 6 horizons

Brighter = better OOS predictability. The L1-only row is darkest-positive at every horizon.

Depth	5s	10s	30s	60s	5min	15min
L1 only	0.78	1.11	1.76	2.18	2.53	1.49
L1–L2	0.74	1.05	1.65	1.97	2.11	0.99
L1–L3	0.71	0.99	1.52	1.77	1.49	−0.70
L1–L5	0.69	0.95	1.43	1.62	0.80	−2.25
L1–L10	0.57	0.71	0.98	0.87	−1.39	−8.02

1. L1 alone is the best predictor at every horizon. Adding levels never helps on average.

2. At long horizons (5min, 15min), deeper-book models go negative OOS — meaning the L10 model fits the test set worse than just predicting the average return. That is a strong overfitting signal: deeper levels look meaningful in training, then collapse out of sample.

The Slope

Every line slopes down. The longer the horizon, the steeper the damage.

Each line traces what happens as we increase the depth of book used in the regression. All slopes are negative.

OOS R² vs Depth, by Horizon

Line chart of OOS R-squared declining with book depth, with one line per horizon, all sloping downward

The deeper the book in the regression, the worse it predicts.

Going from L1 to L1–L10	Median R² change
at 5s	−0.17 pp
at 10s	−0.34 pp
at 30s	−0.66 pp
at 60s	−1.25 pp
at 5min	−2.76 pp
at 15min	−6.37 pp

The deeper the book you use, the worse you predict. The longer the horizon, the more pronounced the damage.

The Mechanism

Why deeper levels hurt.

A reasonable interpretation, consistent with this chapter's findings and the prior chapters of the series.

Deeper levels are noisier than informative. L2 through L10 quantities reflect resting orders far from the action — they update infrequently, get cancelled often, and rarely participate in price discovery. When fed into a regression, they add degrees of freedom but not signal, which causes overfitting on the training set and worse performance on the test set.

This is consistent with the SET microstructure findings from earlier chapters:

Chapter 3 showed 99.9% of L1 updates are quantity changes at the same price — most action concentrates at the top.
Chapter 4 showed the spread is pinned at one tick ~98% of the time, so price discovery happens almost entirely at L1.
Chapter 5 showed depth concentrates at L1; the cliff between displayed L1 and L2+ is steep.

If almost all the trading happens at L1, almost all the signal lives at L1. Deeper levels become decoration — useful to display, useless to predict from.

Per-Stock View

Aggregate numbers hide individual variation.

L1 imbalance only, 30-second horizon, 95% bootstrap confidence interval per stock.

OOS R² per Stock — L1, 30s Horizon

Sorted bar chart of OOS R-squared for 25 stocks at L1 and 30-second horizon, with 95 percent bootstrap confidence intervals

Bars sorted by R². CIs that cross zero indicate non-significant predictability.

18 of 25 stocks show statistically significant predictability (95% bootstrap CI excludes zero).

Best: SCM (4.8% R²), CPALL (3.3%), KCE (2.9%), VGI (2.8%), HANA (2.6%), GULF (2.6%).

Essentially unpredictable: PTT (−0.2%, CI crosses zero), EA (0.0%), TOP (0.3%, not significant).

Significance breadth by horizon

Percentage of stocks where the 95% CI lower bound is above zero.

Horizon	Significant
5s	76% (19/25)
10s	80% (20/25)
30s	72% (18/25)
60s	72% (18/25)
5min	52% (13/25)
15min	32% (8/25)

Short-horizon prediction is broadly significant. Long-horizon prediction works only on a minority.

The Damage, Per Stock

No row is rescued by deeper book.

Heatmap of 25 stocks × 5 depth configurations at 30-second horizon. The best column for nearly every row is L1 or L1–L2.

OOS R² per Stock × Depth Config — 30s Horizon

Heatmap of OOS R-squared for 25 stocks across 5 depth configurations at 30-second horizon

Greens are at the top of the chart and on the left columns; reds appear as depth grows.

Stocks where L1 alone works (top of chart) usually keep working at L1–L3, then degrade at L5 and L10. Stocks that don't work at L1 (PTT, EA, TOP) don't get rescued by adding depth — they get worse. Across nearly every row, the best column is L1 or L1–L2, never L1–L10.

There is no stock where adding deeper book levels improved predictability meaningfully out-of-sample.

What This Means

Three audiences, one conclusion.

For Traders

If your strategy needs short-horizon mid-price prediction, L1 is enough. You can ignore L2–L10 for forecasting purposes.
The marginal data feed cost of deeper levels is real; the marginal predictive value is negative.
Stock selection matters more than feature engineering: 18 of 25 have meaningful L1 signal; the other 7 are flat regardless of method.

For Market Structure

SET behaves more extreme than developed markets. US equity studies show diminishing returns past L3–L5; SET shows negative returns past L1.
The tick grid + L1 concentration explain why.
Deeper book on SET is a display artifact, not an information channel.

For Data Infrastructure

Whole-book LOB reconstruction is what enables this measurement. You can't compare L1 vs L10 predictability without the full book to test against.
For forecasting, you only need L1.
The full book matters for execution simulation (Chapter 5 walk-the-book) and for measuring market structure (this chapter), but not for predicting the next move.

Beyond Imbalance

Richer L1 features — no deeper book required.

If L1 is where the signal lives, the natural follow-up is: how much more signal can we extract by using L1 behavior, not just L1 state?

We tested an enriched feature set built entirely from L1 — no deeper book required:

L1 imbalance — bid_qty − ask_qty (the baseline).
Normalized imbalance — imbalance scaled by total L1 size.
Microprice deviation — depth-weighted fair value vs midpoint, in bps.
L1 spread — current bid-ask gap in bps.
Imbalance velocity — change in imbalance from the previous snapshot.
Message activity — number of order book updates per second.

OOS R² — Baseline (L1 imbalance) vs Enriched (L1 dynamics)

Line chart comparing baseline L1 imbalance OOS R-squared against enriched L1 dynamics OOS R-squared across six horizons

Enriched L1 features roughly double predictability at sweet-spot horizons.

Horizon	Baseline (L1 imbalance)	Enriched (L1 dynamics)	Lift
5s	0.78%	1.04%	+0.26 pp
10s	1.11%	2.15%	+1.04 pp
30s	1.76%	3.00%	+1.24 pp
60s	2.18%	3.52%	+1.34 pp
5min	2.53%	3.00%	+0.47 pp
15min	1.49%	−0.23%	−1.72 pp (overfits)

At the sweet-spot horizons (10s–60s), enriching L1 features roughly doubles predictability. The lift is broad-based — 84–96% of stocks see improvement at these horizons.

The contrast with Section 5 is the key finding of this chapter:

Hurts

Add L2–L10 (more depth)

Negative effect on OOS R² at every horizon.

Helps

Add L1 dynamics (more behavior)

Positive effect on OOS R² at every mid-range horizon.

The path to better short-horizon prediction on SET is not deeper book data. It is richer features at L1. Microprice, spread state, imbalance change, and quote-update rate carry information that imbalance alone misses — and they all live at the top of the book.

A trader who can compute these features in real time on L1 data extracts more signal than one paying for L10 and using it naively. This also reinforces why message-level LOB reconstruction matters: most of these features (velocity, message rate, microprice) cannot be computed from a snapshot feed. They require the full message stream.

Honest Caveats

What this analysis does not claim.

Read these before citing the numbers.

Sample is one month. Results across regimes (volatility, news, earnings) may differ.
5-second snapshot interval. Sub-5-second horizons (where US literature finds the highest R²) are not measured. We don't know if the L1-dominance pattern holds at millisecond scale.
R² ≠ tradeable alpha. A model with 2% R² predicting 30-second returns is valuable only if execution latency is much shorter than 30s and trading cost is far below the predicted return. Most trading desks lose the signal to slippage and fees before they can capture it.
Linear OLS only. Non-linear models (random forests, neural networks) might extract additional signal from deeper levels. Our finding is that under linear assumptions, deeper levels hurt. Deep-learning studies on US equities have also found L1 dominance — suggesting the linearity is not the binding constraint — but this should be verified for SET.
One feature per level (raw imbalance). More elaborate features (volume-weighted prices, cancellation rates, queue position) might extract more from deeper levels.
Top-tier quant funds will already have done variants of this analysis on their universes. The contribution here is the SET-specific result, transparently presented, not the methodology.

How Much Signal
Is in L1?

Twenty rows of book. How many of them carry signal?

L1 dominance is the consensus — on US and European data.

750 regressions, every cell measured out-of-sample.

L1 alone wins at every horizon.

Every line slopes down. The longer the horizon, the steeper the damage.

Why deeper levels hurt.

Aggregate numbers hide individual variation.

Significance breadth by horizon

No row is rescued by deeper book.

Three audiences, one conclusion.

For Traders

For Market Structure

For Data Infrastructure

Richer L1 features — no deeper book required.

Add L2–L10 (more depth)

Add L1 dynamics (more behavior)

What this analysis does not claim.

Read these before citing the numbers.

References

Weak Signals, Strong Verdicts

How Much SignalIs in L1?

Twenty rows of book. How many of them carry signal?

L1 dominance is the consensus — on US and European data.

750 regressions, every cell measured out-of-sample.

L1 alone wins at every horizon.

Every line slopes down. The longer the horizon, the steeper the damage.

Why deeper levels hurt.

Aggregate numbers hide individual variation.

Significance breadth by horizon

No row is rescued by deeper book.

Three audiences, one conclusion.

For Traders

For Market Structure

For Data Infrastructure

Richer L1 features — no deeper book required.

Add L2–L10 (more depth)

Add L1 dynamics (more behavior)

What this analysis does not claim.

Read these before citing the numbers.

References

Weak Signals, Strong Verdicts

How Much Signal
Is in L1?