← The Cost of Moving Size
Ch 6 of 13
Weak Signals, Strong Verdicts →
Tick Data Intelligence — 06 of 13

How Much Signal
Is in L1?

Deeper book levels add noise, not signal. On SET, the top of the book is the entire game.

750 out-of-sample regressions across 25 stocks, 5 depth configurations, and 6 horizons from 5 seconds to 15 minutes. L1-only beats every deeper-book configuration at every horizon. At long horizons, deeper models go negative — predicting worse than the average return.

750
OOS Regressions
25
Stocks
22
Trading Days
L1
Wins Every Horizon
ⓘ About the sample
Results come from 22 trading days of SET tick data on the 25 most-traded mainboard common stocks. Stocks were selected by activity over the first 5 days only — not the whole sample — to avoid selection on the test period. The 5-second snapshot interval means horizons shorter than 5s are not measured. A multi-quarter sample is needed to confirm whether the patterns hold across regimes.
📚 Quick definitions
The Question

Twenty rows of book. How many of them carry signal?

A trader looking at a quote terminal sees ten levels of book on each side. Twenty rows of bids and offers. The unspoken assumption is that all twenty rows carry information — that's why exchanges sell deeper book feeds, why infrastructure teams subscribe to L10, and why analytics tools display the full ladder.

But how much of that information is actually predictive? If we built a model of the next mid-price move using only L1, vs L1 + L2, vs the full L1–L10, how much extra predictive power would the deeper levels contribute?

This chapter answers that question on SET data — and the answer is surprising.

What the Research Says

L1 dominance is the consensus — on US and European data.

The relationship between order book depth and short-horizon price prediction has been studied for two decades. Three findings are well-established in the literature.

These findings are consistent across US equities, European equities, and major futures contracts. What hasn't been measured is whether the same hierarchy holds on SET, with its binding tick grid (Chapter 4) and L1-concentrated depth (Chapter 5). The empirical contribution of this chapter is to settle that question for SET data — and the result is sharper than expected.

The Method

750 regressions, every cell measured out-of-sample.

For each (stock × horizon × depth-config) cell:

  • Predict: future mid-price change in basis points at horizon H seconds.
  • Predictors: order flow imbalance per level (bid_qty[k] − ask_qty[k]), included for k = 1…depth.
  • Model: multivariate OLS with HAC (Newey-West) standard errors.
  • Train: first 15 trading days. Test: last 7 trading days; report OOS R².
  • Confidence: 95% bootstrap CI on OOS R² (300 block-bootstrap reps).

5 depth configs (L1, L1–L2, L1–L3, L1–L5, L1–L10) × 6 horizons (5s, 10s, 30s, 60s, 5min, 15min) × 25 stocks = 750 regressions. Each cell evaluated on its own out-of-sample data.

Sessions filtered to continuous trading only (10:00–12:30 and 14:00–16:30 Bangkok). Predictions whose horizon spans the lunch break or end of day are dropped.

The Headline

L1 alone wins at every horizon.

Mean OOS R² across the 25 stocks, in percentage points. The L1-only row is best at every column.

OOS R² by Depth and Horizon
Heatmap of mean out-of-sample R-squared across 25 stocks for 5 depth configurations and 6 horizons
Brighter = better OOS predictability. The L1-only row is darkest-positive at every horizon.
Depth 5s 10s 30s 60s 5min 15min
L1 only0.781.111.762.182.531.49
L1–L20.741.051.651.972.110.99
L1–L30.710.991.521.771.49−0.70
L1–L50.690.951.431.620.80−2.25
L1–L100.570.710.980.87−1.39−8.02

1. L1 alone is the best predictor at every horizon. Adding levels never helps on average.

2. At long horizons (5min, 15min), deeper-book models go negative OOS — meaning the L10 model fits the test set worse than just predicting the average return. That is a strong overfitting signal: deeper levels look meaningful in training, then collapse out of sample.

The Slope

Every line slopes down. The longer the horizon, the steeper the damage.

Each line traces what happens as we increase the depth of book used in the regression. All slopes are negative.

OOS R² vs Depth, by Horizon
Line chart of OOS R-squared declining with book depth, with one line per horizon, all sloping downward
The deeper the book in the regression, the worse it predicts.
Going from L1 to L1–L10 Median R² change
at 5s−0.17 pp
at 10s−0.34 pp
at 30s−0.66 pp
at 60s−1.25 pp
at 5min−2.76 pp
at 15min−6.37 pp

The deeper the book you use, the worse you predict. The longer the horizon, the more pronounced the damage.

The Mechanism

Why deeper levels hurt.

A reasonable interpretation, consistent with this chapter's findings and the prior chapters of the series.

Deeper levels are noisier than informative. L2 through L10 quantities reflect resting orders far from the action — they update infrequently, get cancelled often, and rarely participate in price discovery. When fed into a regression, they add degrees of freedom but not signal, which causes overfitting on the training set and worse performance on the test set.

This is consistent with the SET microstructure findings from earlier chapters:

If almost all the trading happens at L1, almost all the signal lives at L1. Deeper levels become decoration — useful to display, useless to predict from.

Per-Stock View

Aggregate numbers hide individual variation.

L1 imbalance only, 30-second horizon, 95% bootstrap confidence interval per stock.

OOS R² per Stock — L1, 30s Horizon
Sorted bar chart of OOS R-squared for 25 stocks at L1 and 30-second horizon, with 95 percent bootstrap confidence intervals
Bars sorted by R². CIs that cross zero indicate non-significant predictability.

18 of 25 stocks show statistically significant predictability (95% bootstrap CI excludes zero).

Best: SCM (4.8% R²), CPALL (3.3%), KCE (2.9%), VGI (2.8%), HANA (2.6%), GULF (2.6%).

Essentially unpredictable: PTT (−0.2%, CI crosses zero), EA (0.0%), TOP (0.3%, not significant).

Significance breadth by horizon

Percentage of stocks where the 95% CI lower bound is above zero.

HorizonSignificant
5s76% (19/25)
10s80% (20/25)
30s72% (18/25)
60s72% (18/25)
5min52% (13/25)
15min32% (8/25)

Short-horizon prediction is broadly significant. Long-horizon prediction works only on a minority.

The Damage, Per Stock

No row is rescued by deeper book.

Heatmap of 25 stocks × 5 depth configurations at 30-second horizon. The best column for nearly every row is L1 or L1–L2.

OOS R² per Stock × Depth Config — 30s Horizon
Heatmap of OOS R-squared for 25 stocks across 5 depth configurations at 30-second horizon
Greens are at the top of the chart and on the left columns; reds appear as depth grows.

Stocks where L1 alone works (top of chart) usually keep working at L1–L3, then degrade at L5 and L10. Stocks that don't work at L1 (PTT, EA, TOP) don't get rescued by adding depth — they get worse. Across nearly every row, the best column is L1 or L1–L2, never L1–L10.

There is no stock where adding deeper book levels improved predictability meaningfully out-of-sample.

What This Means

Three audiences, one conclusion.

For Traders

  • If your strategy needs short-horizon mid-price prediction, L1 is enough. You can ignore L2–L10 for forecasting purposes.
  • The marginal data feed cost of deeper levels is real; the marginal predictive value is negative.
  • Stock selection matters more than feature engineering: 18 of 25 have meaningful L1 signal; the other 7 are flat regardless of method.

For Market Structure

  • SET behaves more extreme than developed markets. US equity studies show diminishing returns past L3–L5; SET shows negative returns past L1.
  • The tick grid + L1 concentration explain why.
  • Deeper book on SET is a display artifact, not an information channel.

For Data Infrastructure

  • Whole-book LOB reconstruction is what enables this measurement. You can't compare L1 vs L10 predictability without the full book to test against.
  • For forecasting, you only need L1.
  • The full book matters for execution simulation (Chapter 5 walk-the-book) and for measuring market structure (this chapter), but not for predicting the next move.
Beyond Imbalance

Richer L1 features — no deeper book required.

If L1 is where the signal lives, the natural follow-up is: how much more signal can we extract by using L1 behavior, not just L1 state?

We tested an enriched feature set built entirely from L1 — no deeper book required:

OOS R² — Baseline (L1 imbalance) vs Enriched (L1 dynamics)
Line chart comparing baseline L1 imbalance OOS R-squared against enriched L1 dynamics OOS R-squared across six horizons
Enriched L1 features roughly double predictability at sweet-spot horizons.
Horizon Baseline (L1 imbalance) Enriched (L1 dynamics) Lift
5s0.78%1.04%+0.26 pp
10s1.11%2.15%+1.04 pp
30s1.76%3.00%+1.24 pp
60s2.18%3.52%+1.34 pp
5min2.53%3.00%+0.47 pp
15min1.49%−0.23%−1.72 pp (overfits)

At the sweet-spot horizons (10s–60s), enriching L1 features roughly doubles predictability. The lift is broad-based — 84–96% of stocks see improvement at these horizons.

The contrast with Section 5 is the key finding of this chapter:

Hurts

Add L2–L10 (more depth)

Negative effect on OOS R² at every horizon.

Helps

Add L1 dynamics (more behavior)

Positive effect on OOS R² at every mid-range horizon.

The path to better short-horizon prediction on SET is not deeper book data. It is richer features at L1. Microprice, spread state, imbalance change, and quote-update rate carry information that imbalance alone misses — and they all live at the top of the book.

A trader who can compute these features in real time on L1 data extracts more signal than one paying for L10 and using it naively. This also reinforces why message-level LOB reconstruction matters: most of these features (velocity, message rate, microprice) cannot be computed from a snapshot feed. They require the full message stream.

Honest Caveats

What this analysis does not claim.

Read these before citing the numbers.

  • Sample is one month. Results across regimes (volatility, news, earnings) may differ.
  • 5-second snapshot interval. Sub-5-second horizons (where US literature finds the highest R²) are not measured. We don't know if the L1-dominance pattern holds at millisecond scale.
  • R² ≠ tradeable alpha. A model with 2% R² predicting 30-second returns is valuable only if execution latency is much shorter than 30s and trading cost is far below the predicted return. Most trading desks lose the signal to slippage and fees before they can capture it.
  • Linear OLS only. Non-linear models (random forests, neural networks) might extract additional signal from deeper levels. Our finding is that under linear assumptions, deeper levels hurt. Deep-learning studies on US equities have also found L1 dominance — suggesting the linearity is not the binding constraint — but this should be verified for SET.
  • One feature per level (raw imbalance). More elaborate features (volume-weighted prices, cancellation rates, queue position) might extract more from deeper levels.
  • Top-tier quant funds will already have done variants of this analysis on their universes. The contribution here is the SET-specific result, transparently presented, not the methodology.

This study uses licensed market data obtained through commercial agreement. Infozense is not affiliated with the Stock Exchange of Thailand. No market data is distributed through this website. This content is for educational and analytical purposes only and does not constitute investment advice.

References

Next Chapter

Weak Signals, Strong Verdicts

Chapter 7 stacks weak detectors into strong verdicts — how 588 alerts narrow to 214 confirmed manipulation events through a four-stage funnel.

Continue to Chapter 7
← The Cost of Moving Size
Ch 6 of 13
Weak Signals, Strong Verdicts →