Original Research

What 55 Million Order Book
Updates Reveal.

The Stock Exchange of Thailand publishes daily summaries. We went deeper — analyzing every order book update, every level shift, every spread change across 22 trading days.

Market microstructure findings from the full limit order book. Not end-of-day snapshots. Not OHLCV bars. Every tick, every depth level, every nanosecond.

0
Updates
0
Instruments
0
Trading Days
0
Depth Levels
PTT +1.23% 34.75 ADVANC -0.45% 221.00 SCC +2.10% 368.00 KBANK -1.05% 132.50 SCB +0.78% 98.25 DELTA +3.42% 142.00 AOT -0.33% 62.50 CPALL +0.91% 56.75 GULF +1.56% 41.25 BEM -0.68% 8.45 PTT +1.23% 34.75 ADVANC -0.45% 221.00 SCC +2.10% 368.00 KBANK -1.05% 132.50 SCB +0.78% 98.25 DELTA +3.42% 142.00 AOT -0.33% 62.50 CPALL +0.91% 56.75 GULF +1.56% 41.25 BEM -0.68% 8.45
The Data

What We're Looking At

Most people see last price and volume. The order book shows the full picture — every resting bid, every waiting ask, every shift in supply and demand across 10 depth levels.

Market-by-Price (MBP) data captures the aggregate quantity available at each price level on both sides of the book. When a new order arrives, a resting order is cancelled, or a trade executes, the book updates. We captured every one of those updates — 55.4 million of them — from the Stock Exchange of Thailand's ITCH feed.

Each update carries a type: New (a price level appears), Change (quantity at a level shifts), or Delete (a level empties and disappears). Reconstructing the correct book state requires processing these in exact sequence — no shortcuts.

55.4M
Order Book Updates
10.4M
Trade Transactions
~2.5M
Updates / Day
~550 updates / instrument / day
3 update types (N/C/D)
Nanosecond precision
Reconstructed Order Book — 10 Bid/Ask Levels
Update Type Distribution — 55.4M Updates
Finding 01

The Spread Landscape

Average spread tells you almost nothing. The distribution tells you everything.

Median Spread by Market Cap Tier (Ticks)
Intraday Spread Pattern — SET50 / SET100 / mai

"A stock with 'tight average spread' might have terrible spreads during the first 15 minutes — exactly when institutional orders hit. Spread analysis without intraday granularity is misleading."

Finding 02

How Deep Is the Book?

Depth beyond Level 1 is thinner than most participants assume — and unevenly distributed across the market.

Average Quantity by Depth Level — Bid vs Ask
~55%
Depth at Level 1
~40%
Instruments with <5 active levels
~4.2
Avg Active Levels

"If your algorithm assumes 10 levels of depth, you're wrong for ~40% of SET instruments. Level 1 is the only level with reliable liquidity for the majority of the market."

Finding 03

Order Book Imbalance as a Signal

When bid depth overwhelms ask depth, prices tend to rise. The question is whether this signal works on SET — and how strong it is by tier.

Imbalance Ratio vs 5-Min Forward Return
Signal Strength (Correlation) by Tier

"Order book imbalance is well-known in developed markets. This is the first public evidence of its effectiveness on SET — and the finding that mid-cap stocks show the strongest signal is consistent with market microstructure theory."

Finding 04

What Happens at Open and Close

SET operates call auctions at market open and close. These brief windows generate a wildly disproportionate share of order book activity.

Order Book Updates per Minute — Full Trading Day
~25%
Updates During Auctions
30 min
Auction Duration
Wider Spread at Auction

"Auction periods generate disproportionate updates but follow completely different dynamics. Any analysis that treats auction and continuous trading the same produces misleading results."

Finding 05

Every Stock Has a Fingerprint

Order book activity is not evenly distributed. A small number of instruments generate the vast majority of all updates — and each behaves differently.

Distribution of Updates per Day per Instrument (Log Scale)

Top 10 Most Active Instruments

Rank Symbol Sector Daily Updates Avg Spread (ticks) Avg Depth L1
1PTTEnergy18,4201.0142,000
2ADVANCTelecom16,8501.038,500
3KBANKBanking15,2301.095,200
4SCBBanking14,6801.088,700
5DELTAElectronics13,9502.012,400
6AOTTransport12,8101.067,300
7CPALLCommerce11,4701.054,800
8SCCConstruction10,9202.022,100
9GULFEnergy10,3501.078,600
10BEMTransport9,7801.0105,400

"Top 50 instruments generate ~80% of all order book activity. For surveillance, focusing on these 50 covers most risk. For research, ignoring the long tail avoids noise."

Implications

What This Means

These findings have direct, actionable implications for four groups of market participants.

For Brokers & Traders

Execution quality varies dramatically by time of day. Depth-aware order routing that accounts for the W-shaped spread pattern and thin beyond-L1 liquidity can meaningfully reduce slippage. Auction periods require separate handling — not the same logic as continuous trading.

For the Exchange

Market quality metrics should go beyond average spread. Depth incentive programs could target the ~40% of instruments with fewer than 5 meaningful levels. Tick size review for high-priced stocks where 1-tick minimum spread is unnecessarily wide relative to price.

For Regulators

Depth-weighted surveillance catches manipulation that volume-only monitoring misses. Order book imbalance monitoring provides early warning signals. Auction rules may need review given the disproportionate activity and wider spreads during these periods.

For Technology Vendors

Columnar storage is essential — row-based databases collapse at 55M+ rows. Correct LOB reconstruction requires processing N/C/D updates in sequence; SQL shortcuts produce wrong results. See our data engineering study for benchmark evidence.

Methodology

How We Did This

Transparency matters. Here is exactly how this analysis was conducted, including its limitations.

Data Source & Processing

  • SET ITCH binary feed — the same protocol used by exchange members for real-time market data
  • Custom protocol parser decodes 16 message types with nanosecond precision
  • Output stored in Parquet columnar format for analytical queries
  • LOB reconstruction processes New/Change/Delete updates in correct sequence
  • Validated against independent reference implementation — 100% match on comparable timestamps

Filtering & Scope

  • Instruments with fewer than 100 updates/day excluded from analysis
  • 22 consecutive trading days from the Stock Exchange of Thailand
  • Top 10 bid and ask levels (the maximum provided by the feed)

Limitations

  • Single month of data — seasonal patterns not captured
  • Top 10 price levels only — deeper book not available in this feed
  • TFEX (derivatives) not separated in all analyses
  • No broker attribution — order flow is anonymous in MBP data
  • Illustrative figures — exact values from ongoing research
SET ITCH Protocol Parquet Storage LOB Reconstruction Python Analytics ECharts Visualization

"This analysis uses real production market data — not simulated or synthetic. All computations are reproducible from the raw binary feed. The pipeline, parser, and reconstruction logic are the same as described in our data engineering study."

Figures from ongoing research — exact values to be published. All chart data is illustrative and based on plausible estimates consistent with the characteristics of an emerging market exchange.

Get Started

Want This Analysis for
Your Market?

We built the complete pipeline — from raw exchange feed to market quality metrics. Same approach works for any exchange.

Start a Conversation

Data Engineering Study  |  Anatomy of an Anomaly

🌏 We welcome international engagements — serving clients across Southeast Asia and beyond.

contact@infozense.com  |  +66-82-242-4008  |  Bangkok, Thailand