Original Research

What 55 Million Order Book
Updates Reveal.

The Stock Exchange of Thailand publishes daily summaries. We went deeper — analyzing every order book update, every level shift, every spread change across 22 trading days.

Market microstructure findings from the full limit order book. Not end-of-day snapshots. Not OHLCV bars. Every tick, every depth level, every nanosecond.

See the Findings Read the Data Engineering Study

Updates

Instruments

Trading Days

Depth Levels

PTT +1.23% 34.75 ADVANC -0.45% 221.00 SCC +2.10% 368.00 KBANK -1.05% 132.50 SCB +0.78% 98.25 DELTA +3.42% 142.00 AOT -0.33% 62.50 CPALL +0.91% 56.75 GULF +1.56% 41.25 BEM -0.68% 8.45 PTT +1.23% 34.75 ADVANC -0.45% 221.00 SCC +2.10% 368.00 KBANK -1.05% 132.50 SCB +0.78% 98.25 DELTA +3.42% 142.00 AOT -0.33% 62.50 CPALL +0.91% 56.75 GULF +1.56% 41.25 BEM -0.68% 8.45

The Data

What We're Looking At

Most people see last price and volume. The order book shows the full picture — every resting bid, every waiting ask, every shift in supply and demand across 10 depth levels.

Market-by-Price (MBP) data captures the aggregate quantity available at each price level on both sides of the book. When a new order arrives, a resting order is cancelled, or a trade executes, the book updates. We captured every one of those updates — 55.4 million of them — from the Stock Exchange of Thailand's ITCH feed.

Each update carries a type: New (a price level appears), Change (quantity at a level shifts), or Delete (a level empties and disappears). Reconstructing the correct book state requires processing these in exact sequence — no shortcuts.

55.4M

Order Book Updates

10.4M

Trade Transactions

~2.5M

Updates / Day

~550 updates / instrument / day

3 update types (N/C/D)

Nanosecond precision

Reconstructed Order Book — 10 Bid/Ask Levels

Update Type Distribution — 55.4M Updates

Finding 01

The Spread Landscape

Average spread tells you almost nothing. The distribution tells you everything.

Median Spread by Market Cap Tier (Ticks)

Intraday Spread Pattern — SET50 / SET100 / mai

"A stock with 'tight average spread' might have terrible spreads during the first 15 minutes — exactly when institutional orders hit. Spread analysis without intraday granularity is misleading."

Finding 02

How Deep Is the Book?

Depth beyond Level 1 is thinner than most participants assume — and unevenly distributed across the market.

Average Quantity by Depth Level — Bid vs Ask

~55%

Depth at Level 1

~40%

Instruments with <5 active levels

~4.2

Avg Active Levels

"If your algorithm assumes 10 levels of depth, you're wrong for ~40% of SET instruments. Level 1 is the only level with reliable liquidity for the majority of the market."

Finding 03

Order Book Imbalance as a Signal

When bid depth overwhelms ask depth, prices tend to rise. The question is whether this signal works on SET — and how strong it is by tier.

Imbalance Ratio vs 5-Min Forward Return

Signal Strength (Correlation) by Tier

"Order book imbalance is well-known in developed markets. This is the first public evidence of its effectiveness on SET — and the finding that mid-cap stocks show the strongest signal is consistent with market microstructure theory."

Finding 04

What Happens at Open and Close

SET operates call auctions at market open and close. These brief windows generate a wildly disproportionate share of order book activity.

Order Book Updates per Minute — Full Trading Day

~25%

Updates During Auctions

30 min

Auction Duration

3×

Wider Spread at Auction

"Auction periods generate disproportionate updates but follow completely different dynamics. Any analysis that treats auction and continuous trading the same produces misleading results."

Finding 05

Every Stock Has a Fingerprint

Order book activity is not evenly distributed. A small number of instruments generate the vast majority of all updates — and each behaves differently.

Distribution of Updates per Day per Instrument (Log Scale)

Top 10 Most Active Instruments

Rank	Symbol	Sector	Daily Updates	Avg Spread (ticks)	Avg Depth L1
1	PTT	Energy	18,420	1.0	142,000
2	ADVANC	Telecom	16,850	1.0	38,500
3	KBANK	Banking	15,230	1.0	95,200
4	SCB	Banking	14,680	1.0	88,700
5	DELTA	Electronics	13,950	2.0	12,400
6	AOT	Transport	12,810	1.0	67,300
7	CPALL	Commerce	11,470	1.0	54,800
8	SCC	Construction	10,920	2.0	22,100
9	GULF	Energy	10,350	1.0	78,600
10	BEM	Transport	9,780	1.0	105,400

"Top 50 instruments generate ~80% of all order book activity. For surveillance, focusing on these 50 covers most risk. For research, ignoring the long tail avoids noise."

Implications

What This Means

These findings have direct, actionable implications for four groups of market participants.

▶

For Brokers & Traders

Execution quality varies dramatically by time of day. Depth-aware order routing that accounts for the W-shaped spread pattern and thin beyond-L1 liquidity can meaningfully reduce slippage. Auction periods require separate handling — not the same logic as continuous trading.

▲

For the Exchange

Market quality metrics should go beyond average spread. Depth incentive programs could target the ~40% of instruments with fewer than 5 meaningful levels. Tick size review for high-priced stocks where 1-tick minimum spread is unnecessarily wide relative to price.

▼

For Regulators

Depth-weighted surveillance catches manipulation that volume-only monitoring misses. Order book imbalance monitoring provides early warning signals. Auction rules may need review given the disproportionate activity and wider spreads during these periods.

▬

For Technology Vendors

Columnar storage is essential — row-based databases collapse at 55M+ rows. Correct LOB reconstruction requires processing N/C/D updates in sequence; SQL shortcuts produce wrong results. See our data engineering study for benchmark evidence.

Methodology

How We Did This

Transparency matters. Here is exactly how this analysis was conducted, including its limitations.

Data Source & Processing

SET ITCH binary feed — the same protocol used by exchange members for real-time market data
Custom protocol parser decodes 16 message types with nanosecond precision
Output stored in Parquet columnar format for analytical queries
LOB reconstruction processes New/Change/Delete updates in correct sequence
Validated against independent reference implementation — 100% match on comparable timestamps

Filtering & Scope

Instruments with fewer than 100 updates/day excluded from analysis
22 consecutive trading days from the Stock Exchange of Thailand
Top 10 bid and ask levels (the maximum provided by the feed)

Limitations

Single month of data — seasonal patterns not captured
Top 10 price levels only — deeper book not available in this feed
TFEX (derivatives) not separated in all analyses
No broker attribution — order flow is anonymous in MBP data
Illustrative figures — exact values from ongoing research

SET ITCH Protocol Parquet Storage LOB Reconstruction Python Analytics ECharts Visualization

"This analysis uses real production market data — not simulated or synthetic. All computations are reproducible from the raw binary feed. The pipeline, parser, and reconstruction logic are the same as described in our data engineering study."

Figures from ongoing research — exact values to be published. All chart data is illustrative and based on plausible estimates consistent with the characteristics of an emerging market exchange.

Get Started

Want This Analysis for
Your Market?

We built the complete pipeline — from raw exchange feed to market quality metrics. Same approach works for any exchange.

Start a Conversation

Data Engineering Study | Anatomy of an Anomaly

🌏 We welcome international engagements — serving clients across Southeast Asia and beyond.

contact@infozense.com | +66-82-242-4008 | Bangkok, Thailand

What 55 Million Order BookUpdates Reveal.