3rd Edition

The ML4T Primer

Foundations the book builds on and deeper technical treatments it references — standalone companion topics organized by chapter.

112
Topics
20
Foundational
66
Intermediate
26
Advanced
Cross-Chapter

Concepts referenced throughout the book

Foundational building blocks

ACF and PACF Interpretation foundational
How total and direct lag dependence leave different signatures in time-series plots, and why those signatures are useful but never definitive.
Bayes' Theorem and Posterior Distributions foundational
Bayesian inference updates a probability distribution over unknown quantities rather than replacing uncertainty with a single estimate.
Causality, Confounding, and Why Good Signals Can Be Misleading foundational
A predictive relationship can be real in the data and still fail as an explanation of what would happen under intervention.
Covariance Matrices, Estimation, and Why They Break foundational
How a covariance matrix summarizes co-movement, why the sample version becomes unstable in high dimensions, and why shrinkage is the default repair.
Eigenvalues, Eigenvectors, and the Geometry of Covariance foundational
How symmetric matrices reveal natural directions of variation, and why that matters for PCA and statistical risk factors.
Hypothesis Testing and P-Values foundational
How hypothesis tests turn noisy evidence into a structured decision, and how to read p-values without treating them as proof.
Markov Chains and the Markov Property foundational
How a state representation can compress history into current conditions, and why that matters for regime models and decision processes.
Momentum and Mean Reversion foundational
How return predictability changes with horizon, how cross-sectional and time-series momentum differ, and why the same signal that works for months can fail violently in a rebound.
Multiple Testing and the Researcher’s Trap foundational
Why searching many ideas makes false discoveries and overstated winners inevitable.
Point-in-Time Data and Decision-Time Correctness foundational
A value is usable only if it was actually knowable when the strategy had to decide.
Sharpe Ratio: Definition, Annualization, and Estimation Noise foundational
What the Sharpe ratio measures, when the usual scaling works, and why a backtest Sharpe is noisier than it looks.
Simple Returns vs Log Returns foundational
One aggregates exactly across assets, the other aggregates exactly across time. Most mistakes come from asking one definition to do both jobs.
Stationarity and Unit Roots foundational
Why time-series stability matters, why random walks mislead regressions, and what differencing fixes and destroys.
Stylized Facts of Financial Time Series for Simulation and Validation foundational
A synthetic market path is only useful if it reproduces the empirical pathologies that make financial returns hard to model in the first place.
The Bias-Variance Tradeoff foundational
Why a model that is deliberately a little wrong can generalize better than one that fits the past too closely.
The Information Coefficient foundational
How cross-sectional rank correlation measures signal quality, and why a small edge can still matter when it is applied repeatedly.
Trading Costs: Spread, Slippage, and Market Impact foundational
How execution costs arise, why the components are different, and how turnover turns a predictive signal into a net strategy.
Training Neural Networks foundational
How forward passes, losses, backpropagation, optimization, and regularization work together so the architecture
Volatility: Realized, Implied, and Why It Clusters foundational
Volatility is not one number but a family of related objects: what happened, what the options market prices, and what a model forecasts next.
Walk-Forward Validation for Time Series foundational
Why model evaluation must preserve temporal order, and how expanding or rolling splits approximate live deployment.