Chapter 21: Reinforcement Learning

Distributional RL and Tail-Aware Action Selection advanced

Learn the return distribution, not just its mean, so execution and hedging policies can be chosen by tail risk as well as average performance.

Learn the return distribution, not just its mean, so execution and hedging policies can be chosen by tail risk as well as average performance.

Register to Read

Sign up for a free account to access all 112 primer topics.

Create Free Account

Already have an account? Sign in

References

Deep Reinforcement Learning for Trading—A Critical Survey
Adrian Millea (2021) — Data
TIME Benchmark: Fresh Datasets for Zero-Shot Forecasting Integrity
Han Zou, et al. (2025)
Recent Advances in Reinforcement Learning in Finance
Ben Hambly, Renyuan Xu, Huining Yang (2023)
Deep hedging
H. Buehler, L. Gonon, J. Teichmann, B. Wood (2019) — Quantitative Finance