User Guide
Use the User Guide when you are moving from a first fetch to a repeatable workflow with configuration, storage, validation, and updates.
Core Features
- CLI Reference - Command-line interface for fetching and managing data.
- Configuration - YAML configuration for automated data pipelines.
- Incremental Updates - Efficient daily updates with gap detection.
- Data Quality - OHLC validation and anomaly detection.
- Storage - Hive-partitioned Parquet storage system.
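To make "OHLC validation" and "gap detection" concrete, here is a minimal sketch using only the Python standard library. It is not ML4T Data's API: the function names `ohlc_violations` and `missing_weekdays`, the dictionary keys, and the Monday-to-Friday calendar are all assumptions made for illustration. The actual checks and calendar handling are described on the Data Quality and Incremental Updates pages.

```python
from datetime import date, timedelta


def ohlc_violations(bar: dict) -> list[str]:
    """Return the OHLC invariants that a single bar breaks (hypothetical helper)."""
    problems = []
    if bar["high"] < bar["low"]:
        problems.append("high < low")
    if not bar["low"] <= bar["open"] <= bar["high"]:
        problems.append("open outside [low, high]")
    if not bar["low"] <= bar["close"] <= bar["high"]:
        problems.append("close outside [low, high]")
    if bar.get("volume", 0) < 0:
        problems.append("negative volume")
    return problems


def missing_weekdays(stored: set[date], start: date, end: date) -> list[date]:
    """List Monday-Friday dates in [start, end] that are absent from `stored`.

    Simplifying assumption: exchange holidays are ignored, so a real
    trading calendar would report fewer gaps.
    """
    gaps = []
    day = start
    while day <= end:
        if day.weekday() < 5 and day not in stored:
            gaps.append(day)
        day += timedelta(days=1)
    return gaps


if __name__ == "__main__":
    bar = {"open": 10.0, "high": 9.0, "low": 9.5, "close": 10.5, "volume": 1000}
    print(ohlc_violations(bar))

    stored = {date(2024, 1, 1), date(2024, 1, 2), date(2024, 1, 4)}
    print(missing_weekdays(stored, date(2024, 1, 1), date(2024, 1, 7)))
```

An incremental update built on this idea would fetch only the dates returned by the gap check instead of re-downloading the full history.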
Performance
ML4T Data is optimized for speed. Timings for 100 symbols:
| Operation | Time |
|---|---|
| Sequential fetch (100 symbols) | 50-100 s |
| Async batch fetch (100 symbols) | 5-15 s |
| Load from storage (100 symbols) | 0.165 s |
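The gap between sequential and async batch fetching comes from overlapping network waits. The sketch below illustrates the principle with `asyncio` from the standard library. It does not call ML4T Data: `fetch_symbol` is a hypothetical stand-in that sleeps instead of making a request, and the concurrency limit of 10 is an arbitrary assumption. It will not reproduce the timings in the table.

```python
import asyncio
import time


async def fetch_symbol(symbol: str, delay: float = 0.05) -> str:
    """Hypothetical stand-in for one network request; sleeps instead of fetching."""
    await asyncio.sleep(delay)
    return symbol


async def fetch_sequential(symbols: list[str]) -> list[str]:
    """Await each request before starting the next one."""
    return [await fetch_symbol(s) for s in symbols]


async def fetch_concurrent(symbols: list[str], limit: int = 10) -> list[str]:
    """Run requests concurrently, with at most `limit` in flight at once."""
    semaphore = asyncio.Semaphore(limit)

    async def bounded(symbol: str) -> str:
        async with semaphore:
            return await fetch_symbol(symbol)

    # gather preserves the input order of the results.
    return await asyncio.gather(*(bounded(s) for s in symbols))


def timed(coro) -> tuple[list[str], float]:
    """Run a coroutine to completion and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = asyncio.run(coro)
    return result, time.perf_counter() - start


if __name__ == "__main__":
    symbols = [f"SYM{i}" for i in range(20)]
    _, t_seq = timed(fetch_sequential(symbols))
    _, t_con = timed(fetch_concurrent(symbols))
    print(f"sequential: {t_seq:.2f}s, concurrent: {t_con:.2f}s")
```

The semaphore caps the number of simultaneous requests, which is the usual way to stay within a provider's rate limits while still overlapping most of the waiting time.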
Next Steps
- API Reference - Auto-generated from source
- Tutorials - Step-by-step guides
- Book Guide - Chapter and notebook mapping for ML4T
- Contributing - Add your own provider