Advances in Financial Machine Learning

Absolute return attribution method
Accounting data
Accuracy
- binary classification problems and
- measurement of
AdaBoost implementation
Adaptable I/O System (ADIOS)
Alternative data
Amihud's lambda
Analytics
Annualized Sharpe ratio
Annualized turnover, in backtesting
Asset allocation
- classical areas of mathematics used in
- covariance matrix in
- diversification in
- Markowitz's approach to
- Monte Carlo simulations for
- numerical example of
- practical problems in
- quasi-diagonalization in
- recursive bisection in
- risk-based. See also Risk-based asset allocation approaches
- tree clustering approaches to
Attribution
Augmented Dickey-Fuller (ADF) test. See also Supremum augmented Dickey-Fuller (SADF) test
Average holding period, in backtesting
Average slippage per turnover

Backfilled data
Backtesters
Backtesting
- bet sizing in
- common errors in
- combinatorial purged cross-validation (CPCV) method in
- cross-validation (CV) for
- customization of
- definition of
- “false discovery” probability and
- flawless completion as daunting task in
- general recommendations on
- machine learning asset allocation and
- purpose of
- as research tool
- strategy risk and
- strategy selection in
- synthetic data in
- uses of results of
- walk-forward (WF) method of
Backtest overfitting
- backtesters’ evaluation of probability of
- bagging to reduce
- combinatorial purged cross-validation (CPCV) method for
- concerns about risk of
- cross-validation (CV) method and
- decision trees and proneness to
- definition of
- discretionary portfolio managers (PMs) and
- estimating extent of
- features stacking to reduce
- general recommendations on
- historical simulations in trading rules and
- hyper-parameter tuning and
- need for skepticism
- optimal trading rule (OTR) framework for
- probability of. See Probability of backtest overfitting (PBO)
- random forest (RF) method to reduce
- selection bias and
- support vector machines (SVMs) and
- trading rules and
- walk-forward (WF) method and
Backtest statistics
- classification measurements in
- drawdown (DD) and time under water (TuW) in
- efficiency measurements in
- general description of
- holding period estimator in
- implementation shortfall in
- performance attribution and
- performance measurements in
- returns concentration in
- runs in
- run measurements in
- time-weighted rate of returns (TWRR) in
- timing of bets from series of target positions in
- types of
Bagging
- accuracy improvement using
- boosting compared with
- leakage reduction using
- observation redundancy and
- overfitting reduction and
- random forest (RF) method compared with
- scalability using
- variance reduction using
Bars (table rows)
- categories of
- dollar bars
- dollar imbalance bars
- dollar runs bars
- information-driven bars
- standard bars
- tick bars
- tick imbalance bars
- tick runs bars
- time bars
- volume bars
- volume imbalance bars
- volume runs bars
Becker-Parkinson volatility algorithm
Bet sizing
- average active bets approach in
- bet concurrency calculation in
- budgeting approach to
- dynamic bet sizes and limit prices in
- holding periods and
- investment strategies and
- meta-labeling approach to
- performance attribution and
- predicted probabilities and
- runs and increase in
- size discretization in
- strategy-independent approaches to
- strategy's capacity and
Bet timing, deriving
Betting frequency
- backtesting and
- computing
- implied precision computation and
- investment strategy with trade-off between precision and
- strategy risk and
- targeting Sharpe ratio for
- trade size and
Bias
Bid-ask spread estimator
Bid wanted in competition (BWIC)
big data analysis
Bloomberg
Boosting
- AdaBoost implementation of
- bagging compared with
- implementation of
- main advantage of
- variance and bias reduction using
Bootstrap aggregation. See Bagging
Bootstraps, sequential
Box-Jenkins analysis
Broker fees per turnover
Brown-Durbin-Evans CUSUM test

Cancellation rates
Capacity, in backtesting
Chow-type Dickey-Fuller test
Chu-Stinchcombe-White CUSUM test
Classification models
Classification problems
- class weights for underrepresented labels in
- generating synthetic dataset for
- meta-labeling and
Classification statistics
Class weights
- decision trees using
- functionality for handling
- underrepresented label correction using
Cloud systems
Combinatorially symmetric cross-validation (CSCV) method
Combinatorial purged cross-validation (CPCV) method
- algorithm steps in
- backtest overfitting and
- combinatorial splits in
- definition of
- examples of
Compressed markets
Computational Intelligence and Forecasting Technologies (CIFT) project
- Adaptable I/O System (ADIOS) and
- business applications developed by
- Flash Crash of 2010 response and
- mission of
Conditional augmented Dickey-Fuller (CADF) test
Correlation to underlying, in backtesting
Corwin-Schultz algorithm
Critical Line Algorithm (CLA)
- description of
- Markowitz's development of
- Monte Carlo simulations using
- numerical example with
- open-source implementation of
- practical problems with
Cross-entropy loss (log loss) scoring
Cross-validation (CV)
- backtesting through
- combinatorial purged cross-validation (CPCV) method in
- embargo on training observations in
- failures in finance using
- goal of
- hyper-parameter tuning with
- k-fold
- leakage in
- model development and
- overlapping training observations in
- purpose of
- purging process in training set for leakage reduction in
- sklearn bugs in
CUSUM filter
CUSUM tests
CV. See Cross-validation

Data analysis
- financial data structures and
- fractionally differentiated features and
- labeling and
- sample weights and
Data curators
Data mining and data snooping
Decision trees
Decompressed markets
Deflated Sharpe ratio (DSR)
Deployment team
Dickey-Fuller test
- Chow type
- supremum augmented (SADF)
Discretionary portfolio managers (PMs)
Discretization of bet size
Diversification
Dollar bars
Dollar imbalance bars (DIBs)
Dollar performance per turnover
Dollar runs bars (DRBs)
Downsampling
Drawdown (DD)
- definition of
- deriving
- example of
- run measurements using
Dynamic bet sizes

Econometrics
- financial Big Data analysis and
- financial machine learning versus
- HRP approach compared with
- investment strategies based in
- paradigms used in
- substitution effects and
- trading rules and
Efficiency measurements
- annualized Sharpe ratio and
- deflated Sharpe ratio (DSR) and
- information ratio and
- probabilistic Sharpe ratio (PSR) and
- Sharpe ratio (SR) definition in
Efficient frontier
Electricity consumption analysis
Engle-Granger analysis
Ensemble methods
- boosting and
- bootstrap aggregation (bagging) and
- random forest (RF) method and
Entropy features
- encoding schemes in estimates of
- financial applications of
- generalized mean and
- Lempel-Ziv (LZ) estimator in
- maximum likelihood estimator in
- Shannon's approach to
ETF trick
Event-based sampling
Excess returns, in information ratio
Execution costs
Expanding window method
Explosiveness tests
- Chow-type Dickey-Fuller test
- supremum augmented Dickey-Fuller (SADF) test

Factory plan
Feature analysts
Feature importance
- features stacking approach to
- importance of
- mean decrease accuracy (MDA) and
- mean decrease impurity (MDI) and
- orthogonal features and
- parallelized approach to
- plotting function for
- random forest (RF) method and
- as research tool
- single feature importance (SFI) and
- synthetic data experiments with
- weighted Kendall's tau computation in
- without substitution effects
- with substitution effects
Features stacking importance
Filter trading strategy
Finance
- algorithmization of
- human investors’ abilities in
- purpose and role of
- usefulness of ML algorithms in
Financial data
- alternative
- analytics and
- essential types of
- fundamental
- market
Financial data structures
- bars (table rows) in
- multi-product series in
- sampling features in
- unstructured, raw data as starting point for
Financial Information eXchange (FIX) messages
Financial machine learning
- econometrics versus
- prejudices about use of
- standard machine learning separate from
Financial machine learning projects
- reasons for failure of
- structure by strategy component in
Fixed-time horizon labeling method
Fixed-width window fracdiff (FFD) method
Flash crashes
- class weights for predicting
- “early warning” indicators in
- high performance computing (HPC) tools and
- signs of emerging illiquidity events and
Flash Crash of 2010
F1 scores
- measurement of
- meta-labeling and
Fractional differentiation
- data transformation method for stationarity in
- earlier methods of
- expanding window method for
- fixed-width window fracdiff (FFD) method for
- maximum memory preservation in
Frequency. See Betting frequency
Fundamental data
Fusion collaboration project
Futures
- dollar bars and
- ETF trick with
- non-negative rolled price series and
- single futures roll method with
- volume bars and

Gaps series, in single future roll method
Global Investment Performance Standards (GIPS)
Graph theory
Grid search cross-validation

Hasbrouck's lambda
Hedging weights
Herfindahl-Hirschman Index (HHI) concentration
HHI indexes
- on negative returns
- on positive returns
- on time between bets
Hierarchical Data Format 5 (HDF5)
Hierarchical Risk Parity (HRP) approach
- econometric regression compared with
- full implementation of
- Monte Carlo simulations using
- numerical example of
- practical application of
- quasi-diagonalization in
- recursive bisection in
- standard approaches compared with
- traditional risk parity approach compared with
- tree clustering approaches to
High-frequency trading
High-low volatility estimator
High-performance computing (HPC)
- ADIOS and
- atoms and molecules in parallelization and
- CIFT business applications and
- cloud systems compared with
- combinatorial optimization and
- electricity consumption analysis using
- Flash Crash of 2010 response and
- fusion collaboration project using
- global dynamic optimal trajectory and
- hardware for
- integer optimization approach and
- multiprocessing and
- objective function and
- pattern-finding capability in
- software for
- streaming data analysis using
- supernova hunting using
- use cases for
- vectorization and
Holding periods
- backtesting and
- bet sizing and
- estimating in strategy
- optimal trading rule (OTR) algorithm with
- triple-period labeling method and
Hyper-parameter tuning
- grid search cross-validation and
- log loss scoring used with
- randomized search cross-validation and

Implementation shortfall statistics
Implied betting frequency
Implied precision computation
Indicator matrix
Information-driven bars (table rows)
- dollar imbalance bars
- dollar runs bars
- purpose of
- tick imbalance bars
- tick runs bars
- volume imbalance bars
- volume runs bars
Information ratio
Inverse-Variance Portfolio (IVP)
- asset allocation numerical example of
- Monte Carlo simulations using
Investment strategies
- algorithmization of
- bet sizing in
- evolution of
- exit conditions in
- human investors’ abilities and
- log loss scoring used with hyper-parameter tuning in
- profit-taking and stop-loss limits in
- risk in. See Strategy risk
- structural breaks and
- trade-off between precision and frequency in
- trading rules and algorithms in
Investment strategy failure probability
- algorithm in
- implementation of algorithm in
- probabilistic Sharpe ratio (PSR) similarity to
- strategy risk and

K-fold cross-validation (CV)
- description of
- embargo on training observations in
- leakage in
- mean decrease accuracy (MDA) feature with
- overlapping training observations in
- purging process in training set for leakage reduction in
- when used
Kyle's lambda

Labeling
- daily volatility at intraday estimation for
- dropping unnecessary or under-populated labels in
- fixed-time horizon labeling method for
- learning side and size in
- meta-labeling and
- quantamental approach using
- triple-barrier labeling method for
Labels
- average uniqueness over lifespan of
- class weights for underrepresented labels
- estimating uniqueness of
Lawrence Berkeley National Laboratory (LBNL, Berkeley Lab)
Leakage, and cross-validation (CV)
Leakage reduction
- bagging for
- purging process in training set for
- sequential bootstraps for
- walk-forward timefolds method for
Lempel-Ziv (LZ) estimator
Leverage, in backtesting
Limit prices, in bet sizing
Log loss scoring, in hyper-parameter tuning
Log-uniform distribution
Look-ahead bias

Machine learning (ML)
- finance and
- financial machine learning separate from
- HRP approach using
- human investors and
- prejudices about use of
Machine learning asset allocation. See also Hierarchical Risk Parity (HRP) approach
- Monte Carlo simulations for
- numerical example of
- quasi-diagonalization in
- recursive bisection in
- tree clustering approaches to
Market data
Markowitz, Harry
Maximum dollar position size, in backtesting
Maximum likelihood estimator, in entropy
Mean decrease accuracy (MDA) feature importance
- computed on synthetic dataset
- considerations in working with
- implementation of
- single feature importance (SFI) and
Mean decrease impurity (MDI) feature importance
- computed on synthetic dataset
- considerations in working with
- implementation of
- single feature importance (SFI) and
Message Passing Interface (MPI)
Meta-labeling
- bet sizing using
- code enhancements for
- description of
- dropping unnecessary or under-populated labels in
- how to use
- quantamental approach using
Meta-strategy paradigm
Microstructural features
- alternative features of
- Amihud's lambda and
- bid-ask spread estimator (Corwin-Schultz algorithm) and
- Hasbrouck's lambda and
- high-low volatility estimator and
- Kyle's lambda and
- microstructural information definition and
- probability of informed trading and
- Roll model and
- sequential trade models and
- strategic trade models and
- tick rule and
- volume-synchronized probability of informed trading (VPIN) and
Mixture of Gaussians (EF3M)
Model development
- cross-validation (CV) for
- overfitting reduction and
- single feature importance method and
Modelling
- applications of entropy to
- backtesting in
- cross-validation in
- econometrics and
- ensemble methods in
- entropy features in
- feature importance in
- hyper-parameter tuning with cross-validation in
- market microstructure theories and
- three sources of errors in
Monte Carlo simulations
- machine learning asset allocation and
- sequential bootstraps evaluation using
Multi-product series
- ETF trick for
- PCA weights for
- single future roll in

National laboratories
Negative (neg) log loss scores
- hyper-parameter tuning using
- measurement of
Noise, causes of
Non-negative rolled price series

Optimal trading rule (OTR) framework
- algorithm steps in
- cases with negative long-run equilibrium in
- cases with positive long-run equilibrium in
- cases with zero long-run equilibrium in
- experimental results using simulation in
- implementation of
- overfitting and
- profit-taking and stop-loss limits in
- synthetic data for determination of
Options markets
Ornstein-Uhlenbeck (O-U) process
Orthogonal features
- benefits of
- computation of
- implementation of
Outliers, in quantitative investing
Overfitting. See Backtest overfitting

Parallelized feature importance
PCA (see Principal components analysis)
Performance attribution
Plotting function for feature importance
PnL (mark-to-market profits and losses)
- ETF trick and
- performance attribution using
- as performance measurement
- rolled prices for simulating
PnL from long positions
Portfolio construction. See also Hierarchical Risk Parity (HRP) approach
- classical areas of mathematics used in
- covariance matrix in
- diversification in
- entropy and concentration in
- Markowitz's approach to
- Monte Carlo simulations for
- numerical example of
- practical problems in
- tree clustering approaches to
Portfolio oversight
Portfolio risk. See also Hierarchical Risk Parity (HRP) approach; Risk; Strategy risk
- portfolio decisions based on
- probability of strategy failure and
- strategy risk differentiated from
Portfolio turnover costs
Precision
- binary classification problems and
- investment strategy with trade-off between frequency and
- measurement of
Predicted probabilities, in bet sizing
Principal components analysis (PCA)
- hedging weights using
- linear substitution effects and
Probabilistic Sharpe ratio (PSR)
- calculation of
- as efficiency statistic
- probability of strategy failure, similarity to
Probability of backtest overfitting (PBO)
- backtest overfitting evaluation using
- combinatorially symmetric cross-validation (CSCV) method for
- strategy selection based on estimation of
Probability of informed trading (PIN)
Probability of strategy failure
- algorithm in
- implementation of algorithm in
- probabilistic Sharpe ratio (PSR), similarity to
- strategy risk and
Profit-taking, and investment strategy exit
Profit-taking limits
- asymmetric payoff dilemma and
- cases with negative long-run equilibrium and
- cases with positive long-run equilibrium and
- cases with zero long-run equilibrium and
- daily volatility at intraday estimation points computation and
- investment strategies using
- learning side and size and
- optimal trading rule (OTR) algorithm for
- strategy risk and
- triple-barrier labeling method for
Purged K-fold cross-validation (CV)
- grid search cross-validation and
- hyper-parameter tuning with
- implementation of
- randomized search cross-validation and
Python

Quantamental approach
Quantamental funds
Quantitative investing
- backtest overfitting in
- failure rate in
- meta-strategy paradigm in
- quantamental approach in
- seven sins of
Quantum computing

Random forest (RF) method
- alternative ways of setting up
- bagging compared with
- bet sizing using
Randomized search cross-validation
Recall
- binary classification problems and
- measurement of
Reinstated value
Return attribution method
Return on execution costs
Returns concentration
RF. See Random forest (RF) method
Right-tail unit-root tests
Risk. See also Hierarchical Risk Parity (HRP) approach; Strategy risk
- backtest statistics for uncovering
- entropy application to portfolio concentration and
- liquidity and
- ML algorithms for monitoring
- PCA weights and
- portfolio oversight and
- profit taking and stop-loss limits and
- structural breaks and
- walk-forward (WF) approach and
Risk-based asset allocation approaches
- HRP approach comparisons in
- structural breaks and
Risk parity. See also Hierarchical Risk Parity (HRP) approach
- HRP approach compared with traditional approach to
Rolled prices
Roll model

Sample weights
- average uniqueness of labels over lifespan and
- bagging classifiers and uniqueness and
- indicator matrix for
- mean decrease accuracy (MDA) feature importance with
- number of concurrent labels and
- overlapping outcomes and
- return attribution method and
- sequential bootstrap and
- time-decay factors and
Sampling features
- downsampling and
- event-based sampling and
Scalability
- bagging for
- sample size in ML algorithms and
Scikit-learn (sklearn)
- class weights in
- cross-validation (CV) bugs in
- grid search cross-validation in
- labels and bug in
- mean decrease impurity (MDI) and
- neg log loss as scoring statistic and bug in
- observation redundancy and bagging classifiers in
- random forest (RF) overfitting and
- support vector machine (SVM) implementation in
- synthetic dataset generation in
- walk-forward timefolds method in
Selection bias
Sequential bootstraps
- description of
- implementation of
- leakage reduction using
- Monte Carlo experiments evaluating
- numerical example of
Shannon, Claude
Sharpe ratio (SR) in efficiency measurements
- annualized
- definition of
- deflated (DSR)
- information ratio and
- probabilistic (PSR)
- purpose of
- targeting, for various betting frequencies
Shorting, in quantitative investing
Signal order flows
Simulations, overfitting of
Single feature importance (SFI)
Single future roll
Sklearn. See Scikit-learn
Stacked feature importance
Standard bars (table rows)
- dollar bars
- purpose of
- tick bars
- time bars
- volume bars
Stationarity
- data transformation method to ensure
- fractional differentiation applied to
- fractional differentiation implementation methods for
- integer transformation for
- maximum memory preservation for
- memory loss dilemma and
Stop-loss, and investment strategy exit
Stop-loss limits
- asymmetric payoff dilemma and
- cases with negative long-run equilibrium and
- cases with positive long-run equilibrium and
- cases with zero long-run equilibrium and
- daily volatility computation and
- fixed-time horizon labeling method and
- investment strategies using
- learning side and size and
- optimal trading rule (OTR) algorithm for
- strategy risk and
- triple-barrier labeling method for
Storytelling
Strategists
Strategy risk
- asymmetric payouts and
- calculating
- implied betting frequency and
- implied precision and
- investment strategies and understanding of
- portfolio risk differentiated from
- probabilistic Sharpe ratio (PSR) similarity to
- strategy failure probability and
- symmetric payouts and
Structural breaks
- CUSUM tests in
- explosiveness tests in
- sub- and super-martingale tests in
- types of tests in
Sub- and super-martingale tests
Supernova research
Support vector machines (SVMs)
Supremum augmented Dickey-Fuller (SADF) test
- conditional ADF
- implementation of
- quantile ADF
Survivorship bias
SymPy Live
Synthetic data
- backtesting using
- experimental results using simulation combinations with
- optimal trading rule (OTR) framework using

Tick bars
Tick imbalance bars (TIBs)
Tick rule
Tick runs bars (TRBs)
Time bars
- description of
- fixed-time horizon labeling method using
Time-decay factors, and sample weights
Time period, in backtesting
Time series
- fractional differentiation applied to
- integer transformation for stationarity in
- stationarity vs. memory loss dilemma in
Time under water (TuW)
- definition of
- deriving
- example of
- run measurements using
Time-weighted average price (TWAP)
Time-weighted rate of returns (TWRR)
Trading rules
- investment strategies and algorithms in
- optimal trading rule (OTR) framework for
- overfitting in
Transaction costs, in quantitative investing
Tree clustering approaches, in asset allocation
Triple-barrier labeling method
Turnover costs

Variance
- boosting to reduce
- causes of
- ensemble methods to reduce
- random forest (RF) method for
Vectorization
Volume bars
Volume imbalance bars (VIBs)
Volume runs bars (VRBs)
Volume-synchronized probability of informed trading (VPIN)

Walk-forward (WF) method
- backtesting using
- overfitting in
- pitfalls of
- Sharpe ratio estimation in
- two key advantages of
Walk-forward timefolds method
Weighted Kendall's tau
Weights. See Class weights; Sample weights