01 Configurable decay rates (eta) and reward weighting for environment tuning
02 1 GitHub star
03 Automated stability scaling and clamping to prevent numerical instability
04 Vectorized PyTorch implementation for high-performance GPU environments
05 Dynamic risk-adjusted gradient signals for better position sizing
06 Online, incremental Sharpe ratio estimation via EMA statistics
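
To make the features above concrete, here is a minimal scalar sketch of an online Sharpe estimate built from EMA statistics. It assumes the approach resembles the differential Sharpe ratio of Moody and Saffell, which the list does not state explicitly. The class name `EmaSharpe`, its parameters `eps` and `clip`, and their default values are hypothetical and are not taken from the project. The sketch is plain Python rather than the project's vectorized PyTorch code, so it illustrates the update rule only.

```python
import math


class EmaSharpe:
    """Hypothetical scalar sketch; not the project's actual API."""

    def __init__(self, eta=0.01, eps=1e-8, clip=10.0):
        self.eta = eta        # EMA decay rate
        self.eps = eps        # assumed stabiliser added to the variance
        self.clip = clip      # assumed bound on the emitted signal
        self.mean = 0.0       # EMA of returns
        self.mean_sq = 0.0    # EMA of squared returns

    def _variance(self):
        # Floor at zero to absorb floating-point round-off.
        return max(self.mean_sq - self.mean ** 2, 0.0)

    def update(self, ret):
        """Consume one return and emit a clamped risk-adjusted signal."""
        d_mean = ret - self.mean
        d_mean_sq = ret * ret - self.mean_sq

        # Differential Sharpe ratio, computed from the statistics
        # held before this return is folded in.
        denom = (self._variance() + self.eps) ** 1.5
        signal = (self.mean_sq * d_mean - 0.5 * self.mean * d_mean_sq) / denom

        # Clamp so a near-zero variance cannot blow the signal up.
        signal = max(-self.clip, min(self.clip, signal))

        # Incremental EMA updates of the first and second moments.
        self.mean += self.eta * d_mean
        self.mean_sq += self.eta * d_mean_sq
        return signal

    def sharpe(self):
        """Current Sharpe estimate from the EMA moments."""
        return self.mean / math.sqrt(self._variance() + self.eps)


estimator = EmaSharpe(eta=0.1)
signals = [estimator.update(r) for r in (0.02, -0.01, 0.03, 0.00, -0.02, 0.01)]
```

A larger `eta` makes the estimate react faster to recent returns at the cost of more noise. The `eps` and `clip` values play the role of the stability scaling and clamping in the list, under the assumptions stated above.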