About
This skill provides specialized logic and parameter configurations to fix the common 'HOLD bias' in Reinforcement Learning (RL) trading agents. It specifically addresses scenarios where models learn that staying out of the market is the only rational choice due to asymmetric payoffs, excessive slippage penalties, or insufficient exploration incentives. By adjusting trading incentives, direction thresholds, and rebalancing reward weights, it helps models identify profitable entry points and achieve realistic trade rates (30-60%) in volatile environments like BTCUSD.