소개
This skill provides specialized guidance for sizing Proximal Policy Optimization (PPO) network architectures specifically for algorithmic trading models. It helps developers determine the ideal hidden layer dimensions based on hardware constraints—such as NVIDIA A100 or H100 VRAM—as well as market data complexity and inference latency requirements. By detailing the relationship between layer width and parameter count, it enables precise control over model capacity, ensuring that traders can maximize predictive power without compromising the execution speed necessary for live market environments.