소개
This skill acts as an intelligent entry point for deep reinforcement learning (RL) projects, guiding users through the selection and implementation of algorithms like DQN, PPO, and SAC. By analyzing specific problem variables—such as discrete versus continuous action spaces, online versus offline data regimes, and multi-agent requirements—it ensures the most effective RL framework is applied. Beyond algorithm selection, it provides specialized guidance for debugging non-convergent agents, designing reward functions, and configuring custom environments, making it an essential tool for robotics, game AI, and complex control systems.