Acerca de
This skill provides comprehensive guidance for the entire reinforcement learning lifecycle using the Stable Baselines3 library. It enables developers to implement state-of-the-art RL algorithms like PPO, SAC, and DQN, create and validate custom Gymnasium environments, and leverage advanced features like vectorized environments for parallel training. Whether you are building complex agentic behaviors or conducting RL experimentation, this skill offers standardized patterns for callback implementation, model persistence, and evaluation to ensure robust and reproducible results.