关于
The PufferLib skill enables Claude Code to handle sophisticated reinforcement learning (RL) tasks with extreme efficiency. It leverages the PufferLib library to achieve training speeds of millions of steps per second through optimized vectorization and native multi-agent support. This skill is ideal for developers and researchers building custom environments via the PufferEnv API, training agents with PPO (PuffeRL), or integrating existing frameworks like Gymnasium and PettingZoo into high-throughput discovery workflows.