概要
PufferLib is a specialized skill for Claude Code designed to streamline the development and training of high-performance reinforcement learning agents. It provides optimized implementations of Proximal Policy Optimization (PPO) and LSTM architectures, achieving training speeds of millions of steps per second through advanced environment vectorization. Whether you are building custom environments with the PufferEnv API or integrating with standard frameworks like Gymnasium and PettingZoo, this skill offers the implementation patterns and architectural guidance needed to scale RL experimentation and achieve 2-10x speedups over standard implementations.