概要
PufferLib is a specialized skill for Claude Code that enables developers to build, train, and scale reinforcement learning agents at millions of steps per second. It provides a comprehensive toolkit for creating custom environments using the PufferEnv API, optimizing parallel simulations through efficient vectorization, and implementing advanced policy architectures like CNNs and LSTMs. Whether you are integrating existing Gymnasium or PettingZoo environments or developing a high-performance multi-agent system from scratch, this skill provides the domain-specific guidance and implementation patterns needed to maximize training throughput and experiment iteration speed.