概要
Empower your AI agents with the ability to learn from experience using a suite of nine reinforcement learning algorithms, including Decision Transformer, Q-Learning, and Actor-Critic. This skill facilitates the creation, training, and deployment of learning plugins that optimize agent behavior through offline RL, value-based learning, and policy gradients. With WASM-accelerated neural inference providing up to 100x faster performance, it is an essential tool for developers building sophisticated, adaptive multi-agent swarms and autonomous workflows that require continuous improvement and domain-specific decision-making capabilities.