关于
Provides access to nine reinforcement learning algorithms via the AgentDB plugin system, enabling the creation of autonomous agents that learn and optimize their behavior through experience. It supports a wide range of techniques from value-based Q-Learning and policy gradients to advanced offline learning with Decision Transformers, all powered by WASM-accelerated neural inference for significant performance gains. This skill is ideal for developers building self-learning systems, implementing imitation learning, or orchestrating multi-agent environments where agents must adapt to historical data or real-time feedback.