About
This skill integrates AgentDB's reinforcement learning capabilities into the Claude Code environment so that developers can build self-improving agents. It provides nine algorithms, including Decision Transformer for offline learning and Actor-Critic for continuous control, which let agents optimize their behavior from historical data or from real-time interaction. Neural inference is WASM-accelerated for better performance, making the skill well suited to developing autonomous agents that adapt to complex environments over time.