01Inter-service signaling protocol with /request-unload endpoints
020 GitHub stars
03Configurable idle auto-unload patterns for multiple AI backends
04Passive and active memory management strategies
05Resilient OOM retry logic for PyTorch and Transformers
06Pre-configured setup guides for Ollama and ComfyUI/Flux