01Comprehensive local model management (list, pull, copy, delete)
02Simplified multi-turn conversation state management
03Seamless text generation and chat completion with streaming support
04Real-time performance metrics and response evaluation
05High-performance embedding generation for RAG applications
060 GitHub stars