01Supports multiple upstream MCP servers with flexible authentication forwarding
02Configurable routing parameters (top_k, confidence_threshold, context_window_turns)
03Local embedding inference (all-MiniLM-L6-v2) for no API costs
04Always-available meta-tools for server discovery, tool searching, and execution
051 GitHub stars
06Semantic routing of tools based on conversation context