01Direct endpoint returns for agents to call models without intermediary layers
02Real-time compatibility filtering for tool calls, context windows, and reasoning formats
03Dynamic quota tracking to prevent exhaustion of provider limits
042 GitHub stars
05Live LLM model discovery from various cloud and local providers
06Configurable cost control for free, cheap, or premium inference