01. Full API reference for chat, generation, and embedding endpoints
02. Structured JSON output and tool-calling implementation patterns
03. GPU utilization monitoring and performance troubleshooting guides
04. Multi-platform server configuration, including Docker and proxy setups
05. OpenAI-compatible library configuration for Python and JavaScript
06. 8 GitHub stars