01Strict mode function calling schemas for high-reliability tool use
02Local inference setup for Ollama featuring DeepSeek and Qwen models
03Parameter-efficient fine-tuning workflows using LoRA and QLoRA
04Advanced prompt engineering patterns including CoT, ReAct, and DSPy
05Real-time SSE and WebSocket streaming implementations for FastAPI
06116 GitHub stars