01Five-layer architecture for structured system, tool, and document management
02Compression triggers and sliding window memory management for history
03Automated MCP tool deferral to save up to 7,200 tokens per session
0469 GitHub stars
05Attention-aware positioning to mitigate information loss in long contexts
06Dynamic token budget calculation for major models like Claude, GPT, and Gemini