01 Automated CoreML conversion and model stitching via MLTensor
02 Advanced model compression including quantization, palettization, and pruning
03 Implementation patterns for stateful models and KV-cache for LLMs
04 Comprehensive Speech-to-Text integration using SpeechAnalyzer
05 Diagnostic workflows for debugging inference performance and memory pressure
06 233 GitHub stars