01Cross-prompt activation sharing and multi-token generation tracing
02Activation patching and hidden state intervention capabilities
03Unified API for any PyTorch architecture including Transformers and Mamba
04Remote execution on massive models (70B, 405B) via NDIF
05Deferred execution model for efficient computation graph manipulation
06384 GitHub stars