0111 GitHub stars
02Multi-dimensional scoring across 6 metrics including IFD, factuality, and reasoning.
03Automated DPO pair validation with length-bias auditing to prevent model shortcuts.
04Six specialized scorer types ranging from FastIFD to high-fidelity ensemble models.
05RLVR verifiability checks specifically designed for math and coding domains.
06High-performance CLI support for distributed processing on Apple Silicon and other backends.