01Advanced 6-axis quality scoring and evaluation engine
02Adversarial probe types to identify agent vulnerabilities
03Battle arena with Bayesian ELO rating and fair matchmaking
04IRT adaptive testing for cost-efficient, accurate evaluations
05W3C Verifiable Credentials and x402 payment verification support
060 GitHub stars