Template generation and validation for mcpbr configuration files
Detailed result export in JSON and Markdown report formats
Automated prerequisite validation for Docker and API environment variables
Support for granular testing, including sample sizing and specific task selection
20 GitHub stars
Standardized benchmarking across SWE-bench, CyberGym, and MCPToolBench++
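
The prerequisite validation listed above (Docker plus API environment variables) could look roughly like the following. This is a minimal sketch, not mcpbr's actual implementation: the function name `check_prerequisites` and the variable name `EXAMPLE_API_KEY` are hypothetical, and the check only confirms that a `docker` executable is on PATH, not that the daemon is running.

```python
import os
import shutil


def check_prerequisites(required_env_vars, executable="docker", env=None):
    """Return a list of human-readable problems; an empty list means all checks passed.

    Hypothetical helper for illustration only; it is not part of mcpbr.
    """
    env = os.environ if env is None else env
    problems = []

    # shutil.which returns None when the executable cannot be found on PATH.
    if shutil.which(executable) is None:
        problems.append(f"'{executable}' was not found on PATH")

    for name in required_env_vars:
        # Unset and empty-string values are treated the same way.
        if not env.get(name):
            problems.append(f"environment variable {name} is not set")

    return problems


if __name__ == "__main__":
    # EXAMPLE_API_KEY is a placeholder name, not one taken from mcpbr.
    for problem in check_prerequisites(["EXAMPLE_API_KEY"]):
        print(problem)
```

Collecting every problem in a list, rather than stopping at the first failure, lets a user fix all missing prerequisites in one pass.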