Evaluates the performance of MCP servers for web search and database query tasks.
MCPBench is an evaluation framework designed to assess the performance of MCP servers, focusing on web search and database query functionalities. Compatible with both local and remote servers, it measures task completion accuracy, latency, and token consumption under consistent LLM and Agent configurations. The framework provides datasets and evaluation scripts to benchmark different MCP servers and is based on LangProBe: a Language Programs Benchmark.