MCPBench icon

MCPBench

117

Evaluates the performance of MCP servers for web search and database query tasks.

Acerca de

MCPBench is an evaluation framework designed to assess the performance of MCP servers, focusing on web search and database query functionalities. Compatible with both local and remote servers, it measures task completion accuracy, latency, and token consumption under consistent LLM and Agent configurations. The framework provides datasets and evaluation scripts to benchmark different MCP servers and is based on LangProBe: a Language Programs Benchmark.

Características Principales

  • Measures task completion accuracy
  • 54 GitHub stars
  • Supports local and remote MCP servers
  • Evaluates database query MCP servers
  • Evaluates web search MCP servers
  • Provides datasets for evaluation

Casos de Uso

  • Benchmarking database query MCP servers
  • Benchmarking web search MCP servers
  • Comparing performance of different MCP servers
Craft Better Prompts with AnyPrompt
Sponsored