Jieba
Encapsulates the powerful jieba Chinese word segmentation library as a Model Context Protocol (MCP) server, offering robust text processing capabilities for various applications.
About
This MCP server is specifically designed to bring sophisticated Chinese text segmentation functionality to any MCP-compatible client, such as Claude Desktop. Built upon the efficient FastMCP framework, it seamlessly integrates the widely-used jieba library, transforming its capabilities into a standardized, accessible tool. Users can leverage three distinct segmentation modes—precise for detailed analysis, full for comprehensive word extraction, and search for improved recall in search contexts—all while benefiting from automatic whitespace filtering and efficient data transmission via Server-Sent Events (SSE). It further simplifies deployment through Docker containerization, making it easy to set up and integrate into existing workflows for data analysis, search, and other text-processing needs.
Key Features
- Supports three distinct Chinese segmentation modes: precise, full, and search.
- Automatically filters whitespace characters from segmented output.
- Utilizes Server-Sent Events (SSE) for efficient data transmission.
- Offers Docker containerization for simplified deployment and scalability.
- Encapsulates the jieba segmentation library as a standard MCP tool.
- 0 GitHub stars
Use Cases
- Integrating advanced Chinese word segmentation into MCP-compatible clients (e.g., Claude Desktop).
- Performing in-depth text analysis on Chinese content with precise word boundaries.
- Enhancing search engine recall by applying specialized segmentation for long words.