Provides speech processing services, including audio validation, speech transcription, and voice activity detection, using Alibaba's FunASR library.
FunASR leverages Alibaba's FunASR library within the FastMCP framework to offer robust speech processing capabilities. It includes tools for validating audio files, transcribing speech asynchronously with advanced ASR models like Paraformer (including detailed timestamps), and detecting voice activity. Designed to be extensible, it supports dynamic loading and switching of ASR and VAD models, configurable model parameters, and easy integration using MCP clients or HTTP requests.