Gladia icon

Gladia

Createdgladiaio

Enables interaction with Speech-to-Text and Audio Intelligence APIs for transcription, analysis, and translation.

About

Gladia is an official Model Context Protocol (MCP) server that facilitates interaction with Speech-to-Text and Audio Intelligence APIs. It allows MCP clients like Claude Desktop, Cursor, Windsurf, and OpenAI Agents to transcribe audio, analyze speech, translate content, and perform other audio-related tasks. With its easy-to-use CLI and asynchronous API, Gladia seamlessly integrates advanced audio processing capabilities into various applications and workflows.

Key Features

  • Configurable logging and CORS support
  • Audio transcription with speaker diarization
  • Async API with FastAPI
  • Real-time speech-to-text
  • 0 GitHub stars
  • Audio intelligence capabilities (translation, summarization, NER, sentiment analysis, content moderation, chapterization)

Use Cases

  • Transcribing audio files and identifying different speakers
  • Converting recordings to text and translating them to other languages
  • Analyzing sentiment and emotions in speech
Craft Better Prompts with AnyPrompt