Video Digest is a service designed to process video content by extracting audio from online sources such as YouTube, Bilibili, and TikTok, then converting it into text. It supports multiple transcription service providers including Deepgram, Gladia, Speechmatics, and AssemblyAI, allowing for flexible service selection based on available API keys. The service is designed for asynchronous processing, error handling, and includes speaker diarization.