010 GitHub stars
02Generate images, videos, and audio from text prompts across multiple AI providers.
03Perform image editing with support for OpenAI, xAI, Gemini, and BFL (FLUX Kontext).
04Transcribe audio to text using OpenAI Whisper or ElevenLabs Scribe.
05Save all generated media files to disk with descriptive filenames.
06Automatically discover and configure available media generation providers from environment variables.