Generates high-quality videos from text prompts or images using Google's Veo 3 and Gemini APIs, featuring realistic motion and audio.
Veo 3 is an MCP server designed to empower users with advanced video generation capabilities leveraging Google's cutting-edge Veo 3 API via the Gemini API. It enables the creation of dynamic videos from simple text descriptions (text-to-video) or by animating static images with motion prompts (image-to-video). The server supports various Veo models, including Veo 3, Veo 3 Fast, and Veo 2, and offers features like native audio generation, aspect ratio control, negative prompting, and asynchronous processing for efficient video creation and management.