01 AI-powered Image Generation with Gemini Nano Banana models, supporting various aspect ratios and resolutions up to 4K.
02 Text-to-Video generation using Veo models, including native audio, dialogue, and sound effects.
03 Instrumental Music Composition with Lyria RealTime, allowing control over genre, instrument, mood, BPM, and scale.
04 Text-to-Speech conversion using Gemini TTS with voice selection, multi-speaker support, and natural language style control.