Rostro: Extend LLMs with On-the-Fly Multimodal Generation