Generates asynchronous text responses using a distilgpt2 model, managed through a Redis queue and accelerated with GPU support, exposing an MCP-compliant API.
Waifu Queue is an asynchronous text generation service designed for conversational AI applications. It leverages the distilgpt2 language model and utilizes a Redis queue to manage requests efficiently, especially under concurrent load. GPU acceleration enhances the speed of text generation. The service exposes an MCP (Model Context Protocol) compliant API through FastMCP, allowing seamless integration with other MCP-compatible systems. Configuration is managed through environment variables, ensuring flexibility and ease of deployment.