Acerca de
This skill provides comprehensive guidance for deploying and managing AI models on Cloudflare's serverless platform. It covers the 2025 updates for flagship models like Llama 4, Gemma 3, and Mistral 3.1, while addressing critical implementation details such as breaking changes in token defaults and BGE pooling parameters. Developers can use this skill to implement low-latency streaming, build robust RAG workflows with Vectorize, and leverage AI Gateway for advanced caching, logging, and cost tracking across Cloudflare's distributed infrastructure.