Is streaming supported for text generation?

Yes, the skill emphasizes streaming patterns for all text models to prevent memory buffering, avoid Worker timeouts, and improve time-to-first-token.

Does this skill support the 2025 model updates?

Yes, it includes support for Llama 4 Scout, Gemma 3 (128K context), and Mistral 3.1, along with 2025 breaking changes like token-based context limits.

Does it help with vector embeddings for RAG?

Yes, it provides optimized patterns for BGE and EmbeddingGemma models, including 2025 performance updates and proper pooling parameter usage for v1.5 models.

How does it handle image generation errors?

The skill provides specific fixes for Flux NSFW filter false positives and ensures the required num_steps parameter is correctly included for API calls to prevent Error 1000.

Can I use this for local development?

Absolutely. It includes configurations to resolve Miniflare binding errors by using remote AI bindings or updated Vite plugins during local testing.

Cloudflare Workers AI

Name: Cloudflare Workers AI
Author: evolv3ai

byevolv3ai

0•

云基础设施

Implements high-performance LLM inference and AI model deployment on Cloudflare’s global GPU network with specialized error prevention.

This skill equips Claude with the expertise to build, deploy, and troubleshoot AI applications using Cloudflare Workers AI. It covers the complete lifecycle of AI integration—from selecting 2025-ready models like Llama 4 and Gemma 3 to implementing streaming responses and RAG with BGE embeddings. By proactively addressing seven documented edge cases, including token-based context window limits and Miniflare binding issues, it ensures production-grade stability for serverless AI workloads without the common pitfalls of neuron consumption or NSFW filter false positives.

主要功能

01Streaming response templates for reduced latency and memory efficiency

02Optimized RAG implementation using high-speed BGE and Gemma embeddings

03Deployment patterns for 2025 models including Llama 4 Scout and Gemma 3

04Automated troubleshooting for 7 documented Cloudflare AI error codes

05Advanced image generation handling with Flux and NSFW filter bypass strategies

060 GitHub stars

使用场景

01Implementing multi-modal AI agents with vision and tool-calling capabilities on the edge

02Generating photorealistic images using Flux with robust error handling for parameters

03Building serverless RAG applications with high-speed vector embeddings and D1 integration

What are Skills?·How to Install

Install with 🐟 Skill.Fish

npx skillfish add evolv3ai/claude-skills-archive cloudflare-workers-ai

For use in Claude.ai and ChatGPT

Download Skill