Discover Agent Skills for data science & ml. Browse 61 skills for Claude, ChatGPT & Codex.
Guides users through an interactive interview and research process to build custom AI agents using the OpenHands SDK.
Manages, executes, and converts Jupyter notebooks to streamline data science and machine learning workflows.
Persists semantic context and project decisions across multiple Claude Code sessions using Mem0.
Ensures high-quality AI evaluation datasets by performing automated schema validation, duplicate detection, and coverage analysis.
Validates AI evaluation datasets for schema compliance, content integrity, and duplicate detection to ensure high-quality benchmarking.
Optimizes Large Language Model inference for production environments using vLLM, advanced quantization, and speculative decoding.
Enables LLMs to interact with external tools and return structured data through standardized function calling patterns and strict schemas.
Manages the automated curation and multi-agent validation of high-quality datasets for LLM evaluation.
Manages fault-tolerant workflow persistence and state recovery for LangGraph AI agents.
Implement and compare multi-agent orchestration frameworks like CrewAI, OpenAI Agents SDK, and Microsoft Agent Framework for specialized AI workflows.
Implements production-ready Retrieval-Augmented Generation patterns to ground AI responses in factual data and minimize hallucinations.
Implements robust Retrieval-Augmented Generation (RAG) patterns to ground LLM responses with accurate, cited, and validated external data.
Optimizes Large Language Model inference for production environments using vLLM, advanced quantization, and speculative decoding techniques.
Manages the multi-agent curation of high-quality training and testing datasets with automated quality scoring and bias detection.
Implements advanced LLM-as-judge patterns and RAGAS metrics to evaluate AI output quality and detect hallucinations.
Decomposes complex, multi-concept queries into independent sub-topics to improve RAG retrieval accuracy and coverage.
Breaks down complex search queries into independent sub-concepts to improve retrieval accuracy and coverage in RAG systems.
Provides implementation patterns and comparison guides for multi-agent orchestration frameworks including CrewAI, OpenAI Agents SDK, and Microsoft Agent Framework.
Implements autonomous agentic workflows and reasoning patterns for complex, multi-step LLM tasks.
Implements real-time voice agents, high-accuracy transcription, and expressive text-to-speech using native speech-to-speech models.
Implements automated quality gates, LLM-as-judge patterns, and RAGAS metrics to ensure reliable and grounded AI outputs.
Implements real-time voice agents, high-accuracy transcription, and text-to-speech using leading audio AI providers.
Optimizes LLM performance through production-ready patterns including Chain-of-Thought, dynamic few-shot learning, and automated prompt tuning.
Orchestrates multi-agent workflows using a central supervisor pattern to intelligently route tasks between specialized worker agents.
Optimizes AI application performance through production-ready prompt engineering patterns, versioning, and automated tuning.
Implements dynamic workflow branching and retry logic for AI agentic systems using LangGraph patterns.
Coordinates complex multi-agent workflows using a centralized supervisor-worker orchestration pattern.
Implements advanced Self-RAG and Corrective-RAG architectures for self-correcting AI retrieval systems.
Implements autonomous reasoning patterns like ReAct and Plan-and-Execute to enable LLMs to solve complex, multi-step tasks.
Optimizes LLM API costs and performance by implementing provider-native prompt caching for Claude and OpenAI.
Scroll for more results...