Discover Agent Skills for data science & ML. Browse 61 skills for Claude, ChatGPT & Codex.
Evaluates Large Language Models across 60+ academic benchmarks to measure reasoning, coding, and mathematical capabilities using industry-standard metrics.
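A minimal sketch of what such an evaluation run can look like, assuming EleutherAI's lm-evaluation-harness (`lm_eval`); the entry does not name a harness, and the checkpoint and task names below are illustrative.

```python
# Sketch: scoring a Hugging Face model on academic benchmarks,
# assuming the lm-evaluation-harness package (pip install lm-eval).
from lm_eval import evaluator

results = evaluator.simple_evaluate(
    model="hf",                                      # Hugging Face backend
    model_args="pretrained=EleutherAI/pythia-160m",  # illustrative checkpoint
    tasks=["arc_easy", "hellaswag"],                 # illustrative benchmark tasks
    num_fewshot=0,
    batch_size=8,
)

# Per-task metrics such as accuracy and normalized accuracy.
for task, metrics in results["results"].items():
    print(task, metrics)
```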
Implements Group Relative Policy Optimization (GRPO) using the TRL library to enhance model reasoning and structured output capabilities.
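A minimal GRPO sketch, assuming a recent TRL release that ships `GRPOTrainer`; the base model, dataset, and toy length reward are illustrative placeholders, not the skill's own configuration.

```python
from datasets import load_dataset
from trl import GRPOConfig, GRPOTrainer

# Any dataset with a "prompt" column works; this one is illustrative.
dataset = load_dataset("trl-lib/tldr", split="train")

def reward_len(completions, **kwargs):
    # Toy group-relative reward: prefer completions near 100 characters.
    return [-abs(100 - len(c)) for c in completions]

trainer = GRPOTrainer(
    model="Qwen/Qwen2-0.5B-Instruct",   # illustrative small model
    reward_funcs=reward_len,
    args=GRPOConfig(output_dir="grpo-demo"),
    train_dataset=dataset,
)
trainer.train()
```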
Manages high-performance vector search and storage for production RAG and AI applications using Pinecone's serverless infrastructure.
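A minimal sketch of the serverless workflow, assuming the Pinecone v3+ Python SDK; the index name, dimension, and vectors are illustrative.

```python
from pinecone import Pinecone, ServerlessSpec

pc = Pinecone(api_key="YOUR_API_KEY")

# Create a serverless index sized to your embedding model.
pc.create_index(
    name="rag-demo",
    dimension=384,
    metric="cosine",
    spec=ServerlessSpec(cloud="aws", region="us-east-1"),
)
index = pc.Index("rag-demo")

# Upsert embedded chunks with metadata for retrieval.
index.upsert(vectors=[
    {"id": "doc-1", "values": [0.1] * 384, "metadata": {"source": "faq.md"}},
    {"id": "doc-2", "values": [0.2] * 384, "metadata": {"source": "guide.md"}},
])

# Query with an embedded question vector.
matches = index.query(vector=[0.15] * 384, top_k=2, include_metadata=True)
print(matches)
```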
Implements Meta AI's foundation model for high-precision zero-shot image segmentation using points, boxes, and masks.
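A minimal point-prompted segmentation sketch, assuming Meta's `segment-anything` package and a downloaded ViT-H checkpoint; the image path and click coordinates are illustrative.

```python
import numpy as np
import cv2
from segment_anything import sam_model_registry, SamPredictor

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("photo.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# One foreground click (x, y); label 1 = foreground, 0 = background.
masks, scores, logits = predictor.predict(
    point_coords=np.array([[500, 375]]),
    point_labels=np.array([1]),
    multimask_output=True,   # return several candidate masks with quality scores
)
print(masks.shape, scores)
```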
Generates state-of-the-art text and image embeddings for RAG, semantic search, and clustering tasks.
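A minimal embedding sketch using the sentence-transformers library; the model name is an illustrative choice, not necessarily the one this skill uses.

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

docs = ["Vector databases store embeddings.", "Paris is the capital of France."]
query = "Where are embeddings stored?"

# Normalized embeddings so dot product equals cosine similarity.
doc_emb = model.encode(docs, normalize_embeddings=True)
query_emb = model.encode(query, normalize_embeddings=True)

# Cosine similarity between the query and each document.
print(util.cos_sim(query_emb, doc_emb))
```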
Performs declarative causal interventions and mechanistic interpretability experiments on PyTorch models.
Implements Anthropic's Constitutional AI method to train harmless, helpful models through self-critique and automated AI feedback.
Optimizes large-scale language model training using NVIDIA Megatron-Core with advanced 3D and expert parallelism strategies.
Accelerates Large Language Model inference on NVIDIA GPUs using state-of-the-art optimization techniques for maximum throughput and minimal latency.
Implements programmable safety rails and validation for LLM applications to prevent jailbreaks, hallucinations, and PII leaks.
Implements and trains minimalist GPT architectures for educational and research purposes using Andrej Karpathy's clean, hackable codebase.
Simplifies Large Language Model implementation, training, and fine-tuning using clean, production-ready LitGPT architectures.
Streamlines the fine-tuning process for over 100 large language models using the LLaMA-Factory framework and QLoRA techniques.
Optimizes LLM serving and structured generation using RadixAttention prefix caching for high-performance agentic workflows.
Deploys high-performance Reinforcement Learning from Human Feedback (RLHF) workflows using Ray and vLLM acceleration for large-scale model alignment.
Facilitates mechanistic interpretability research by providing tools to inspect, cache, and manipulate transformer model activations via HookPoints.
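A minimal sketch of activation caching and a hook-based intervention, assuming the `transformer_lens` package; the model choice and the specific hook names are illustrative.

```python
import torch
from transformer_lens import HookedTransformer

model = HookedTransformer.from_pretrained("gpt2")
tokens = model.to_tokens("The Eiffel Tower is in")

# Run once and cache every HookPoint activation.
logits, cache = model.run_with_cache(tokens)
print(cache["blocks.0.attn.hook_z"].shape)   # per-head attention outputs

# Zero-ablate layer 0's attention output via a forward hook.
def zero_attn_out(value, hook):
    return torch.zeros_like(value)

ablated_logits = model.run_with_hooks(
    tokens,
    fwd_hooks=[("blocks.0.hook_attn_out", zero_attn_out)],
)
```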
Simplifies large language model alignment using reference-free preference optimization to improve model performance without the overhead of PPO or DPO.
Quantizes Large Language Models to 4/3/2-bit precision without calibration data for faster inference and reduced memory footprint.
Fine-tunes large language models using LoRA, QLoRA, and other parameter-efficient methods to drastically reduce memory and compute requirements.
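A minimal LoRA setup sketch using the `peft` library; the base model and hyperparameters are illustrative.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-350m")

lora_config = LoraConfig(
    r=16,                                  # adapter rank
    lora_alpha=32,                         # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, lora_config)
model.print_trainable_parameters()   # typically well under 1% of total parameters
# Train `model` with transformers.Trainer or a custom loop as usual.
```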
Builds LLM-powered applications using agents, retrieval-augmented generation (RAG), and modular chains.
Implements and optimizes Mixture of Experts (MoE) architectures to scale model capacity while reducing training and inference costs.
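An illustrative from-scratch sketch of a top-k gated MoE layer in PyTorch, showing the routing idea; it is not tied to any particular MoE framework this skill may wrap.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    def __init__(self, d_model=256, d_ff=1024, n_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.gate = nn.Linear(d_model, n_experts)   # router
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                       # x: (tokens, d_model)
        scores = self.gate(x)                   # router logits per expert
        weights, idx = scores.topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)    # normalize over the chosen experts
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e        # tokens routed to expert e in this slot
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

layer = MoELayer()
print(layer(torch.randn(4, 256)).shape)   # torch.Size([4, 256])
```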
Orchestrates distributed machine learning training across clusters to scale PyTorch, TensorFlow, and Hugging Face models.
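A minimal sketch of distributed PyTorch training with Ray Train; the toy model, data, and worker count are illustrative.

```python
import torch
import torch.nn as nn
from ray.train import ScalingConfig
from ray.train.torch import TorchTrainer, prepare_model

def train_loop_per_worker(config):
    model = prepare_model(nn.Linear(10, 1))   # wraps the model for DDP
    optim = torch.optim.SGD(model.parameters(), lr=1e-2)
    for _ in range(config["epochs"]):
        x, y = torch.randn(32, 10), torch.randn(32, 1)
        loss = nn.functional.mse_loss(model(x), y)
        optim.zero_grad()
        loss.backward()
        optim.step()

trainer = TorchTrainer(
    train_loop_per_worker,
    train_loop_config={"epochs": 3},
    scaling_config=ScalingConfig(num_workers=2, use_gpu=False),
)
result = trainer.fit()
```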
Optimizes Large Language Models using 4-bit post-training quantization to reduce memory usage and accelerate inference on consumer GPUs.
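The entry does not name a quantization backend; the sketch below assumes 4-bit NF4 quantization at load time via transformers + bitsandbytes, with an illustrative model name.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # NormalFloat4 data type
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bf16 for speed
    bnb_4bit_use_double_quant=True,         # quantize the quantization constants too
)

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

inputs = tokenizer("Quantization reduces memory by", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```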
Extracts structured, type-safe data from Large Language Models using Pydantic validation and automatic retries.
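A minimal sketch of type-safe extraction with Pydantic validation and automatic retries; the entry does not name a library, so this assumes the `instructor` package patching an OpenAI client, with an illustrative model name.

```python
from pydantic import BaseModel, Field
import instructor
from openai import OpenAI

class Person(BaseModel):
    name: str
    age: int = Field(ge=0, le=130)   # validation bound; failures trigger a retry

client = instructor.from_openai(OpenAI())

person = client.chat.completions.create(
    model="gpt-4o-mini",
    response_model=Person,           # output parsed and validated against Person
    max_retries=2,                   # re-ask the model on validation errors
    messages=[{"role": "user", "content": "Extract: Ada Lovelace was 36."}],
)
print(person)   # Person(name='Ada Lovelace', age=36)
```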
Curates high-quality datasets for LLM training using GPU-accelerated deduplication, filtering, and PII redaction.
Implements language-independent subword tokenization using BPE and Unigram algorithms for advanced AI model development.
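A minimal sketch of training and using a BPE tokenizer with the sentencepiece library; the corpus path and vocabulary size are illustrative.

```python
import sentencepiece as spm

# Train a BPE model from a plain-text corpus (one sentence per line).
spm.SentencePieceTrainer.train(
    input="corpus.txt",
    model_prefix="tok",          # writes tok.model and tok.vocab
    vocab_size=8000,
    model_type="bpe",            # or "unigram"
    character_coverage=0.9995,
)

sp = spm.SentencePieceProcessor(model_file="tok.model")
print(sp.encode("Subword tokenization is language independent.", out_type=str))
print(sp.encode("Subword tokenization is language independent.", out_type=int))
```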
Deploys and optimizes LLM inference on CPU, Apple Silicon, and consumer hardware using GGUF quantization.
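A minimal sketch of local inference on a GGUF checkpoint, assuming the llama-cpp-python bindings; the model path is illustrative.

```python
from llama_cpp import Llama

llm = Llama(
    model_path="models/llama-3-8b-instruct.Q4_K_M.gguf",  # 4-bit GGUF file
    n_ctx=4096,          # context window
    n_gpu_layers=-1,     # offload all layers to Metal/GPU if available; 0 = CPU only
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain GGUF quantization in one sentence."}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```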
Guarantees valid, type-safe JSON and structured outputs from Large Language Models using grammar-based constraints.
Queries the NCBI Gene database to retrieve comprehensive genetic information, sequences, and functional annotations for biological research.
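A minimal sketch of querying NCBI Gene through Biopython's Entrez utilities; the contact email and gene term are illustrative (NCBI asks for a real email with E-utilities requests).

```python
from Bio import Entrez

Entrez.email = "you@example.com"

# Find the human BRCA1 gene record.
handle = Entrez.esearch(db="gene", term="BRCA1[Gene Name] AND Homo sapiens[Organism]")
gene_ids = Entrez.read(handle)["IdList"]
handle.close()

# Fetch the record summary (symbol, description, chromosomal location, ...).
handle = Entrez.esummary(db="gene", id=gene_ids[0])
summary = Entrez.read(handle)
handle.close()
print(summary)
```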
Builds and deploys specialized machine learning models for clinical healthcare data and electronic health records.