Discover Agent Skills for data science & ML. Browse 61 skills for Claude, ChatGPT & Codex.
Implements and optimizes the RWKV architecture, a hybrid RNN-Transformer model offering linear-time inference and an effectively unbounded context window.
Streamlines deep learning development by decoupling research code from engineering boilerplate for automated distributed training and hardware scaling.
Evaluates Large Language Models across 60+ academic benchmarks to measure reasoning, coding, and mathematical capabilities using industry-standard metrics.
Enforces structured LLM outputs using regex and grammars to guarantee valid JSON, XML, and code generation.
Orchestrates teams of autonomous AI agents to collaborate on complex tasks through role-based delegation and memory.
Implements Meta AI's foundation model for high-precision zero-shot image segmentation using points, boxes, and masks.
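A minimal sketch of the point-prompt workflow, assuming Meta's `segment_anything` package and a locally downloaded checkpoint (the path, model size, and dummy image below are placeholders):

```python
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

# Load a SAM backbone; "vit_b" and the checkpoint path are illustrative.
sam = sam_model_registry["vit_b"](checkpoint="sam_vit_b.pth")
predictor = SamPredictor(sam)

image = np.zeros((480, 640, 3), dtype=np.uint8)  # stand-in for a real RGB image
predictor.set_image(image)

# One foreground point prompt (label 1 = foreground, 0 = background).
masks, scores, _ = predictor.predict(
    point_coords=np.array([[320, 240]]),
    point_labels=np.array([1]),
    multimask_output=True,  # return several candidate masks with quality scores
)
print(masks.shape, scores)
```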
Interprets and manipulates neural network internals for any PyTorch model, including massive foundation models via remote execution.
Extends transformer context windows using RoPE scaling, YaRN, and ALiBi to process documents exceeding 128k tokens.
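For orientation, here is a minimal rotary position embedding (RoPE): pairs of channels are rotated by position-dependent angles, and context-extension methods such as YaRN work by rescaling these frequencies. Shapes and names are illustrative, not this skill's code:

```python
import torch

def apply_rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    # x: (seq_len, dim) with dim even
    seq_len, dim = x.shape
    inv_freq = base ** (-torch.arange(0, dim, 2, dtype=torch.float32) / dim)
    angles = torch.arange(seq_len, dtype=torch.float32)[:, None] * inv_freq  # (seq, dim/2)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[:, 0::2], x[:, 1::2]
    rotated = torch.empty_like(x)
    rotated[:, 0::2] = x1 * cos - x2 * sin  # rotate each channel pair by its angle
    rotated[:, 1::2] = x1 * sin + x2 * cos
    return rotated

q = torch.randn(8, 64)
print(apply_rope(q).shape)  # torch.Size([8, 64])
```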
Accelerates large-scale similarity search and clustering for dense vectors using Facebook AI's high-performance library.
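A minimal FAISS sketch with an exact L2 index over random vectors; real deployments at scale would typically use an approximate index such as IVF or HNSW:

```python
import faiss
import numpy as np

d = 128                                            # vector dimensionality
xb = np.random.rand(10_000, d).astype("float32")   # database vectors
xq = np.random.rand(5, d).astype("float32")        # query vectors

index = faiss.IndexFlatL2(d)             # exact (brute-force) L2 index
index.add(xb)
distances, ids = index.search(xq, 4)     # 4 nearest neighbours per query
print(ids)
```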
Streamlines the fine-tuning of large language models using Axolotl through expert YAML configuration guidance and advanced training techniques.
Decomposes complex neural network activations into sparse, interpretable features to understand and steer model behavior.
Generates state-of-the-art text and image embeddings for RAG, semantic search, and clustering tasks.
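One common way to produce such embeddings is the `sentence-transformers` library; the library choice and model name below are assumptions for illustration, not necessarily this skill's defaults:

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # placeholder model id
sentences = ["How do I reset my password?", "Steps to recover account access"]
embeddings = model.encode(sentences, normalize_embeddings=True)  # (2, 384), unit-norm

# On normalized vectors, cosine similarity reduces to a dot product.
print(float(embeddings[0] @ embeddings[1]))
```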
Optimizes large-scale AI model training using PyTorch Fully Sharded Data Parallelism for efficient memory management and scaling.
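A skeleton of the FSDP wrapping pattern, assuming the script is launched with torchrun so the distributed environment variables exist; the model and hyperparameters are placeholders:

```python
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model = torch.nn.Linear(1024, 1024).cuda()  # stand-in for a large model
model = FSDP(model)                         # shards parameters, grads, optimizer state

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
loss = model(torch.randn(8, 1024, device="cuda")).sum()
loss.backward()
optimizer.step()
dist.destroy_process_group()
```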
Compresses Large Language Models using advanced techniques like Wanda and SparseGPT to reduce memory footprint and accelerate inference speeds.
Implements and optimizes Mamba-based Selective State Space Models for high-efficiency sequence modeling and long-context AI research.
Quantizes Large Language Models to 4-bit or 8-bit formats to reduce GPU memory usage by up to 75% with minimal accuracy loss.
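One widely used route to 4-bit loading is bitsandbytes NF4 via `transformers`; this library choice and the model id are assumptions for illustration:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # NormalFloat4 quantization
    bnb_4bit_compute_dtype=torch.bfloat16,  # matmuls run in bf16
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",             # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)
```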
Optimizes large-scale AI model training using DeepSpeed's ZeRO, pipeline parallelism, and high-performance DeepNVMe I/O handling.
Provides high-performance, Rust-optimized text tokenization for NLP research and production-grade machine learning pipelines.
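Loading a pretrained fast tokenizer with the Rust-backed `tokenizers` library takes a few lines; the model id is an assumption:

```python
from tokenizers import Tokenizer

tokenizer = Tokenizer.from_pretrained("bert-base-uncased")
encoding = tokenizer.encode("Rust-backed tokenization is fast.")
print(encoding.tokens)  # subword strings
print(encoding.ids)     # vocabulary ids
```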
Tracks machine learning experiments and manages model lifecycles with real-time visualization and collaborative tools.
Generates high-quality images and performs advanced image transformations using Stable Diffusion models and the HuggingFace Diffusers library.
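A minimal text-to-image sketch with HuggingFace Diffusers; the checkpoint id is a placeholder and a CUDA GPU is assumed:

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # placeholder checkpoint
    torch_dtype=torch.float16,
).to("cuda")

image = pipe("a watercolor painting of a lighthouse at dusk").images[0]
image.save("lighthouse.png")
```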
Simplifies PyTorch distributed training by providing a unified API for DDP, DeepSpeed, and FSDP with minimal code changes.
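The core Accelerate pattern: `prepare()` wraps the model, optimizer, and dataloader for whichever backend (DDP, DeepSpeed, FSDP) the launch configuration selects, and `accelerator.backward()` replaces `loss.backward()`. A minimal sketch with toy data:

```python
import torch
from accelerate import Accelerator

accelerator = Accelerator()
model = torch.nn.Linear(64, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
dataset = torch.utils.data.TensorDataset(torch.randn(256, 64), torch.randn(256, 1))
loader = torch.utils.data.DataLoader(dataset, batch_size=32)

model, optimizer, loader = accelerator.prepare(model, optimizer, loader)
for x, y in loader:
    optimizer.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    accelerator.backward(loss)  # handles mixed precision / gradient scaling
    optimizer.step()
```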
Optimizes Large Language Models using activation-aware 4-bit quantization to achieve 3x inference speedups and significant memory reduction with minimal accuracy loss.
Evaluates AI code generation models across multiple programming languages and benchmarks using standardized pass@k metrics.
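The standard pass@k metric is the unbiased estimator from the HumanEval paper: with n samples per problem of which c pass, pass@k = 1 - C(n-c, k) / C(n, k), computed in a numerically stable product form:

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    if n - c < k:
        return 1.0  # every size-k subset contains at least one passing sample
    return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

print(pass_at_k(n=200, c=10, k=1))   # ~0.05
print(pass_at_k(n=200, c=10, k=10))  # considerably higher
```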
Generates high-fidelity music and sound effects from text descriptions using Meta's AudioCraft framework.
Optimizes Transformer models using Flash Attention to achieve significant speedups and memory reductions during training and inference.
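On supported GPUs, PyTorch's `scaled_dot_product_attention` dispatches to a FlashAttention kernel; this sketch shows the fused call, not this skill's exact setup, and assumes a CUDA device:

```python
import torch
import torch.nn.functional as F

batch, heads, seq, head_dim = 2, 8, 1024, 64
q = torch.randn(batch, heads, seq, head_dim, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Fused attention: the seq x seq score matrix is never materialized in HBM.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)  # (2, 8, 1024, 64)
```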
Enables advanced vision-language capabilities for image understanding, multi-turn visual conversations, and document analysis.
Serves Large Language Models with maximum throughput and efficiency using vLLM's PagedAttention and continuous batching.
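Minimal offline vLLM usage; the model id is a small placeholder, and PagedAttention plus continuous batching are handled internally by the engine:

```python
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # placeholder model id
params = SamplingParams(temperature=0.8, max_tokens=64)

outputs = llm.generate(["The key idea behind PagedAttention is"], params)
print(outputs[0].outputs[0].text)
```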
Compresses large language models using teacher-student learning techniques to reduce inference costs while maintaining high performance.
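The classic distillation objective blends a KL divergence between temperature-softened teacher and student logits with the hard-label loss; a generic sketch, not this skill's specific recipe:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradients stay comparable across temperatures
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

s, t = torch.randn(4, 10), torch.randn(4, 10)
y = torch.randint(0, 10, (4,))
print(distillation_loss(s, t, y))
```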
Accelerates LLM inference by up to 3.6x using advanced decoding techniques like Medusa heads and lookahead decoding.
Performs declarative causal interventions and mechanistic interpretability experiments on PyTorch models.