Explore our complete collection of Claude skills to extend the capabilities of AI agents.
Streamlines deep learning development by decoupling research code from engineering boilerplate for automated distributed training and hardware scaling.
Implements and optimizes RWKV architectures, a hybrid RNN-Transformer design offering linear-time inference, constant memory use, and effectively unbounded context length.
Accelerates large-scale similarity search and clustering for dense vectors using Facebook AI's high-performance library.
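As a minimal sketch of exact nearest-neighbour search with FAISS (the dimensionality and random vectors below are purely illustrative):

    import numpy as np
    import faiss

    d = 128                                              # vector dimensionality (assumed)
    xb = np.random.random((10000, d)).astype("float32")  # database vectors
    xq = np.random.random((5, d)).astype("float32")      # query vectors

    index = faiss.IndexFlatL2(d)                         # exact L2 index
    index.add(xb)                                        # index the database vectors
    distances, neighbors = index.search(xq, 5)           # 5 nearest neighbours per query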
Performs declarative causal interventions and mechanistic interpretability experiments on PyTorch models.
Implements Anthropic's Constitutional AI method to train harmless, helpful models through self-critique and automated AI feedback.
Optimizes large-scale language model training using NVIDIA Megatron-Core with advanced 3D and expert parallelism strategies.
Accelerates Large Language Model inference on NVIDIA GPUs using state-of-the-art optimization techniques for maximum throughput and minimal latency.
Implements programmable safety rails and validation for LLM applications to prevent jailbreaks, hallucinations, and PII leaks.
Implements and trains minimalist GPT architectures for educational and research purposes using Andrej Karpathy's clean, hackable codebase.
Simplifies Large Language Model implementation, training, and fine-tuning using clean, production-ready LitGPT architectures.
Streamlines the fine-tuning process for over 100 large language models using the LLaMA-Factory framework and QLoRA techniques.
Optimizes LLM serving and structured generation using RadixAttention prefix caching for high-performance agentic workflows.
Integrates comprehensive tracing, evaluation, and monitoring tools to debug and optimize Large Language Model (LLM) applications.
Deploys high-performance Reinforcement Learning from Human Feedback (RLHF) workflows using Ray and vLLM acceleration for large-scale model alignment.
Facilitates mechanistic interpretability research by providing tools to inspect, cache, and manipulate transformer model activations via HookPoints.
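A minimal sketch of caching activations with TransformerLens (the model and hook shown are illustrative choices):

    from transformer_lens import HookedTransformer

    model = HookedTransformer.from_pretrained("gpt2")     # small model for illustration
    logits, cache = model.run_with_cache("Hello, world")  # forward pass, caching every HookPoint
    attn_pattern = cache["pattern", 0]                    # layer-0 attention patterns from the cache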
Simplifies large language model alignment using reference-free preference optimization, improving model quality without PPO's reward-model rollouts or DPO's reference-model overhead.
Monitors, traces, and evaluates LLM applications using an open-source, OpenTelemetry-based observability platform.
Quantizes Large Language Models to 4/3/2-bit precision without calibration data for faster inference and reduced memory footprint.
Fine-tunes large language models using LoRA, QLoRA, and other parameter-efficient methods to drastically reduce memory and compute requirements.
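A minimal LoRA sketch with Hugging Face PEFT (the base model and hyperparameters are placeholders, not recommendations):

    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    base = AutoModelForCausalLM.from_pretrained("gpt2")      # any causal LM works here
    lora = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                      task_type="CAUSAL_LM")                 # low-rank adapter configuration
    model = get_peft_model(base, lora)                        # base weights stay frozen
    model.print_trainable_parameters()                        # only the adapters are trainable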
Builds LLM-powered applications using agents, retrieval-augmented generation (RAG), and modular chains.
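A minimal sketch of a modular chain in LangChain's expression language (the OpenAI model name is a placeholder and an API key is assumed):

    from langchain_openai import ChatOpenAI
    from langchain_core.prompts import ChatPromptTemplate
    from langchain_core.output_parsers import StrOutputParser

    prompt = ChatPromptTemplate.from_template("Summarize in one sentence: {text}")
    chain = prompt | ChatOpenAI(model="gpt-4o-mini") | StrOutputParser()   # prompt -> model -> parser
    print(chain.invoke({"text": "RAG grounds LLM answers in retrieved documents."}))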
Implements and optimizes Mixture of Experts (MoE) architectures to scale model capacity while reducing training and inference costs.
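As a conceptual illustration only (a toy sketch, not any particular library's implementation), a top-k routed MoE layer in PyTorch might look like:

    import torch
    import torch.nn as nn

    class ToyMoE(nn.Module):
        def __init__(self, dim, num_experts=4, top_k=2):
            super().__init__()
            self.router = nn.Linear(dim, num_experts)          # gating network
            self.experts = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_experts))
            self.top_k = top_k

        def forward(self, x):                                  # x: (tokens, dim)
            weights, idx = self.router(x).softmax(-1).topk(self.top_k, dim=-1)
            out = torch.zeros_like(x)
            for k in range(self.top_k):                        # each token visits its top-k experts
                for e, expert in enumerate(self.experts):
                    mask = idx[:, k] == e
                    if mask.any():
                        out[mask] += weights[mask, k, None] * expert(x[mask])
            return out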
Orchestrates distributed machine learning training across clusters to scale PyTorch, TensorFlow, and Hugging Face models.
Optimizes Large Language Models using 4-bit post-training quantization to reduce memory usage and accelerate inference on consumer GPUs.
Extracts structured, type-safe data from Large Language Models using Pydantic validation and automatic retries.
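A minimal sketch with the instructor library (the model name and schema fields are assumptions for illustration):

    import instructor
    from openai import OpenAI
    from pydantic import BaseModel

    class Person(BaseModel):
        name: str
        age: int

    client = instructor.from_openai(OpenAI())          # patch the client to accept response_model
    person = client.chat.completions.create(
        model="gpt-4o-mini",                           # placeholder model
        response_model=Person,                         # validated against the schema, retried on failure
        messages=[{"role": "user", "content": "Ada Lovelace was 36 years old."}],
    )
    print(person.name, person.age)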
Curates high-quality datasets for LLM training using GPU-accelerated deduplication, filtering, and PII redaction.
Implements language-independent subword tokenization using BPE and Unigram algorithms for advanced AI model development.
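A minimal sketch of training and using a SentencePiece BPE model (the corpus path and vocabulary size are placeholders):

    import sentencepiece as spm

    spm.SentencePieceTrainer.train(                    # train a BPE model on a raw text corpus
        input="corpus.txt", model_prefix="bpe",
        vocab_size=8000, model_type="bpe",
    )
    sp = spm.SentencePieceProcessor(model_file="bpe.model")
    ids = sp.encode("Subword tokenization is language independent.", out_type=int)
    print(sp.decode(ids))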
Deploys and optimizes LLM inference on CPU, Apple Silicon, and consumer hardware using GGUF quantization.
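One common route is the llama-cpp-python bindings; a minimal sketch, with the GGUF path and prompt as placeholders:

    from llama_cpp import Llama

    llm = Llama(model_path="models/model.Q4_K_M.gguf", n_ctx=4096)   # 4-bit GGUF weights
    out = llm("Q: Why quantize to GGUF?\nA:", max_tokens=64, stop=["\n"])
    print(out["choices"][0]["text"])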
Guarantees valid, type-safe JSON and structured outputs from Large Language Models using grammar-based constraints.
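A minimal sketch, assuming the pre-1.0 Outlines API (outlines.generate.json); the model choice and schema are placeholders:

    import outlines
    from pydantic import BaseModel

    class Flight(BaseModel):
        origin: str
        destination: str

    model = outlines.models.transformers("microsoft/Phi-3-mini-4k-instruct")
    generator = outlines.generate.json(model, Flight)   # decoding constrained to the schema's grammar
    flight = generator("Book me a flight from Paris to Tokyo.")
    print(flight.origin, flight.destination)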
Transcribes audio, translates speech to English, and automates multilingual audio processing using OpenAI's Whisper models.
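A minimal sketch with the openai-whisper package (the audio path and model size are placeholders):

    import whisper

    model = whisper.load_model("small")
    result = model.transcribe("meeting.m4a")                     # transcribe in the spoken language
    print(result["text"])
    english = model.transcribe("meeting.m4a", task="translate")  # translate the speech to English
    print(english["text"])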
Integrates Salesforce's BLIP-2 framework to enable advanced image captioning, visual question answering, and multimodal reasoning within AI workflows.
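A minimal visual question answering sketch with the Hugging Face BLIP-2 classes (the checkpoint, image path, and question are placeholders):

    from PIL import Image
    from transformers import Blip2Processor, Blip2ForConditionalGeneration

    processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
    model = Blip2ForConditionalGeneration.from_pretrained("Salesforce/blip2-opt-2.7b")

    image = Image.open("photo.jpg")
    inputs = processor(images=image,
                       text="Question: what is in the picture? Answer:",
                       return_tensors="pt")
    ids = model.generate(**inputs, max_new_tokens=30)
    print(processor.decode(ids[0], skip_special_tokens=True))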