Data Science & ML Agent Skills

Discover Agent Skills for data science & ml. Browse 61 skills for Claude, ChatGPT & Codex.

UniProt Database Search & Mapping

Accesses the UniProt knowledgebase to search, retrieve, and map protein sequence and functional information.

Economist Analyst

Analyzes market events and policy changes using rigorous economic frameworks and diverse schools of thought.

Statistical Analysis Toolset

Performs comprehensive hypothesis testing, regression analysis, and Bayesian statistics with automated assumption checking and APA-style reporting.

RAG Implementation Toolkit

Builds robust Retrieval-Augmented Generation systems using vector databases, semantic search, and optimized retrieval pipelines.

HuggingFace Model Inference

Provides procedural guidance for setting up HuggingFace model inference services using Flask, covering environment setup, model caching, and robust API implementation.

Scanpy Single-Cell Analysis

Performs comprehensive single-cell RNA-seq data analysis and visualization using the Scanpy Python framework.

CZ CELLxGENE Census

Facilitates programmatic access and analysis of the CZ CELLxGENE Census database containing over 61 million single-cell genomics records.

PyTDC Drug Discovery ML

Provides AI-ready datasets, benchmarks, and molecular oracles for drug discovery and therapeutics machine learning.

Pymatgen Materials Science

Automates materials science workflows including crystal structure analysis, phase diagrams, and Materials Project integration.

Excel Analysis & Automation

Automates complex Excel data processing, visualization, and formatting using powerful Python libraries like Pandas and OpenPyXL.

NeuroKit2 Biosignal Processing

Processes and analyzes physiological signals including ECG, EEG, EDA, and respiratory patterns for research and clinical applications.

LangChain Architecture

Architects sophisticated LLM applications using the LangChain framework with support for autonomous agents, memory management, and RAG patterns.

SHAP Model Interpretability

Explains machine learning model predictions and feature importance using Shapley values to provide transparent and actionable AI insights.

PyTorch Geometric (PyG)

Simplifies the development and training of Graph Neural Networks (GNNs) for deep learning on irregular and relational data structures.

Exploratory Data Analysis Toolkit

Automates comprehensive statistical analysis and visual profiling for diverse datasets to uncover hidden patterns, anomalies, and actionable insights.

COBRApy Metabolic Modeling

Facilitates constraint-based reconstruction and analysis (COBRA) of metabolic models for systems biology and metabolic engineering.

PathML

Analyzes whole-slide pathology images and multiparametric imaging data using computational tools for tissue segmentation, spatial graphs, and machine learning.

PyTorch Lightning

Organizes and scales PyTorch deep learning workflows by automating training loops, hardware orchestration, and boilerplate code.

LLM Project Development

Builds production-grade LLM applications using structured pipelines, task-model fit analysis, and deterministic architecture patterns.

DrugBank Database

Accesses and analyzes comprehensive pharmaceutical data from DrugBank to perform drug discovery research, interaction analysis, and target identification.

ML Model Leaderboard Navigator

Provides systematic guidance for identifying, verifying, and extracting current performance data from machine learning benchmarks and embedding leaderboards.

PyDESeq2 Gene Expression Analysis

Identifies differentially expressed genes from bulk RNA-seq counts using the PyDESeq2 statistical framework.

scvi-tools

Empowers single-cell omics analysis with deep generative models for dimensionality reduction, batch correction, and multimodal data integration.

Pymoo Optimization

Implements advanced multi-objective and many-objective optimization frameworks using state-of-the-art evolutionary algorithms and Pareto analysis.

Dask Parallel Computing

Enables parallel and distributed computing in Python to scale pandas and NumPy operations beyond memory limits.

TorchDrug Scientific Computing

Accelerates drug discovery and molecular research by providing specialized tools for graph neural networks, protein modeling, and chemical property prediction.

STRING Database Analysis

Queries the STRING database to analyze protein-protein interaction networks and perform comprehensive functional enrichment for systems biology.

Open Targets Scientific Database

Queries the Open Targets Platform to identify therapeutic drug targets, evaluate disease associations, and analyze clinical trial data.

Scientific Peer Review Toolkit

Provides a systematic framework for evaluating the methodology, statistics, and integrity of scientific manuscripts and grant proposals.

Scientific Resource Detection & Strategy

Identifies system hardware capabilities and provides data-driven recommendations for optimizing computationally intensive tasks like model training and large-scale data processing.

30 results loaded • More available

Scroll for more results...