Discover Agent Skills for data science & ml. Browse 61 skills for Claude, ChatGPT & Codex.
Accesses the UniProt knowledgebase to search, retrieve, and map protein sequence and functional information.
Analyzes market events and policy changes using rigorous economic frameworks and diverse schools of thought.
Performs comprehensive hypothesis testing, regression analysis, and Bayesian statistics with automated assumption checking and APA-style reporting.
Builds robust Retrieval-Augmented Generation systems using vector databases, semantic search, and optimized retrieval pipelines.
Provides procedural guidance for setting up HuggingFace model inference services using Flask, covering environment setup, model caching, and robust API implementation.
Performs comprehensive single-cell RNA-seq data analysis and visualization using the Scanpy Python framework.
Facilitates programmatic access and analysis of the CZ CELLxGENE Census database containing over 61 million single-cell genomics records.
Provides AI-ready datasets, benchmarks, and molecular oracles for drug discovery and therapeutics machine learning.
Automates materials science workflows including crystal structure analysis, phase diagrams, and Materials Project integration.
Automates complex Excel data processing, visualization, and formatting using powerful Python libraries like Pandas and OpenPyXL.
Processes and analyzes physiological signals including ECG, EEG, EDA, and respiratory patterns for research and clinical applications.
Architects sophisticated LLM applications using the LangChain framework with support for autonomous agents, memory management, and RAG patterns.
Explains machine learning model predictions and feature importance using Shapley values to provide transparent and actionable AI insights.
Simplifies the development and training of Graph Neural Networks (GNNs) for deep learning on irregular and relational data structures.
Automates comprehensive statistical analysis and visual profiling for diverse datasets to uncover hidden patterns, anomalies, and actionable insights.
Facilitates constraint-based reconstruction and analysis (COBRA) of metabolic models for systems biology and metabolic engineering.
Analyzes whole-slide pathology images and multiparametric imaging data using computational tools for tissue segmentation, spatial graphs, and machine learning.
Organizes and scales PyTorch deep learning workflows by automating training loops, hardware orchestration, and boilerplate code.
Builds production-grade LLM applications using structured pipelines, task-model fit analysis, and deterministic architecture patterns.
Accesses and analyzes comprehensive pharmaceutical data from DrugBank to perform drug discovery research, interaction analysis, and target identification.
Provides systematic guidance for identifying, verifying, and extracting current performance data from machine learning benchmarks and embedding leaderboards.
Identifies differentially expressed genes from bulk RNA-seq counts using the PyDESeq2 statistical framework.
Empowers single-cell omics analysis with deep generative models for dimensionality reduction, batch correction, and multimodal data integration.
Implements advanced multi-objective and many-objective optimization frameworks using state-of-the-art evolutionary algorithms and Pareto analysis.
Enables parallel and distributed computing in Python to scale pandas and NumPy operations beyond memory limits.
Accelerates drug discovery and molecular research by providing specialized tools for graph neural networks, protein modeling, and chemical property prediction.
Queries the STRING database to analyze protein-protein interaction networks and perform comprehensive functional enrichment for systems biology.
Queries the Open Targets Platform to identify therapeutic drug targets, evaluate disease associations, and analyze clinical trial data.
Provides a systematic framework for evaluating the methodology, statistics, and integrity of scientific manuscripts and grant proposals.
Identifies system hardware capabilities and provides data-driven recommendations for optimizing computationally intensive tasks like model training and large-scale data processing.
Scroll for more results...