Accesses and manages AI-ready drug discovery datasets, benchmarks, and molecular oracles for therapeutic machine learning.
PyTDC is an open-science platform designed to streamline drug discovery and development by providing standardized, AI-ready datasets across the entire therapeutics pipeline. It offers specialized tools for predicting molecular properties like ADME and toxicity, modeling drug-target interactions, and generating novel molecules with specific desired traits. By integrating benchmarks and specialized data split strategies like scaffold and cold-start splits, it ensures that machine learning models are evaluated rigorously for real-world pharmaceutical applications.
主な機能
01Standardized evaluation metrics for pharmacokinetics and toxicity prediction
02Access to 100+ AI-ready drug discovery and therapeutic datasets
03Integration with 17+ molecular oracles for goal-directed generation
04Automated data processing and format conversion for PyG, DGL, and SMILES
051 GitHub stars
06Advanced data splitting strategies including scaffold and cold-target splits
ユースケース
01Benchmarking drug-target interaction (DTI) models on standardized datasets
02Predicting ADME and toxicity profiles for new drug candidates
03Optimizing molecular generation for specific biological targets using oracles