01Chemical data loaders for SMILES, SDF, and FASTA protein sequences
0281 GitHub stars
03Specialized molecular featurization (Circular Fingerprints, GNN graphs, RDKit descriptors)
04Pretrained model integration for transfer learning with ChemBERTa and GROVER
05Advanced data splitting strategies including Scaffold and Butina splitting to prevent leakage
06Support for Graph Neural Networks like GCN, GAT, and MPNN