010 GitHub stars
02Support for state-of-the-art pretrained embeddings like ChemBERTa, ChemGPT, and Graphormer
03Scikit-learn compatible transformers for seamless integration into machine learning pipelines
04Unified interface for 100+ featurizers including ECFP, MACCS, and Mordred descriptors
05Robust error handling and validation for large-scale SMILES dataset processing
06High-performance parallel processing and built-in caching for expensive computations