01 3,983 GitHub stars
02 Zero-shot image classification using natural language labels
03 Cross-modal retrieval for image-to-text and text-to-image matching
04 Support for multiple architectures including ResNet-50 and Vision Transformers (ViT)
05 Semantic image search and indexing via vector embeddings
06 Automated content moderation and NSFW detection
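
Zero-shot classification and embedding-based search, as listed above, both come down to comparing vectors in a shared image/text space. The sketch below illustrates that idea in plain Python. It does not call the project's API: the function names (`zero_shot_classify`, `rank_images`), the three-dimensional toy vectors, and the temperature of 100 are all assumptions made for illustration. In practice the vectors would come from the model's image and text encoders.

```python
import math


def normalize(v):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cosine(a, b):
    """Cosine similarity between two vectors of equal length."""
    return sum(x * y for x, y in zip(normalize(a), normalize(b)))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def zero_shot_classify(image_vec, label_vecs, temperature=100.0):
    """Return a probability per label from scaled cosine similarities.

    `temperature` is an assumed scaling factor, not a value taken from the source.
    """
    labels = list(label_vecs)
    logits = [temperature * cosine(image_vec, label_vecs[label]) for label in labels]
    return dict(zip(labels, softmax(logits)))


def rank_images(query_vec, image_index):
    """Return image names sorted by similarity to a text query, best match first."""
    return sorted(
        image_index,
        key=lambda name: cosine(query_vec, image_index[name]),
        reverse=True,
    )


# Hypothetical toy embeddings standing in for real encoder outputs.
label_vecs = {
    "a photo of a dog": [0.9, 0.1, 0.0],
    "a photo of a cat": [0.1, 0.9, 0.0],
    "a photo of a car": [0.0, 0.1, 0.9],
}
image_vec = [0.8, 0.2, 0.1]

# Zero-shot classification: pick the label whose text vector is closest to the image.
probs = zero_shot_classify(image_vec, label_vecs)
best_label = max(probs, key=probs.get)

# Text-to-image retrieval: rank an indexed set of image vectors against a text query.
image_index = {
    "dog.jpg": [0.85, 0.15, 0.05],
    "cat.jpg": [0.2, 0.8, 0.1],
    "car.jpg": [0.05, 0.1, 0.95],
}
ranking = rank_images(label_vecs["a photo of a cat"], image_index)

print(best_label)
print(ranking)
```

Because the labels are ordinary text, adding a new class only means embedding one more prompt, with no retraining. The same similarity function serves classification and search; a real index over many images would typically precompute normalized vectors and use an approximate nearest-neighbor library rather than a full sort.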