01 Generates natural language descriptions of images
02 Identifies and locates specific objects within images
03 Answers questions about image content
04 Uses quantized 8-bit models for efficient inference
05 Automatically handles model downloading and environment setup
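
To illustrate what 8-bit quantization means in the list above, here is a minimal sketch of symmetric int8 quantization in plain Python. It is not this project's implementation: the function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical, and real inference stacks typically quantize per channel or per block with optimized kernels.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization (illustrative assumption, not the project's code).

    Maps each float to an integer in [-127, 127] using a single shared scale.
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    # Guard against an all-zero input, where any scale works.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float values from the int8 codes and the scale."""
    return [q * scale for q in quantized]


# Hypothetical weights, chosen only to demonstrate the round trip.
weights = [0.5, -1.0, 0.25, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)
```

Each weight is stored in one byte instead of four, at the cost of a rounding error of at most about half the scale per value, which is the usual trade-off behind faster, smaller quantized models.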