Structured output in JSONL format for easy data ingestion
Configurable frame extraction rate (FPS) via ffmpeg integration
Automatic Markdown generation with optional embedded frame snapshots
Parallelized OCR processing powered by the native Apple Vision framework
Intelligent frame deduplication using average perceptual hashing (aHash)
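
To illustrate the deduplication and JSONL features listed above, here is a minimal sketch of average hashing (aHash) in plain Python. It is not the tool's actual implementation: the function names, the `max_distance` threshold, the record fields, and the choice to compare each frame only against the last kept frame are all assumptions made for illustration. Frames are assumed to be already downscaled to a small grayscale grid (8x8 here), which in practice an image library would do.

```python
import json


def average_hash(pixels):
    """Compute an aHash from a 2D list of grayscale values (e.g. 8x8).

    Each pixel contributes one bit: 1 if it is brighter than the mean.
    """
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a, b):
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


def dedupe(frames, max_distance=5):
    """Keep a frame only if its hash differs enough from the last kept one.

    frames: iterable of (timestamp, pixels) pairs.
    max_distance: hypothetical threshold, not taken from the tool.
    """
    kept = []
    last_hash = None
    for timestamp, pixels in frames:
        h = average_hash(pixels)
        if last_hash is None or hamming(h, last_hash) > max_distance:
            kept.append({"t": timestamp, "hash": format(h, "016x")})
            last_hash = h
    return kept


def to_jsonl(records):
    """Serialize records as JSONL: one JSON object per line."""
    return "\n".join(json.dumps(r) for r in records)


if __name__ == "__main__":
    left_dark = [[0] * 4 + [255] * 4 for _ in range(8)]
    left_dark_noisy = [[10] * 4 + [250] * 4 for _ in range(8)]
    top_bright = [[255] * 8] * 4 + [[0] * 8] * 4
    frames = [(0.0, left_dark), (1.0, left_dark_noisy), (2.0, top_bright)]
    print(to_jsonl(dedupe(frames)))
```

The second frame differs from the first only in brightness, so it produces the same hash and is dropped. The third has a different layout and is kept. Comparing against the last kept frame suits sequential video, where duplicates tend to be adjacent; comparing against every kept hash would catch repeats that recur later, at higher cost.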