01Promptable text-to-mask segmentation and 3D reconstruction via SAM 3
02End-to-end real-time object detection using YOLO26 NMS-free architectures
0331,722 GitHub stars
04Edge device optimization for ONNX, TensorRT, and specialized NPUs
05Advanced visual reasoning and grounding using Vision Language Models (VLMs)
06High-precision spatial analysis including monocular depth and sub-pixel calibration