01Read local or URL images, returning dataURL and size information
02Perform OCR, visual question answering, or object detection on images using GLM-4.5V
03Process diverse document (PDF, DOCX, XLSX, PPTX, CSV, TXT) and image (PNG, JPG, JPEG) file formats
04Extract content from files with customizable prompts to guide the extraction process
05Return structured JSON results including extracted content and file metadata
060 GitHub stars