01. Extracts raw text content directly from URLs
02. Extracts content from media files using AI-powered OCR and computer vision
03. Fetches fully rendered HTML content using a headless browser
04. Supports configuration via environment variables
05. Converts web page content to well-formatted Markdown
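
As a rough illustration of two of the features listed above (raw text extraction and configuration via environment variables), here is a minimal sketch using only the Python standard library. It is not this tool's actual implementation. The function names and the variable names `FETCH_TIMEOUT_SECONDS` and `FETCH_USER_AGENT` are hypothetical and chosen only for the example. The sketch works on an HTML string and does not perform the network fetch itself.

```python
import os
from html.parser import HTMLParser


def load_config(env=os.environ):
    """Read settings from environment variables, falling back to defaults.

    The variable names here are hypothetical, not the tool's real settings.
    """
    return {
        "timeout": float(env.get("FETCH_TIMEOUT_SECONDS", "30")),
        "user_agent": env.get("FETCH_USER_AGENT", "example-fetcher/0.1"),
    }


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping the bodies of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside <script> or <style>
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped elements.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return " ".join(self._chunks)


def html_to_text(html):
    """Return the visible text of an HTML string as one line."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

For example, `html_to_text("<h1>Title</h1><p>Hello <b>world</b></p>")` returns `"Title Hello world"`, and `load_config({"FETCH_TIMEOUT_SECONDS": "5"})` yields a timeout of `5.0` with the default user agent. Passing the environment as a parameter keeps the configuration logic easy to test without touching the real process environment.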