01 Extracts core article content while removing web clutter and noise
02 Converts HTML to structured Markdown, preserving essential formatting
03 Supports custom output directories for organized content management
04 Automatically generates sanitized filenames based on the article title
05 0 GitHub stars
06 Utilizes Mozilla's Readability heuristics for high extraction accuracy
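
The filename and output-directory features listed above could work roughly as in the sketch below. The tool's actual sanitization rules are not stated here, so the function names `sanitize_title` and `output_path`, the 80-character limit, the `untitled` fallback, and the `.md` extension are all assumptions made for illustration.

```python
import re
import unicodedata
from pathlib import Path


def sanitize_title(title: str, max_length: int = 80) -> str:
    """Turn an article title into a filesystem-safe slug (hypothetical rules)."""
    # Fold accented characters to their closest ASCII equivalents.
    ascii_title = (
        unicodedata.normalize("NFKD", title).encode("ascii", "ignore").decode("ascii")
    )
    # Replace every run of characters other than letters and digits with one hyphen.
    slug = re.sub(r"[^A-Za-z0-9]+", "-", ascii_title).strip("-").lower()
    # Truncate long titles and fall back to a fixed name if nothing survives.
    return slug[:max_length].rstrip("-") or "untitled"


def output_path(title: str, output_dir: str = ".") -> Path:
    """Build the Markdown file path inside the chosen output directory."""
    return Path(output_dir) / f"{sanitize_title(title)}.md"
```

Under these assumptions, `output_path("Hello, World: A Guide!", "articles")` yields `articles/hello-world-a-guide.md`, and a title made up only of punctuation falls back to `untitled.md`.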