Website Scraper icon

Website Scraper

Extracts and converts website content to Markdown format.

About

This tool scrapes websites and converts HTML content into clean, high-quality Markdown using Mozilla's Readability library and TurndownService. It can be used as a command-line tool for one-off conversions or as an MCP server for continuous scraping and conversion. It also supports direct conversion of local HTML files and securely handles HTML by removing potentially harmful script tags.

Key Features

  • Converts clean HTML to Markdown with TurndownService.
  • Functions as a command-line tool and an MCP server.
  • Extracts meaningful content from web pages using Mozilla's Readability.
  • Securely handles HTML by removing potentially harmful script tags.
  • Supports direct conversion of local HTML files to Markdown.

Use Cases

  • Automated content extraction for documentation.
  • Content repurposing for different platforms.
  • Webpage archiving in Markdown format.