Extracts and cleans web content, filtering ads and irrelevant elements, and converts it to a clean Markdown format for various applications.
CleanWeb is a lightweight Model Context Protocol (MCP) server designed to intelligently process web pages. It excels at extracting core content, automatically filtering out advertisements, navigation, sidebars, and other distracting elements. Utilizing technologies like Axios, Cheerio, and Readability, it transforms complex HTML into a clean, readable Markdown format, with an option for JSON output including metadata. Its zero-browser dependency approach ensures simple and fast deployment, making it ideal for integration with AI assistants and other applications requiring optimized web content.