Moz Readability Parser icon

Moz Readability Parser

Createdemzimmer

Extracts and transforms webpage content into clean, LLM-optimized Markdown using Mozilla's Readability algorithm.

About

This server leverages Mozilla's Readability algorithm to extract and transform webpage content into clean, LLM-optimized Markdown. It removes ads, navigation, footers, and other non-essential elements, while preserving the core content structure. In addition to the main content, it returns article title, excerpt, byline, and site name, providing comprehensive metadata for effective LLM processing.

Key Features

  • Removes ads, navigation, and footers from web pages.
  • Converts HTML content into well-formatted Markdown.
  • Returns article metadata (title, excerpt, byline, site name).
  • Handles errors gracefully.
  • Reduces token usage by removing unnecessary HTML/CSS.

Use Cases

  • Cleaning web content for LLM ingestion.
  • Automating content extraction from websites.
  • Improving the quality of data for machine learning models.