01File Extraction: Converts content to HTML (with CSS) or plain text using Apache Tika.
02File Listing: Scans a designated directory to list available files with details like size and MIME type.
03REST Testing Endpoints: Provides API endpoints for easy testing, including direct raw HTML serving for browser rendering.
04Metadata Extraction: Retrieves key metadata such as title, author, content type, and creation date.
05MCP Integration: Exposes four synchronous tools for content extraction, text extraction, file listing, and metadata retrieval.
060 GitHub stars