概要
This skill provides a comprehensive suite of production-ready scraping tools designed for building RAG knowledge bases and collecting technical documentation. It features a decision matrix for choosing between Playwright for dynamic sites, BeautifulSoup for static parsing, and Scrapy for large-scale crawls. With built-in rate limiting, error handling, and markdown conversion, it ensures ethical and efficient data harvesting while respecting website resources and robots.txt protocols.