Builds resilient data ingestion pipelines that handle paginated API results with state tracking and historical backfills.
Incremental Fetch is a specialized skill designed to automate the creation of robust data pipelines that never lose progress or duplicate records. By implementing the 'Two Watermarks' pattern, it enables Claude to track both the newest and oldest records, allowing for seamless forward updates and backward historical backfills. It prioritizes resilience by saving data records page-by-page while deferring watermark updates until successful completion, making it ideal for large-scale data ingestion from platforms like X (Twitter), financial exchanges, and complex REST APIs where rate limits and connectivity issues are common.
주요 기능
01Resilient page-by-page data persistence to prevent loss on interruption
02Support for ID-based, cursor-based, and timestamp-based pagination types
0310 GitHub stars
04State management logic for resuming interrupted downloads without duplicates
05Two-watermark pattern for managing forward updates and historical backfills
06Configurable retry mechanisms with exponential backoff and jitter
사용 사례
01Building a continuous sync pipeline for social media mentions or posts
02Creating reliable ingestion scripts for third-party SaaS platform data
03Performing massive historical data backfills from financial market APIs