Automates comprehensive PDF operations including extraction, manipulation, and creation using Python and command-line tools.
This skill provides a robust framework for handling PDF documents directly within Claude Code, offering standardized patterns for complex document workflows. It enables developers to merge and split files, extract structured text and tables via pdfplumber, generate professional reports with reportlab, and perform OCR on scanned documents. By integrating both high-level Python libraries and powerful CLI utilities like qpdf and poppler, it ensures high-performance processing for data extraction, automated reporting, and secure document management.
主要功能
01Security features for password encryption, decryption, and watermarking
02Programmatic PDF generation with reportlab and Platypus templates
0331,722 GitHub stars
04Advanced text and table extraction using pdfplumber and pandas
05Comprehensive file manipulation including merging, splitting, and rotating
06OCR capabilities for converting scanned documents into searchable text
使用场景
01Automating the extraction of financial data from PDF statements into structured Excel files
02Generating dynamic invoices and business reports from raw database records
03Batch processing document archives for security compliance and watermark application