Manipulates and processes PDF documents programmatically through text extraction, document merging, and professional-grade file generation.
This Claude Code skill equips Claude with specialized knowledge for handling complex PDF tasks using industry-standard Python libraries and command-line tools. Whether you need to extract structured table data from financial reports, automate the generation of custom documents with ReportLab, merge/split files with qpdf, or perform OCR on scanned documents, this skill provides the implementation patterns and best practices required to process PDFs at scale. It covers everything from basic metadata extraction to advanced form handling and layout preservation, making it an essential tool for document-heavy automation workflows.
主な機能
01Security features for document encryption and password protection
02Automated PDF generation and multi-page document creation using ReportLab
03Advanced document operations including merging, splitting, and rotating pages
04OCR capabilities for converting scanned documents into searchable text
056 GitHub stars
06Comprehensive text and table extraction with layout preservation
ユースケース
01Automating the extraction of tabular data from financial statements into Excel or CSV
02Building automated document generation pipelines for custom invoices, reports, or certificates
03Programmatically merging multiple source documents into a single cohesive PDF for distribution