Can I use this for non-Python projects?

While the logic can be adapted, the skill is currently optimized for Python projects using structures like src/core/pdf_processor.py.

Does this skill support OCR for scanned documents?

No, this skill focuses on direct text extraction and explicitly excludes external OCR dependencies to maintain determinism and minimize external API calls.
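With no OCR fallback, a scanned PDF that lacks a text layer yields nothing to extract. The sketch below shows one way to make that outcome explicit and keep the output repeatable. It assumes some extractor has already produced one string per page; `normalize_pages` and `NoTextLayerError` are hypothetical names used for illustration, not part of the skill.

```python
import unicodedata


class NoTextLayerError(ValueError):
    """Hypothetical error: no page yielded any text (likely a scanned PDF)."""


def normalize_pages(pages: list[str]) -> str:
    """Join per-page text into one string with a stable, repeatable form."""
    cleaned = []
    for page in pages:
        # NFC normalization so equivalent Unicode sequences compare equal.
        text = unicodedata.normalize("NFC", page)
        # Collapse whitespace runs within each line and drop empty lines.
        lines = [" ".join(line.split()) for line in text.splitlines()]
        cleaned.append("\n".join(line for line in lines if line))
    if not any(cleaned):
        # No OCR fallback: fail loudly instead of returning an empty string.
        raise NoTextLayerError("no extractable text; the PDF may be a scanned image")
    # Keep page order; skip pages that turned out empty.
    return "\n\n".join(page for page in cleaned if page)
```

Raising an error for an empty result, rather than returning an empty string, lets the caller report the scanned-document case to the user.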

Where are the validation rules typically implemented?

Validation rules are generally managed within src/utils/validation.py, covering file types and size limits as defined in the project requirements.
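As a rough illustration of what file type and size rules can look like, here is a minimal sketch using only the standard library. The function name `validate_upload`, the 5 MiB limit, and the allow-list are assumptions chosen for the example; the real values come from the project requirements.

```python
from pathlib import Path

MAX_UPLOAD_BYTES = 5 * 1024 * 1024  # assumed limit; use the project's real value
ALLOWED_EXTENSIONS = {".pdf"}       # assumed allow-list
PDF_MAGIC = b"%PDF-"                # PDF files begin with this header


def validate_upload(filename: str, data: bytes) -> list[str]:
    """Return a list of validation problems; an empty list means the file passed."""
    problems = []
    if Path(filename).suffix.lower() not in ALLOWED_EXTENSIONS:
        problems.append("unsupported file type")
    if len(data) == 0:
        problems.append("file is empty")
    elif len(data) > MAX_UPLOAD_BYTES:
        problems.append("file exceeds size limit")
    # The extension alone can be spoofed, so also check the header bytes.
    # This is a strict check: the header must be at the very start of the file.
    if data and not data.startswith(PDF_MAGIC):
        problems.append("content is not a PDF")
    return problems
```

Returning every problem at once, instead of stopping at the first, makes it easier to show the user a complete error message.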

How does the skill ensure extraction reliability?

It emphasizes deterministic behavior and provides a step-by-step approach to updating core processor logic and corresponding integration tests.
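One way an integration test might check deterministic behavior is to run the extractor several times on the same bytes and compare digests of the output. The sketch below is an assumption about how such a check could look; `is_deterministic` and `output_digest` are hypothetical helpers, and `extract` stands in for whatever function the processor exposes.

```python
import hashlib
from typing import Callable


def output_digest(text: str) -> str:
    """SHA-256 hex digest of the extracted text, encoded as UTF-8."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def is_deterministic(extract: Callable[[bytes], str], data: bytes, runs: int = 3) -> bool:
    """Return True if repeated runs on the same input give identical output."""
    digests = {output_digest(extract(data)) for _ in range(runs)}
    return len(digests) == 1
```

Storing the digest of a known-good run alongside a fixture file would also let a test catch output changes between versions, not only within a single run.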

PDF Processing and Validation

Name: PDF Processing and Validation
Author: gw-ai-security

Security & Testing

Implements robust PDF upload validation and deterministic text extraction workflows for document ingestion pipelines.

The PDF Processing skill provides specialized guidance for managing the lifecycle of PDF document ingestion, specifically within Python-based environments. It focuses on implementing strict file validation rules, such as size and type constraints, and ensuring deterministic text extraction without the need for external OCR dependencies. This skill is ideal for developers working on applications like Applicant Tracking Systems (ATS) where reliable document parsing, error handling, and comprehensive unit testing are critical for processing resumes and other structured documents.
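The error handling mentioned above could follow a pattern like the one sketched here: a small exception hierarchy with stable codes and user-facing messages. All class names, codes, and messages are hypothetical and chosen only for illustration; the skill's own patterns may differ.

```python
class PdfProcessingError(Exception):
    """Hypothetical base class so callers can catch every ingestion failure."""

    code = "pdf_error"
    user_message = "The document could not be processed."


class InvalidFileTypeError(PdfProcessingError):
    code = "invalid_file_type"
    user_message = "Only PDF files are accepted."


class FileTooLargeError(PdfProcessingError):
    code = "file_too_large"
    user_message = "The file exceeds the upload size limit."


def to_error_response(exc: Exception) -> dict[str, str]:
    """Map an exception to a stable code and a message safe to show users."""
    if isinstance(exc, PdfProcessingError):
        return {"code": exc.code, "message": exc.user_message}
    # Unknown failures get a generic message so internals are not leaked.
    return {"code": "internal_error", "message": "Unexpected error."}
```

Keeping the code separate from the message lets unit tests assert on the code while the wording stays free to change.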

Key Features

01 GitHub stars: 0

02 Configurable file upload validation

03 Automated unit and integration test alignment

04 Deterministic PDF text extraction

05 Traceability to functional requirements

06 Standardized error handling patterns

Use Cases

01 Adjusting file size and type limits for document uploads

02 Implementing robust error messaging for invalid document formats

03 Enhancing text extraction reliability for resume parsing

Install with 🐟 Skill.Fish

npx skillfish add gw-ai-security/ats-cv-scorer pdf-processing