Security & Testing Agent Skills

Discover Agent Skills for security & testing. Browse 105 skills for Claude, ChatGPT & Codex.

Multi-AI Testing & TDD

Implements rigorous test-driven development workflows using independent multi-agent verification to ensure high code coverage and prevent test gaming.

Bedrock Automated Reasoning

Mathematically verifies AI-generated responses against formal policy rules with up to 99% accuracy to ensure deterministic compliance.

Review Helper

Analyzes source code to identify bugs, security risks, regressions, and gaps in test coverage.

Systematic Debugging

Implements a rigorous four-phase framework to identify root causes and ensure stable bug fixes before implementation.

Web Application Testing Toolkit

Automates local web application testing and browser interaction using Playwright and integrated server management.

Multi-AI Code Review

Automates multi-perspective code reviews by orchestrating Claude, Gemini, and Codex to perform deep analysis across five critical dimensions.

Ralph Research & Analysis

Generates structured research and analysis prompts for systematic codebase investigation and architectural planning.

Bedrock AgentCore Policy

Defines and enforces deterministic security boundaries for Amazon Bedrock agents using natural language and Cedar policy logic.

TDD Workflow

Implements a structured Red-Green-Refactor cycle for autonomous feature development and high-quality code implementation.

Codex Authentication Manager

Configures and manages OpenAI Codex CLI authentication, including OAuth login, API key orchestration, and secure multi-account switching.

Fear-of-Falling Preflight Guardrails

Enforces data integrity and R-script coding standards through strict, diff-aware guardrail checks.

Multi-AI Research & Analysis

Conducts deep, multi-perspective technical research and analysis using integrated Claude, Gemini, and Codex agents for verified, production-ready results.

Web Application Testing

Automates browser-based testing and UI verification for local web applications using Playwright.

Command Guard

Enforces security policies and safety guardrails by validating terminal commands against predefined rules before execution.

Testing Anti-Pattern Prevention

Enforces best practices in software testing by preventing mock reliance, production code pollution, and incomplete test doubles.

Agent Performance Evaluation

Builds comprehensive evaluation frameworks to measure, validate, and optimize AI agent performance and context engineering strategies.

Web Application Testing

Automates local web application testing and debugging using Playwright for comprehensive frontend verification and UI interaction.

Error Recoverer

Automates error detection, classification, and recovery to maintain continuous and resilient autonomous coding sessions.

Browser E2E Tester

Verifies feature implementation through automated browser-based end-to-end testing and acceptance criteria validation.

RSpec Matchers

Provides expert guidance on Ruby testing assertions, expectations, and the creation of custom RSpec matchers.

Jomonsho Release Preflight

Automates and standardizes the release verification process for Jomonsho projects through tiered linting, type-checking, and multi-level testing.

.NET Test Triage

Streamlines .NET test failure analysis by capturing errors and generating focused rerun filters.

Autonomous Error Recoverer

Automates the detection, classification, and resolution of errors encountered during AI-driven development sessions.

Testing Validator

Validates the functional integrity of Claude Code skills through automated example execution and systematic testing operations.

Command Guard

Enforces security policies and prevents destructive command execution by validating actions against local configuration files.

Quality Gates Manager

Enforces Definition of Done criteria by validating required checks and recording outcomes before task completion.

Self-Review Protocol

Ensures code quality and compliance through a standardized two-pass verification and quality assurance process.

MOVA Test Gate Runner

Executes and reports on validation, unit, and smoke test suites to ensure code quality before deployment.

Code Review Patterns & Security Audit

Performs rigorous security audits and code quality reviews using OWASP standards, Clean Code principles, and supply chain integrity checks.

Trust Layer TDD

Enforces test-driven development workflows by requiring failing tests before implementing behavioral code changes.

30 results loaded • More available

Scroll for more results...