Discover Agent Skills for security & testing. Browse 105 skills for Claude, ChatGPT & Codex.
Implements rigorous test-driven development workflows using independent multi-agent verification to ensure high code coverage and prevent test gaming.
Mathematically verifies AI-generated responses against formal policy rules with up to 99% accuracy to ensure deterministic compliance.
Analyzes source code to identify bugs, security risks, regressions, and gaps in test coverage.
Implements a rigorous four-phase framework to identify root causes and ensure stable bug fixes before implementation.
Automates local web application testing and browser interaction using Playwright and integrated server management.
Automates multi-perspective code reviews by orchestrating Claude, Gemini, and Codex to perform deep analysis across five critical dimensions.
Generates structured research and analysis prompts for systematic codebase investigation and architectural planning.
Defines and enforces deterministic security boundaries for Amazon Bedrock agents using natural language and Cedar policy logic.
Implements a structured Red-Green-Refactor cycle for autonomous feature development and high-quality code implementation.
Configures and manages OpenAI Codex CLI authentication, including OAuth login, API key orchestration, and secure multi-account switching.
Enforces data integrity and R-script coding standards through strict, diff-aware guardrail checks.
Conducts deep, multi-perspective technical research and analysis using integrated Claude, Gemini, and Codex agents for verified, production-ready results.
Automates browser-based testing and UI verification for local web applications using Playwright.
Enforces security policies and safety guardrails by validating terminal commands against predefined rules before execution.
Enforces best practices in software testing by preventing mock reliance, production code pollution, and incomplete test doubles.
Builds comprehensive evaluation frameworks to measure, validate, and optimize AI agent performance and context engineering strategies.
Automates local web application testing and debugging using Playwright for comprehensive frontend verification and UI interaction.
Automates error detection, classification, and recovery to maintain continuous and resilient autonomous coding sessions.
Verifies feature implementation through automated browser-based end-to-end testing and acceptance criteria validation.
Provides expert guidance on Ruby testing assertions, expectations, and the creation of custom RSpec matchers.
Automates and standardizes the release verification process for Jomonsho projects through tiered linting, type-checking, and multi-level testing.
Streamlines .NET test failure analysis by capturing errors and generating focused rerun filters.
Automates the detection, classification, and resolution of errors encountered during AI-driven development sessions.
Validates the functional integrity of Claude Code skills through automated example execution and systematic testing operations.
Enforces security policies and prevents destructive command execution by validating actions against local configuration files.
Enforces Definition of Done criteria by validating required checks and recording outcomes before task completion.
Ensures code quality and compliance through a standardized two-pass verification and quality assurance process.
Executes and reports on validation, unit, and smoke test suites to ensure code quality before deployment.
Performs rigorous security audits and code quality reviews using OWASP standards, Clean Code principles, and supply chain integrity checks.
Enforces test-driven development workflows by requiring failing tests before implementing behavioral code changes.
Scroll for more results...