Discover Agent Skills for security & testing. Browse 105skills for Claude, ChatGPT & Codex.
Validates feature implementation against acceptance criteria and automated test results to ensure software requirements are met.
Automates the creation of comprehensive test suites following Test-Driven Development principles to ensure high code quality and feature coverage.
Conducts deep, multi-perspective technical audits and research by orchestrating Claude, Gemini, and Codex AI systems.
Validates Claude Code skill functionality through automated example execution, integration checks, and systematic regression testing.
Generates comprehensive unit, integration, and E2E tests using industry-standard patterns and robust edge case detection.
Verifies critical claims and factual assertions through source-backed evidence and recursive confidence-building loops.
Generates comprehensive test suites following Test-Driven Development principles to ensure robust feature coverage and automated validation.
Safeguards autonomous coding operations using command validation, allowlists, and real-time execution hooks.
Protects autonomous coding environments by validating commands through customizable allowlists and execution hooks.
Automates the validation of Claude Code skills within isolated environments to identify hidden dependencies and ensure clean execution.
Implements software features using an incremental Test-Driven Development (TDD) workflow and automated git commits.
Validates Claude Code skills through hands-on scenario testing, example verification, and usability assessments.
Enforces a safety layer for terminal command execution by requiring explicit user confirmation for destructive or high-impact operations.
Implements comprehensive AI safety policies and content moderation using Amazon Bedrock's multi-layered safeguard framework.
Validates code quality through a comprehensive 5-layer pyramid using independent LLM-as-judge evaluation for production readiness.
Automates AI agent quality testing and monitoring using built-in metrics and custom LLM-as-judge evaluation patterns.
Performs isolated UI component testing for React, Vue, and Svelte using Playwright's experimental component testing framework.
Implements a 5-layer verification pyramid to ensure production-grade code quality through automated checks and LLM-as-judge evaluation.
Executes an intensive code quality suite combining ESLint, TypeScript compilation, and unused code detection to maintain high development standards.
Validates Claude Code skill functionality and usability through comprehensive scenario-based testing operations.
Automates visual regression testing and UI consistency checks using Chromatic, Lost Pixel, and Playwright snapshots.
Emulates diverse mobile devices and responsive environments using Playwright to ensure seamless cross-platform web performance.
Enforces standardized testing protocols and best practices to ensure high-quality, maintainable test suites.
Enforces a strict test-driven development cycle using a multi-agent red-green-refactor orchestration.
Protects autonomous coding environments by enforcing command allowlists, filesystem permissions, and execution sandboxing.
Enforces a rigorous, root-cause-first framework for resolving bugs by eliminating guesswork and prioritizing evidence-based investigation.
Enforces standardized pytest best practices including advanced fixtures, mocking strategies, and async test patterns.
Audits and refactors React applications using the Bulletproof React architecture to ensure scalable, maintainable, and production-grade codebases.
Ensures all code changes and task completions are backed by fresh, empirical evidence before making success claims.
Monitors session health and codebase quality in real-time, automatically triggering focused sub-agents for code reviews and task planning.
Scroll for more results...