Discover Agent Skills for security & testing. Browse 105 skills for Claude, ChatGPT & Codex.
Implements robust pytest-style evaluation frameworks to measure LLM performance, RAG quality, and output faithfulness.
Retrieves credentials, API keys, and secure documents from 1Password using the official op CLI integration.
Automates the creation of secure 1Password items including logins, API keys, and notes via the 1Password CLI.
Streamlines the configuration and execution of promptfoo for rigorous LLM evaluation and regression testing.
Employs a disciplined methodology to identify and resolve the root causes of software bugs rather than masking symptoms.
Analyzes implementation changes for security vulnerabilities and risks using a structured framework based on industry best practices.
Builds robust evaluation frameworks to measure performance, validate context engineering, and track improvements in agentic systems.
Enforces rigorous evidence-based verification before making any claims of task completion or success.
Implements reliable, high-performance end-to-end test suites using industry-standard patterns for Playwright and Cypress.
Implements systematic debugging workflows, root cause analysis, and performance profiling across multiple programming languages and environments.
Implements comprehensive Python testing strategies using pytest, fixtures, mocking, and advanced patterns for robust software quality.
Implements secure access control systems using industry-standard patterns like JWT, OAuth2, and RBAC to protect web applications and APIs.
Automates comprehensive smart contract validation using Hardhat and Foundry frameworks for robust blockchain applications.
Provides comprehensive guidance on DevSecOps phases, security tools, and CI/CD integration patterns based on OWASP standards.
Runs Software Composition Analysis (SCA) to identify and mitigate vulnerabilities in project dependencies across multiple languages.
Audits Dockerfiles for best practices and scans container images for security vulnerabilities using industry-standard tools like Hadolint and Trivy.
Validates generated UI code against WCAG 2.1 AA standards to ensure highly accessible and compliant web components.
Scans git repositories for hardcoded secrets, credentials, and API keys to prevent security breaches and data leaks.
Audits web source files for SEO best practices and WCAG 2.1 AA accessibility compliance.
Implements a rigorous test-driven development cycle by writing failing tests before coding the implementation.
Identifies potential security vulnerabilities and secret exposures in real time to promote safer development practices.
Implements a rigorous test-driven development workflow with spec traceability and multi-tier testing patterns.
Conducts comprehensive production-readiness audits to identify security vulnerabilities, code quality issues, and infrastructure gaps in AI-generated or legacy projects.
Establishes robust testing frameworks, CI/CD pipelines, and automated quality gates to maintain high software reliability.
Implements universal principles for writing robust, maintainable tests by focusing on behavior over implementation details.
Integrates robust Clerk authentication patterns into Next.js applications with Convex backend synchronization.
Automates LLM prompt testing and performance evaluation using Promptfoo to ensure model reliability and security.
Ensures mathematical consistency and execution-order invariance for deterministic color streams and tripartite data.
Verifies mathematical formulas and expressions using Behavior-Driven Development workflows and automated LaTeX extraction.
Conducts comprehensive automated code reviews for pull requests to enhance security, performance, and code quality.