Testing Skills
Our directory lists 67 AI agent skills related to testing: skills that mention testing in their name or description.
Updated June 2026
- 3-verify-pr-fix: Check out a candidate PR locally, restart the MCP server on the PR branch, re-run the exact same MCP tool call that failed in /2-repro-issue, diff the outputs, and audit the fix for locale-independence, DOM-stability, and scope per CLAUDE.md scraping rules. Use when the user says "verify PR #N", "does this PR fix #M", "check #N locally", "test the fix in #N", or asks whether a candidate PR actually solves a previously-reproduced issue. Assumes /2-repro-issue has already captured the on-main baseline at /tmp/repro-issue-<linked>-main.json; if not, run /2-repro-issue first.
- 316-frameworks-spring-mongodb-migrations-mongock: Use when you need to add or review Mongock MongoDB data migrations in a Spring Boot application, including Maven coordinates, Spring Data MongoDB drivers, migration scan packages, @ChangeUnit classes, lock/transaction settings, and Testcontainers verification. This should trigger for requests such as Add Mongock migrations in Spring Boot; Review Spring MongoDB data migrations; Configure Mongock change units for Spring Data MongoDB. Part of cursor-rules-java project. (Apache-2.0)
- 512-frameworks-micronaut-data: Use when you need data access with Micronaut Data: @MappedEntity, CrudRepository/PageableRepository, @Query with parameters, @Transactional services, projections, @Version, and @MicronautTest with TestPropertyProvider and Testcontainers. For raw java.sql access without generated repositories, use @511-frameworks-micronaut-jdbc. This should trigger for requests such as Review or implement Micronaut Data repositories and entities; Add transactions, pagination, or projections in Micronaut persistence layer. Part of cursor-rules-java project. (Apache-2.0)
- a-b-test-config-creator (MIT)
- aads-architect: Design architecture and contract changes for AADS-ULoRA with explicit module boundaries, compatibility strategy, and testable acceptance criteria. Use for interface, config-schema, data-flow, and cross-module redesign decisions before implementation.
- ab-test-analysis: Analyze A/B test results with statistical significance, sample size validation, confidence intervals, and ship/extend/stop recommendations. Use when evaluating experiment results, checking if a test reached significance, interpreting split test data, or deciding whether to ship a variant.
- ab-test-analyzer (MIT)
- ab-test-setup: When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," "hypothesis," "conversion experiment," "statistical significance," or "test this." For tracking implementation, see analytics-tracking. (MIT)
- ab-test-setup: Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.
- ab-test-setup: When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," "hypothesis," "conversion experiment," "statistical significance," or "test this." For tracking implementation, see analytics-tracking. (MIT)
- ab-test-setup: Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.
- ab-test-setup: Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.
- ab-test-setup: Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.
- ab-test-setup: When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," "hypothesis," "conversion experiment," "statistical significance," or "test this." For tracking implementation, see analytics-tracking. (MIT)
- ab-test-setup: Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.
- ab-test-setup: Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.
- ab-test-setup: Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.
- ab-test-setup: Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.
- abaqus-dynamic-analysis: Complete workflow for dynamic analysis. Use when user mentions impact, crash, drop test, transient, or time-varying response. Handles explicit and implicit dynamics.
- abo-pr-writer: Draft or update Audiobook Organizer PR descriptions with issue closure, summary, tests, docs/changelog status, and follow-up notes.
- abstraction-concrete-examples: Builds structured abstraction ladders that translate high-level principles into concrete, actionable examples across 3-5 levels. Bridges communication gaps, reveals hidden assumptions, and tests whether abstract ideas work in practice. Use when explaining concepts at different expertise levels, moving between abstract principles and concrete implementation, identifying edge cases by testing ideas against scenarios, designing layered documentation, decomposing complex problems into actionable steps, or bridging strategy-execution gaps.
- ac-expander: Turn vague Acceptance Criteria into measurable checks and test assertions
- acc-create-domain-event: Generates DDD Domain Events for PHP 8.5. Creates immutable event records with metadata, past-tense naming. Includes unit tests.
- acc-create-entity: Generates DDD Entities for PHP 8.5. Creates identity-based objects with behavior, state transitions, and invariant protection. Includes unit tests.
- acc-create-factory: Generates DDD Factory for PHP 8.5. Creates factories for complex domain object instantiation with validation and encapsulated creation logic. Includes unit tests.
- acc-create-rate-limiter: Generates Rate Limiter pattern for PHP 8.5. Creates request throttling with token bucket, sliding window, and fixed window algorithms. Includes unit tests.
- accepting-stories: Guides product owner through systematic acceptance testing. Use when verifying acceptance criteria, checking story completion, or conducting final review before marking a story as done.
- act: Standard-tier development workflow. Orchestrates specialized agents (Groucho, Chico, Harpo, Zeppo, Gummo) through the implementation pipeline defined in CLAUDE.md. Use for multi-file features, new functionality, or any work requiring design, review, and testing gates.
- active-directory-attacks: Provide comprehensive techniques for attacking Microsoft Active Directory environments. Covers reconnaissance, credential harvesting, Kerberos attacks, lateral movement, privilege escalation, and domain dominance for red team operations and penetration testing.
- active-directory-attacks: Provide comprehensive techniques for attacking Microsoft Active Directory environments. Covers reconnaissance, credential harvesting, Kerberos attacks, lateral movement, privilege escalation, and domain dominance for red team operations and penetration testing.
- active-directory-attacks: Provide comprehensive techniques for attacking Microsoft Active Directory environments. Covers reconnaissance, credential harvesting, Kerberos attacks, lateral movement, privilege escalation, and domain dominance for red team operations and penetration testing.
- ad-angle-multiplier: Expand a core idea into multiple distinct ad angles. Trigger on "more ads", "creative testing", "scale campaigns". (MIT)
- ad-angle-multiplier: Expand a core idea into multiple distinct ad angles. Trigger on "more ads", "creative testing", "scale campaigns". (MIT)
- ad-creative: Create, iterate, and scale paid ad creative for Google Ads, Meta, LinkedIn, TikTok, and similar platforms. Use when generating headlines, descriptions, primary text, or large sets of ad variations for testing and performance optimization.
- ad-creative: When the user needs to generate, iterate, or scale ad creative for paid advertising. Use when they say 'write ad copy,' 'generate headlines,' 'create ad variations,' 'bulk creative,' 'iterate on ads,' 'ad copy validation,' 'RSA headlines,' 'Meta ad copy,' 'LinkedIn ad,' or 'creative testing.' This is pure creative production, distinct from paid-ads (campaign strategy). Use ad-creative when you need the copy, not the campaign plan. (MIT)
- ad-creative: Create, iterate, and scale paid ad creative for Google Ads, Meta, LinkedIn, TikTok, and similar platforms. Use when generating headlines, descriptions, primary text, or large sets of ad variations for testing and performance optimization.
- ad-creative: When the user needs to generate, iterate, or scale ad creative for paid advertising. Use when they say 'write ad copy,' 'generate headlines,' 'create ad variations,' 'bulk creative,' 'iterate on ads,' 'ad copy validation,' 'RSA headlines,' 'Meta ad copy,' 'LinkedIn ad,' or 'creative testing.' This is pure creative production, distinct from paid-ads (campaign strategy). Use ad-creative when you need the copy, not the campaign plan. (MIT)
- ad-creative: When the user needs to generate, iterate, or scale ad creative for paid advertising. Use when they say 'write ad copy,' 'generate headlines,' 'create ad variations,' 'bulk creative,' 'iterate on ads,' 'ad copy validation,' 'RSA headlines,' 'Meta ad copy,' 'LinkedIn ad,' or 'creative testing.' This is pure creative production, distinct from paid-ads (campaign strategy). Use ad-creative when you need the copy, not the campaign plan. (MIT)
- add_platform.implement: Creates platform adapter, templates, tests with 100% coverage, and README documentation. Use after adding hook capabilities.
- add-backend-testing: Add backend integration testing with Vitest to an existing app. Sets up isolated test database schema and writes tests for tRPC routers.
- add-component: Add a new Vue component to @indielayer/ui with themes, tests, docs, and registry exports. Use when creating a new component, adding X-prefixed components, or scaffolding under packages/ui/src/components.
- add-feature: Implement new feature with self-testing loop until 100% pass
- add-torch-shapes-example: Use when adding a new PyTorch model to Pyrefly's shape-tracking example corpus under tensor-shapes/pyrefly-torch-stubs/examples, i.e. importing a model as a tested, corpus-quality reference port. This is maintainer-facing fbsource work. For porting your own model elsewhere, use the porting skill directly; for fixing a wrong/missing shape rule, use modify-shaped-array-dsl.
- adding-models: Guide for adding new LLM models to Letta Code. Use when the user wants to add support for a new model, needs to know valid model handles, or wants to update the model configuration. Covers models.json configuration, CI test matrix, and handle validation.
- admin-dashboard-qa: Use this skill when implementing, modifying, or fixing the admin dashboard (admin-dashboard-v2). Triggers for tasks involving dashboard UI, components, pages, features, hooks, or API integration. Orchestrates a rigorous QA workflow with PM review, use case writing, testing, and bug fixing cycles.
- ado-assign-testing: Distribute reviewed work items to team members for testing
- adobe-load-scale: Implement load testing, auto-scaling, and capacity planning for Adobe (MIT)
- Advanced Clean Hexagonal Architecture: Apply Clean Architecture and Hexagonal (Ports & Adapters) patterns for domain isolation and testability. Use when designing system boundaries, creating ports/adapters, or structuring domain-driven applications.
- Advanced Testability Ai Ergonomic: Design code for testability and AI/LLM ergonomics with explicit contracts and observable patterns. Use when optimizing code for AI tools, improving testability, or making codebases LLM-friendly.
- Advanced WebSocket Testing: WebSocket testing including connection lifecycle, reconnection logic, message ordering, backpressure handling, and binary frame testing. (MIT)
- af-bdd-expertise: Use when transforming requirements into Markdown scenario specifications, writing BDD scenarios, enforcing glossary compliance, or designing comprehensive test coverage. Covers scenario structure, test type classification, and acceptance criteria patterns.
- agent-browser: Browser automation CLI for UI testing, E2E tests, and page interaction. Invoke when user wants to test UI, click buttons, fill forms, or take screenshots.
- agent-browser: Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
- agent-browser: Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.
- agent-browser: Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.
- agent-development-ext: MUST load together with `agent-development` in GBIG context. Adds the framework's universal agent format (analyst, architect, developer, reviewer, tester, explorer), model tier mapping, and 1C BSL specifics.
- agent-evaluation: Testing and benchmarking LLM agents including behavioral testing,
- agent-evaluation: Testing and benchmarking LLM agents including behavioral testing,
- agent-evaluation: Testing and benchmarking LLM agents including behavioral testing,
- agent-evaluation: Testing and benchmarking LLM agents including behavioral testing,
- agent-evaluation: Testing and benchmarking LLM agents including behavioral testing,
- agent-performance-engineer: Expert performance engineer specializing in system optimization, bottleneck identification, and scalability engineering. Masters performance testing, profiling, and tuning across applications, databases, and infrastructure with focus on achieving optimal response times and resource efficiency.
- agent-risk-manager: Expert risk manager specializing in comprehensive risk assessment, mitigation strategies, and compliance frameworks. Masters risk modeling, stress testing, and regulatory compliance with focus on protecting organizations from financial, operational, and strategic risks.
- agent-skill-linter: Lint a codebase on code quality, best practices, design principles, build and test status, per-test speed (slow at or above 5s), cyclomatic complexity, and expressive naming, then render the results as a modern web report. Use when the user runs /lint or /lint-site, or asks to lint, score, or audit the quality of a project.
- agent-trajectory-safety-risk-diagnosis: Use this skill when a user wants evaluator data for judging whether an agent’s actions were safe or unsafe based on its interaction record with tools or environments. Trigger it when ordinary users say things like 'check if the agent did something dangerous', 'judge whether the action trail is risky', 'make safety-monitor data from agent logs', or 'find cases where the agent looked helpful but caused harm'. Plain-language examples include: 'evaluate whether the trajectory is safe', 'make judge data for risky tool use', 'analyze unsafe multi-step agent actions', and 'test if the evaluator can spot hidden safety problems in logs'.
- agentic-kaggle-skill: Kaggle-first end-to-end competition workflow for scored submissions. Use when Codex must run Kaggle or competitive ML workflows through scored submission, including code competitions, validation, metrics, policy-safe public notebook/discussion intel, tabular/text/image modeling, tuning, ensembling/stacking, proactive multi-notebook architectures, producer notebooks that train models and export private Kaggle artifact datasets, downstream consumer notebooks, Kaggle GPU offload, kagglehub access, hidden-test debugging, and public score retrieval.
- agentkit: World AgentKit verification flow. Use when the user wants to test AgentKit access, verify a human-backed agent, or mentions agentkit_verify.