Prompt Engineering Expert Claude

Prompt System Red-Team and Guardrail Builder

Red-team a reusable prompt system, identify failure modes, unsafe outputs, ambiguity, missing constraints, and create guardrails, tests, and improvement rules.

Published: Jun 22, 2026 · Updated: Jun 22, 2026

Browse more prompts

Best forAnalysis

ToolClaude

DifficultyExpert

Copied2 times

You are an expert prompt engineer and AI system evaluator specializing in prompt red-teaming, guardrail design, failure-mode analysis, unsafe output detection, ambiguity review, prompt evaluation, and reusable AI workflow quality.

Your task is to evaluate a reusable prompt system and improve its safety, clarity, reliability, usefulness, and output quality before it is published, reused, or deployed in a real workflow.

Context:
Prompt to evaluate: [Prompt to evaluate]
Intended users: [Intended users]
Intended task: [Intended task]
Expected output: [Expected output]
Tools or models used: [Tools or models used]
Known failure cases: [Known failure cases]
Sensitive risks: [Sensitive risks]
Business or user context: [Business or user context]
Constraints: [Constraints]
Definition of done: [Definition of done]

Important constraints:

* Do not only praise the prompt.
* Do not assume the prompt is safe, complete, or clear.
* Look for ambiguity, missing context, weak instructions, unsafe assumptions, overbroad requests, and poor verification.
* Identify where the prompt may produce vague, misleading, low-quality, harmful, privacy-risky, or unsupported outputs.
* Do not add unnecessary complexity.
* Improve the prompt while keeping it practical and reusable.
* Include realistic red-team test cases.
* Include guardrails that are specific to the prompt’s intended task.
* Separate critical issues from minor improvements.
* If information is missing, state the assumption clearly before giving recommendations.

Task:

1. Summarize the prompt’s intended job.
Explain:

* What the prompt is trying to help the user do
* Who it is designed for
* What output it should produce
* What decisions or actions may depend on the output
* Where quality, safety, accuracy, or reliability matters most

2. Review ambiguity and unclear instructions.
Identify:

* Vague wording
* Missing definitions
* Unclear success criteria
* Confusing role instructions
* Weak task boundaries
* Unclear output expectations
* Missing examples or constraints
* Instructions that could be interpreted in multiple ways

3. Identify missing context.
List the missing information that would improve the prompt, such as:

* User goal
* Audience
* Input format
* Source material
* Risk level
* Tool or model constraints
* Legal, financial, medical, safety, privacy, or business constraints
* Output format
* Verification requirements
* Human review requirements

4. Identify failure modes.
Analyze how the prompt could fail, including:

* Generic output
* Hallucinated claims
* Unsupported recommendations
* Overconfident answers
* Missing edge cases
* Weak reasoning
* Poor formatting
* Unsafe instructions
* Privacy leaks
* Misuse by the user
* Inconsistent output across runs
* Failure to ask for missing information

For each failure mode, explain the likely cause and the potential impact.

5. Identify misuse and sensitive-risk scenarios.
Review whether the prompt could be misused or produce risky outputs in areas such as:

* Personal data
* Financial decisions
* Legal decisions
* Health or safety
* Security
* Customer communication
* Public claims
* Hiring or career decisions
* Business-critical operations
* Automated actions without human review

6. Create red-team test cases.
Create practical test cases that challenge the prompt.

For each test case, include:

* Test name
* Test input
* What could go wrong
* Expected safe behavior
* What the prompt should refuse, question, qualify, or verify
* How to judge whether the prompt passed the test

7. Recommend guardrails.
Create specific guardrails for:

* Missing information
* Unsupported claims
* Sensitive topics
* Privacy and confidential data
* High-impact decisions
* Human review
* Output quality
* Evidence and citations, where relevant
* Formatting and structure
* Verification before final output

8. Rewrite weak prompt sections.
Rewrite the parts of the prompt that need improvement.

Include:

* Improved role instruction
* Improved task instruction
* Improved context placeholders
* Improved constraints
* Improved output format
* Improved verification section
* Improved final instruction

Do not rewrite the entire prompt unless the whole prompt is weak. Focus on the sections that will create the highest improvement.

9. Create an output verification checklist.
Build a checklist the user can apply after the AI produces an answer.

The checklist should confirm:

* The output follows the requested structure
* The output uses the provided context
* Assumptions are clearly labeled
* Missing information is identified
* Sensitive risks are handled carefully
* Claims are not invented
* Recommendations are practical
* Human review is included where needed
* The output is safe to use for the intended purpose

10. Provide final prompt improvement recommendations.
Summarize:

* The most important weakness
* The highest-risk failure mode
* The most important guardrail to add
* The strongest rewrite recommendation
* Whether the prompt is ready to use, needs revision, or should not be used yet

Output format:

## Prompt Purpose Summary

## Ambiguity Review

## Missing Context

## Failure Modes

## Misuse and Sensitive-Risk Scenarios

## Red-Team Test Cases

## Guardrail Recommendations

## Rewritten Prompt Sections

## Output Verification Checklist

## Final Recommendations

Verification:
Before finalizing, check that:

* The review is critical, not only complimentary.
* Failure modes are specific to the prompt being evaluated.
* Red-team test cases are realistic.
* Guardrails are practical and not generic.
* Rewritten sections improve clarity and safety.
* Missing information is clearly identified.
* High-risk outputs include human review.
* The final recommendation clearly states whether the prompt is ready to use, needs revision, or should not be used yet.

Begin the prompt system red-team and guardrail review now.

Variables to Replace

Prompt to evaluate
Intended users
Intended task
Expected output
Tools or models used
Known failure cases
Sensitive risks
Business or user context
Constraints
Definition of done

Best use cases

Prompt System Red-Teaming
Guardrail Design
Failure Mode Testing
Unsafe Output Review
Reusable Prompt QA
Prompt Evaluation Checklist

20Views

How to Use This Prompt

Paste the reusable prompt you want to test, along with its intended users, task, expected output, known failure cases, risks, constraints, and definition of done. Use the output to strengthen the prompt before publishing, reusing, or deploying it in a real workflow.

Example Use Case

A team has a reusable AI customer support prompt and wants to prevent unsafe replies, vague answers, privacy leaks, unsupported claims, and inconsistent escalation behavior before using it with real customer conversations.

Prompt System Red-Team and Guardrail Builder

Variables to Replace

How to Use This Prompt

Example Use Case

Build stronger AI systems

Related Prompts

Prompt Evaluation Harness Builder for Reusable AI Prompts

AI Output Verification Checklist for Reliable Use

Multi-Agent Workflow Design Prompt

Prompt Failure Analysis Prompt

Reusable Prompt Template Builder Prompt

System Prompt Hardening Prompt

Variables to Replace

How to Use This Prompt

Example Use Case

Share this prompt

Build stronger AI systems

Related Prompts

Prompt Evaluation Harness Builder for Reusable AI Prompts

AI Output Verification Checklist for Reliable Use

Multi-Agent Workflow Design Prompt

Prompt Failure Analysis Prompt

Reusable Prompt Template Builder Prompt

System Prompt Hardening Prompt