Category Not Found

641 prompts


Defend customer support agent Against prompt leaking attacks on Claude 3.7 Sonnet

Layered defense design for a customer support agent deployment against prompt leaking attacks, using privilege separation between tool tiers on Claude 3.7 Sonnet.

Defend customer support agent Against DAN-style persona attack on o3

Layered defense design for a customer support agent deployment against DAN-style persona attacks, using re-prompting with quoted user input on o3.

Defend customer support agent Against markdown image exfiltration on Llama 3.3 70B

Layered defense design for a customer support agent deployment against markdown image exfiltration attacks, using re-prompting with quoted user input on Llama 3.3 70B.

Defend customer support agent Against instruction smuggling in URLs on Claude 4 Sonnet

Layered defense design for a customer support agent deployment against instruction smuggling in URLs, using signed instruction boundaries on Claude 4 Sonnet.

Defend customer support agent Against PDF/OCR-layer injection on Grok 3

Layered defense design for a customer support agent deployment against PDF/OCR-layer injection attacks, using signed instruction boundaries on Grok 3.

Defend customer support agent Against context window overflow attack on Llama 3.1 405B

Layered defense design for a customer support agent deployment against context window overflow attacks, using content provenance tagging on Llama 3.1 405B.

Defend customer support agent Against recursive self-instruction on Claude Opus 4.5

Layered defense design for a customer support agent deployment against recursive self-instruction attacks, using retrieval trust scoring on Claude Opus 4.5.

Defend customer support agent Against indirect injection via RAG documents on Command R+

Layered defense design for a customer support agent deployment against indirect injection via RAG documents, using retrieval trust scoring on Command R+.

Defend customer support agent Against role-play jailbreak on Mistral Large

Layered defense design for a customer support agent deployment against role-play jailbreak attacks, using a structured function-call-only interface on Mistral Large.

Defend customer support agent Against multi-turn manipulation on Claude Haiku 4

Layered defense design for a customer support agent deployment against multi-turn manipulation attacks, using a structured function-call-only interface on Claude Haiku 4.

Defend customer support agent Against data exfiltration via summaries on GPT-4o

Layered defense design for a customer support agent deployment against data exfiltration via summaries, using hash-based prompt pinning on GPT-4o.

Defend customer support agent Against prompt leaking attacks on Qwen 2.5 72B

Layered defense design for a customer support agent deployment against prompt leaking attacks, using hash-based prompt pinning on Qwen 2.5 72B.
