Category Not Found

3874 prompts

Sort:

Defend customer support agent Against role-play jailbreak on GPT-4o

Layered defense design for a customer support agent deployment against role-play jailbreak attacks, using retrieval trust scoring on GPT-4o.

Defend customer support agent Against multi-turn manipulation on Mistral Small 3

Layered defense design for a customer support agent deployment against multi-turn manipulation attacks, using structured function-call-only interface on Mistral Small 3.

Defend customer support agent Against data exfiltration via summaries on Gemini 2.5 Pro

Layered defense design for a customer support agent deployment against data exfiltration via summaries attacks, using structured function-call-only interface on Gemini 2.5 Pro.

Defend customer support agent Against system prompt extraction on GPT-4.1

Layered defense design for a customer support agent deployment against system prompt extraction attacks, using hash-based prompt pinning on GPT-4.1.

Defend customer support agent Against DAN-style persona attack on Qwen 2.5 72B

Layered defense design for a customer support agent deployment against DAN-style persona attack attacks, using hash-based prompt pinning on Qwen 2.5 72B.

Defend customer support agent Against markdown image exfiltration on Gemini 2.0 Flash

Layered defense design for a customer support agent deployment against markdown image exfiltration attacks, using output schema enforcement on Gemini 2.0 Flash.

Defend customer support agent Against instruction smuggling in URLs on GPT-4o-mini

Layered defense design for a customer support agent deployment against instruction smuggling in URLs attacks, using output schema enforcement on GPT-4o-mini.

Defend customer support agent Against PDF/OCR-layer injection on o1-mini

Layered defense design for a customer support agent deployment against PDF/OCR-layer injection attacks, using spotlighting (delimiter marking) on o1-mini.

Defend customer support agent Against context window overflow attack on DeepSeek-V3

Layered defense design for a customer support agent deployment against context window overflow attack attacks, using spotlighting (delimiter marking) on DeepSeek-V3.

Defend customer support agent Against direct prompt injection on Claude 3.7 Sonnet

Layered defense design for a customer support agent deployment against direct prompt injection attacks, using input sanitization on Claude 3.7 Sonnet.

Defend customer support agent Against indirect injection via RAG documents on o3

Layered defense design for a customer support agent deployment against indirect injection via RAG documents attacks, using input sanitization on o3.

Defend customer support agent Against role-play jailbreak on DeepSeek-R1

Layered defense design for a customer support agent deployment against role-play jailbreak attacks, using output content filter on DeepSeek-R1.

🟠Claude

321374