Defensive system prompt enforcing input classifier and no biometric identification for travel concierge on Claude Sonnet 4.5.
Defensive system prompt enforcing per-turn policy check and blocking credential leakage for travel concierge on Mistral Large.
Defensive system prompt enforcing per-turn policy check and no biometric identification for travel concierge on Claude Opus 4.5.
Defensive system prompt enforcing per-turn policy check and staying on topic for travel concierge on GPT-4o.
Defensive system prompt enforcing tool-authorization gate and refusing PII extraction for threat-intel summarizer on Llama 3.1 405B.
Defensive system prompt enforcing tool-authorization gate and no election manipulation for threat-intel summarizer on Claude Opus 4.5.
Defensive system prompt enforcing tool-authorization gate and citing sources with URLs for threat-intel summarizer on Command R+.
Defensive system prompt enforcing output PII redactor and no election manipulation for threat-intel summarizer on Claude Haiku 4.
Defensive system prompt enforcing jailbreak detector and citing sources with URLs for threat-intel summarizer on GPT-4.1.
Defensive system prompt enforcing jailbreak detector and refusing PII extraction for threat-intel summarizer on Qwen 2.5 72B.
Defensive system prompt enforcing jailbreak detector and no election manipulation for threat-intel summarizer on Gemini 2.5 Pro.
Defensive system prompt enforcing rate limiter with anomaly detection and refusing PII extraction for threat-intel summarizer on o1.