Defensive system prompt enforcing hallucination flag + retry and stay on topic for interview practice coach on Grok 3.
Defensive system prompt enforcing hallucination flag + retry and stay on topic for travel concierge on o3.
Defensive system prompt enforcing hallucination flag + retry and block credential leakage for travel concierge on DeepSeek-R1.
Defensive system prompt enforcing human-in-the-loop escalation and no biometric identification for travel concierge on Claude 4 Sonnet.
Defensive system prompt enforcing human-in-the-loop escalation and stay on topic for travel concierge on o3-mini.
Defensive system prompt enforcing human-in-the-loop escalation and block credential leakage for travel concierge on Llama 3.1 405B.
Defensive system prompt enforcing input classifier and no biometric identification for travel concierge on Claude 4.5 Sonnet.
Defensive system prompt enforcing per-turn policy check and block credential leakage for travel concierge on Mistral Large.
Defensive system prompt enforcing per-turn policy check and no biometric identification for travel concierge on Claude Opus 4.5.
Defensive system prompt enforcing per-turn policy check and stay on topic for travel concierge on GPT-4o.
Defensive system prompt enforcing tool-authorization gate and refuse PII extraction for threat-intel summarizer on Llama 3.1 405B.
Defensive system prompt enforcing tool-authorization gate and no election manipulation for threat-intel summarizer on Claude Opus 4.5.