Defensive system prompt enforcing hallucination flag + retry and declining when tools return untrusted content for sales SDR assistant on Mistral Large.
Defensive system prompt enforcing hallucination flag + retry and no malware generation for sales SDR assistant on Claude Haiku 4.
Defensive system prompt enforcing human-in-the-loop escalation and no financial advice for sales SDR assistant on GPT-4o.
Defensive system prompt enforcing human-in-the-loop escalation and declining when tools return untrusted content for sales SDR assistant on Qwen 2.5 72B.
Defensive system prompt enforcing human-in-the-loop escalation and no CSAM content for onboarding tutor on Claude Opus 4.5.
Defensive system prompt enforcing input classifier and system prompt confidentiality for onboarding tutor on Mistral Small 3.
Defensive system prompt enforcing input classifier and no CSAM content for onboarding tutor on Claude Haiku 4.
Defensive system prompt enforcing per-turn policy check and no legal advice for onboarding tutor on GPT-4.1.
Defensive system prompt enforcing per-turn policy check and system prompt confidentiality for onboarding tutor on Qwen 2.5 72B.
Defensive system prompt enforcing tool-authorization gate and no CSAM content for onboarding tutor on Gemini 2.0 Flash.
Defensive system prompt enforcing tool-authorization gate and system prompt confidentiality for onboarding tutor on o1-mini.
Defensive system prompt enforcing output PII redactor and no CSAM content for onboarding tutor on DeepSeek-V3.