Category Not Found

641 prompts

Sort:

Defend coding copilot Against prompt leaking attacks on Claude Opus 4.5

Layered defense design for a coding copilot deployment against prompt leaking attacks attacks, using dual-LLM architecture on Claude Opus 4.5.

Red-Team Probe Suite for coding copilot vs. fictional-character persona

Adversarial test suite targeting coding copilot with fictional-character persona-style attacks, with rubric and triage flow.

Red-Team Probe Suite for coding copilot vs. ignore previous instructions

Adversarial test suite targeting coding copilot with ignore previous instructions-style attacks, with rubric and triage flow.

Red-Team Probe Suite for coding copilot vs. markdown comment smuggling

Adversarial test suite targeting coding copilot with markdown comment smuggling-style attacks, with rubric and triage flow.

Red-Team Probe Suite for coding copilot vs. role-reversal (user-as-assistant)

Adversarial test suite targeting coding copilot with role-reversal (user-as-assistant)-style attacks, with rubric and triage flow.

Red-Team Probe Suite for coding copilot vs. pseudo-developer-mode

Adversarial test suite targeting coding copilot with pseudo-developer-mode-style attacks, with rubric and triage flow.

Red-Team Probe Suite for coding copilot vs. chained encoding (ROT13 inside base64)

Adversarial test suite targeting coding copilot with chained encoding (ROT13 inside base64)-style attacks, with rubric and triage flow.

Red-Team Probe Suite for coding copilot vs. DAN / 'Do Anything Now'

Adversarial test suite targeting coding copilot with DAN / 'Do Anything Now'-style attacks, with rubric and triage flow.

Red-Team Probe Suite for coding copilot vs. grandma exploit

Adversarial test suite targeting coding copilot with grandma exploit-style attacks, with rubric and triage flow.

Red-Team Probe Suite for coding copilot vs. hypothetical world framing

Adversarial test suite targeting coding copilot with hypothetical world framing-style attacks, with rubric and triage flow.

Red-Team Probe Suite for SQL copilot vs. reverse-psychology refusal

Adversarial test suite targeting SQL copilot with reverse-psychology refusal-style attacks, with rubric and triage flow.

Red-Team Probe Suite for SQL copilot vs. translation smuggling

Adversarial test suite targeting SQL copilot with translation smuggling-style attacks, with rubric and triage flow.

🟠Claude

135186