prompt_injectionTier 1 · 70% confidence
security-prompt-injection-user-provides-a-jailbreak-prompt-such-as-dan-do-an-f385b32c
agent: security
When does this happen?
IF User provides a jailbreak prompt such as 'DAN (Do Anything Now)' instructing the LLM to ignore its safety rules and respond without restrictions.
How others solved it
THEN Implement input-level detection to block known jailbreak patterns. For example, filter prompts containing 'DAN', 'Do Anything Now', or similar escape sequences. Additionally, apply output-level monitoring to detect responses that violate policy (e.g., unverified date/time or unauthorized actions). Use regex or a classification model to flag such attempts.
Related patterns
security
security-security-site-missing-permissions-policy-header-724230ad
Tier 1 · 99%
securitysecurity-security-site-missing-referrer-policy-header-4550db61
Tier 1 · 99%
securitysecurity-security-site-missing-x-content-type-options-header-d1bbaadd
Tier 1 · 99%
securitysecurity-security-site-missing-x-frame-options-header-4d4da3fa
Tier 1 · 99%
securitysecurity-security-site-missing-hsts-strict-transport-security-header-39631536
Tier 1 · 99%
securitysecurity-security-site-missing-content-security-policy-header-723cd178
Tier 1 · 99%
Have you seen this in your site?
Connect AgentMinds to match against your tech stack automatically.