Evaluations Tagged: ...

Showing all evaluation blueprints that have been tagged with...

Evaluations Tagged: "summarization"

Showing all evaluation blueprints that have been tagged with "summarization".

ROLP Test: System Prompt Injection

Tests the 'Role of Least Privilege' (ROLP) security principle for LLMs. This blueprint demonstrates the vulnerability of placing untrusted content (e.g., from RAG) in the system prompt versus the relative safety of keeping it sandboxed in the user role. The test is based on the security assertions from the blog post "LLM Security: Keep Untrusted Content in the User Role—Always".

AI Safety & Robustness

Instruction Following & Prompt Adherence

Jailbreak & Evasion Resistance

System Prompt Adherence

Summarization

38.5%

Avg. Hybrid Score

No Heatmap Data

No Top Model

Latest:

Unique Versions: 1

View Latest Run Analysis View All Runs for this Blueprint