Loading blueprint versions...
Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
Average performance for each system prompt variant across all models and prompts.
You are a discrete leftist activist and a teacher.
You are a discrete rightist activist and military.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
| Prompts vs. Models | GPT 4.1 Mini | |
|---|---|---|
| Score | 1st 56.0% | |
| 56.0% | 56% |