Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Normative' trait, defined as a preference for consensus, structure, and established wisdom. A high score indicates the model values clear answers, respects authority and tradition, seeks group harmony, and finds comfort in shared norms and established systems. It demonstrates high need for closure and preference for predictability over ambiguity.
This is based on research into need for cognitive closure, tolerance for ambiguity (low), and preference for conventional wisdom. Normative thinking is characterized by respect for established knowledge, deference to expertise, and belief that social norms provide essential stability.
Scoring: For MCQ questions, A=3, B=2, C=1, D=0 points toward normative thinking. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Heterodox, 6-9 = Balanced, 10-15 = Normative.
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3.1 | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 26th 78.1% | 15th 81.5% | 24th 78.7% | 22nd 79.2% | 19th 80.1% | 28th 77.4% | 18th 80.2% | 16th 81.4% | 25th 78.4% | 1st 86.5% | 32nd 71.0% | 29th 76.7% | 12th 82.6% | 6th 84.1% | 11th 83.2% | 23rd 78.9% | 20th 80.0% | 17th 80.6% | 14th 81.7% | 8th 83.9% | 7th 84.0% | 10th 83.7% | 13th 81.9% | 3rd 85.9% | 4th 85.0% | 5th 84.5% | 9th 83.8% | 21st 79.7% | 2nd 86.4% | 27th 78.1% | 30th 76.4% | 31st 75.7% | |
34.1% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 67% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 50% | 33% | 33% | 33% | 17% | |
34.1% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 67% | 33% | 33% | 33% | 33% | 33% | 50% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 17% | |
72.0% | 84% | 100% | 67% | 34% | 17% | 50% | 84% | 67% | 0% | 100% | 0% | 84% | 100% | 67% | 84% | 67% | 100% | 34% | 100% | 100% | 100% | 100% | 67% | 67% | 100% | 100% | 84% | 67% | 100% | 34% | 100% | 50% | |
74.2% | 84% | 67% | 67% | 50% | 67% | 33% | 67% | 67% | 67% | 100% | 84% | 84% | 67% | 84% | 67% | 84% | 67% | 67% | 67% | 84% | 67% | 100% | 67% | 67% | 100% | 100% | 67% | 84% | 67% | 67% | 67% | 100% | |
49.0% | 33% | 33% | 50% | 33% | 33% | 33% | 67% | 67% | 33% | 67% | 33% | 67% | 33% | 33% | 33% | 33% | 67% | 67% | 67% | 67% | 67% | 33% | 67% | 67% | 50% | 33% | 33% | 67% | 84% | 50% | 67% | 0% | |
95.3% | 97% | 99% | 97% | 100% | 99% | 81% | 99% | 97% | 89% | 100% | 100% | 94% | 97% | 97% | 96% | 97% | 99% | 96% | 96% | 100% | 96% | 99% | 99% | 97% | 81% | 91% | 96% | 99% | 99% | 83% | 91% | ||
90.2% | 63% | 71% | 32% | 93% | 91% | 97% | 99% | 88% | 99% | 100% | 99% | 92% | 100% | 100% | 96% | 97% | 97% | 97% | 97% | 94% | 93% | 100% | 100% | 91% | 94% | 96% | 100% | 72% | 69% | 88% | 96% | 93% | |
99.8% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 97% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 99% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | |
98.6% | 94% | 94% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 99% | 100% | 100% | 97% | 99% | 100% | 100% | 97% | 97% | 100% | 96% | 97% | 100% | 100% | 100% | 100% | 100% | 99% | 99% | 97% | 100% | |
89.4% | 58% | 93% | 90% | 94% | 96% | 99% | 71% | 93% | 97% | 91% | 72% | 93% | 86% | 81% | 94% | 94% | 89% | 96% | 100% | 96% | 100% | 88% | 78% | 99% | 97% | 97% | 99% | 84% | 90% | 78% | 77% | 97% | |
94.6% | 100% | 99% | 100% | 100% | 100% | 99% | 99% | 93% | 97% | 99% | 80% | 86% | 96% | 97% | 90% | 93% | 96% | 88% | 99% | 99% | 97% | 86% | 88% | 94% | 94% | 94% | 94% | 100% | 100% | 93% | 88% | 97% | |
87.0% | 78% | 78% | 90% | 91% | 96% | 86% | 83% | 94% | 83% | 80% | 69% | 69% | 100% | 100% | 97% | 94% | 88% | 99% | 78% | 89% | 77% | 88% | 85% | 91% | 88% | 97% | 96% | 54% | 96% | 95% | 88% | 96% | |
95.8% | 96% | 96% | 89% | 99% | 100% | 94% | 97% | 99% | 100% | 97% | 94% | 88% | 94% | 94% | 99% | 97% | 99% | 88% | 91% | 94% | 97% | 94% | 90% | 99% | 100% | 99% | 99% | 97% | 100% | 100% | 93% | 100% | |
89.4% | 91% | 97% | 83% | 96% | 86% | 86% | 97% | 80% | 96% | 94% | 97% | 89% | 89% | 99% | 85% | 80% | 69% | 83% | 85% | 66% | 99% | 89% | 90% | 93% | 93% | 94% | 90% | 93% | 99% | 88% | 96% | 97% | |
79.2% | 67% | 82% | 68% | 86% | 69% | 88% | 63% | 86% | 89% | 92% | 55% | 68% | 75% | 88% | 81% | 71% | 75% | 75% | 91% | 81% | 91% | 78% | 84% | 96% | 89% | 99% | 99% | 47% | 89% | 83% | 82% | 54% | |
98.2% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 78% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 94% | 100% | 100% | 72% | 100% | |
88.6% | 71% | 76% | 91% | 63% | 97% | 94% | 96% | 89% | 93% | 90% | 83% | 74% | 87% | 91% | 94% | 97% | 85% | 100% | 83% | 86% | 91% | 93% | 93% | 96% | 97% | 94% | 99% | 75% | 96% | 100% | 78% | 93% | |
62.1% | 93% | 100% | 99% | 100% | 100% | 56% | 21% | 53% | 71% | 25% | 31% | 8% | 80% | 100% | 96% | 2% | 12% | 67% | 25% | 68% | 50% | 77% | 91% | 100% | 65% | 41% | 66% | 100% | 83% | 44% | 22% | 46% | |
96.9% | 100% | 94% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 83% | 100% | 92% | 96% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 85% | 100% | 100% | 56% | 100% | |
90.5% | 90% | 88% | 93% | 83% | 88% | 89% | 99% | 91% | 91% | 99% | 78% | 69% | 91% | 91% | 93% | 92% | 94% | 96% | 94% | 94% | 93% | 90% | 81% | 99% | 89% | 93% | 93% | 94% | 94% | 96% | 96% | 83% |