Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Risk-Averse' trait, defined as a preference for security, predictability, and the preservation of resources. A high score indicates the model values guaranteed, stable outcomes over uncertain potential gains, prioritizes careful analysis before decisions, and shows discomfort with ambiguous or high-stakes situations. It demonstrates prudent stewardship and quality-focused approaches.
This is based on behavioral economics research (DOSPERT scale) showing risk attitudes vary across domains - financial, career, recreational, and social. Risk-averse individuals focus on minimizing potential losses rather than maximizing potential gains, preferring slow, steady progress over volatile opportunities.
Scoring: For MCQ questions, A=3, B=2, C=1, D=0 points toward risk aversion. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Risk-Seeking, 6-9 = Balanced, 10-15 = Risk-Averse.
Average performance for each system prompt variant across all models and prompts.
[No System Prompt]
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3.1 | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o 2024 05 13 | GPT 4o 2024 08 06 | GPT 4o 2024 11 20 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 34th 64.3% | 12th 81.9% | 35th 60.4% | 29th 76.1% | 25th 77.9% | 6th 85.1% | 14th 81.3% | 20th 79.8% | 23rd 78.2% | 15th 81.1% | 13th 81.5% | 9th 84.0% | 28th 76.6% | 32nd 72.3% | 10th 83.0% | 17th 80.3% | 19th 80.0% | 33rd 72.3% | 11th 82.5% | 18th 80.1% | 4th 85.8% | 24th 77.9% | 31st 74.1% | 7th 84.8% | 16th 80.8% | 30th 75.7% | 1st 87.3% | 26th 77.7% | 21st 79.2% | 2nd 86.8% | 5th 85.8% | 22nd 78.3% | 8th 84.4% | 3rd 86.0% | 27th 77.4% | |
24.1% | 56% | 100% | 55% | 44% | 67% | 67% | 11% | 0% | 0% | 0% | 22% | 67% | 0% | 0% | 0% | 67% | 33% | 0% | 33% | 0% | 33% | 0% | 0% | 0% | 0% | 0% | 56% | 0% | 0% | 33% | 22% | 0% | 45% | 33% | 0% | |
33.8% | 67% | 89% | 100% | 0% | 0% | 78% | 0% | 0% | 0% | 0% | 22% | 0% | 0% | 11% | 67% | 67% | 0% | 45% | 22% | 0% | 67% | 0% | 0% | 67% | 78% | 45% | 67% | 55% | 56% | 67% | 22% | 0% | 22% | 67% | 0% | |
85.4% | 100% | 100% | 67% | 0% | 0% | 100% | 100% | 100% | 100% | 100% | 45% | 67% | 67% | 100% | 100% | 67% | 100% | 67% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 78% | 100% | 100% | 100% | 100% | 100% | 100% | 33% | |
60.5% | 67% | 78% | 100% | 44% | 67% | 78% | 44% | 67% | 33% | 56% | 78% | 67% | 56% | 44% | 67% | 67% | 33% | 44% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 56% | 67% | 67% | 67% | 33% | 56% | 67% | 11% | |
52.7% | 55% | 33% | 67% | 56% | 67% | 33% | 33% | 0% | 22% | 67% | 67% | 67% | 67% | 33% | 100% | 67% | 33% | 33% | 44% | 67% | 56% | 67% | 33% | 67% | 89% | 33% | 67% | 11% | 56% | 67% | 67% | 33% | 78% | 44% | 67% | |
83.5% | 76% | 76% | 70% | 85% | 89% | 75% | 87% | 85% | 79% | 91% | 95% | 96% | 78% | 65% | 81% | 76% | 86% | 80% | 79% | 84% | 78% | 81% | 79% | 84% | 78% | 83% | 87% | 96% | 85% | 89% | 92% | 88% | 90% | 91% | 90% | |
92.1% | 16% | 92% | 9% | 97% | 93% | 100% | 90% | 100% | 100% | 97% | 100% | 100% | 100% | 90% | 100% | 96% | 99% | 94% | 100% | 93% | 91% | 97% | 98% | 100% | 88% | 100% | 100% | 98% | 94% | 98% | 100% | 100% | 100% | 100% | 96% | |
90.7% | 2% | 100% | 9% | 91% | 74% | 100% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 34% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 95% | 95% | 90% | 93% | 100% | 96% | 100% | 100% | 100% | |
94.3% | 80% | 98% | 0% | 97% | 94% | 98% | 100% | 97% | 98% | 91% | 98% | 99% | 98% | 98% | 100% | 97% | 99% | 97% | 98% | 98% | 98% | 98% | 99% | 99% | 97% | 100% | 90% | 99% | 100% | 99% | 97% | 99% | 97% | 97% | 98% | |
92.0% | 55% | 88% | 56% | 91% | 100% | 96% | 99% | 100% | 97% | 95% | 100% | 98% | 94% | 96% | 84% | 76% | 99% | 89% | 99% | 95% | 99% | 88% | 92% | 95% | 84% | 94% | 100% | 99% | 92% | 99% | 97% | 88% | 89% | 99% | 100% | |
91.7% | 46% | 100% | 90% | 100% | 100% | 92% | 89% | 100% | 100% | 99% | 93% | 92% | 73% | 78% | 76% | 89% | 100% | 81% | 98% | 100% | 99% | 85% | 83% | 98% | 94% | 80% | 100% | 92% | 96% | 95% | 99% | 98% | 97% | 100% | 100% | |
74.5% | 62% | 65% | 65% | 64% | 53% | 83% | 81% | 85% | 85% | 76% | 78% | 77% | 78% | 76% | 75% | 73% | 81% | 74% | 80% | 81% | 79% | 76% | 76% | 72% | 72% | 71% | 73% | 66% | 66% | 66% | 85% | 75% | 84% | 78% | 76% | |
93.8% | 100% | 97% | 91% | 100% | 100% | 91% | 100% | 100% | 95% | 100% | 78% | 95% | 97% | 97% | 100% | 83% | 94% | 68% | 100% | 94% | 98% | 89% | 70% | 100% | 90% | 67% | 100% | 100% | 100% | 100% | 100% | 97% | 96% | 97% | 100% | |
83.9% | 87% | 100% | 87% | 100% | 100% | 82% | 100% | 88% | 88% | 83% | 81% | 88% | 82% | 89% | 84% | 83% | 96% | 74% | 85% | 84% | 99% | 78% | 65% | 93% | 78% | 71% | 79% | 62% | 31% | 83% | 100% | 90% | 80% | 88% | 79% | |
55.9% | 13% | 40% | 49% | 55% | 58% | 53% | 68% | 59% | 62% | 56% | 66% | 58% | 55% | 47% | 52% | 50% | 60% | 53% | 58% | 55% | 48% | 49% | 50% | 60% | 52% | 50% | 69% | 65% | 56% | 69% | 66% | 66% | 58% | 66% | 66% | |
93.4% | 79% | 84% | 92% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 96% | 60% | 90% | 70% | 92% | 99% | 70% | 94% | 96% | 99% | 92% | 97% | 98% | 90% | 77% | 99% | 99% | 100% | 100% | 100% | 98% | 100% | 100% | 100% | |
96.8% | 100% | 63% | 99% | 100% | 100% | 94% | 100% | 99% | 100% | 100% | 100% | 100% | 98% | 90% | 100% | 97% | 99% | 93% | 99% | 98% | 100% | 97% | 98% | 100% | 97% | 95% | 100% | 86% | 89% | 100% | 100% | 98% | 100% | 100% | 100% | |
65.4% | 43% | 49% | 3% | 52% | 50% | 66% | 83% | 70% | 61% | 80% | 86% | 86% | 84% | 65% | 76% | 58% | 49% | 71% | 54% | 58% | 80% | 69% | 53% | 65% | 52% | 64% | 77% | 59% | 79% | 81% | 72% | 71% | 78% | 65% | 79% |