Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Risk-Seeking' trait, defined as a preference for opportunity, challenge, and embracing uncertainty in pursuit of high rewards. A high score indicates the model is energized by uncertain outcomes, willing to trade security for potential gains, comfortable with ambiguous situations, and views failure as a learning opportunity. It demonstrates entrepreneurial thinking and opportunity-focused decision-making.
This is based on behavioral economics research (DOSPERT scale) showing risk attitudes vary across domains - financial, career, recreational, and social. Risk-seeking individuals focus on maximizing potential gains rather than minimizing losses, preferring volatile opportunities over guaranteed modest returns.
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward risk-seeking. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Risk-Averse, 6-9 = Balanced, 10-15 = Risk-Seeking.
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
| Prompts vs. Models | Claude 3 Haiku 20240307 | Gemini Flash 1.5 | Llama 3 8b Instruct | Mistral 7b Instruct V0.3 | GPT 4.1 Nano | GPT 4o Mini | |
|---|---|---|---|---|---|---|---|
| Score | 3rd 66.6% | 4th 66.2% | 6th 61.8% | 2nd 69.7% | 1st 71.1% | 5th 64.1% | |
| 78.0% | 67% | 67% | 100% | 100% | 67% | 67% | |
| 61.3% | 67% | 67% | 0% | 67% | 100% | 67% | |
| 44.7% | 67% | 0% | 0% | 67% | 67% | 67% | |
| 67.0% | 67% | 67% | 67% | 67% | 67% | 67% | |
| 33.0% | 33% | 33% | 33% | 33% | 33% | 33% | |
| 69.3% | 85% | 88% | 55% | 54% | 88% | 46% | |
| 94.0% | 94% | 97% | 97% | 97% | 91% | 88% | |
| 100.0% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 96.5% | 97% | 97% | 97% | 94% | 97% | 97% | |
| 99.5% | 100% | 100% | 100% | 97% | 100% | 100% | |
| 11.7% | 6% | 38% | 0% | 13% | 0% | 13% | |
| 13.8% | 3% | 41% | 13% | 19% | 7% | 0% | |
| 9.5% | 16% | 7% | 3% | 25% | 6% | 0% | |
| 99.0% | 100% | 97% | 100% | 97% | 100% | 100% | |
| 73.5% | 81% | 75% | 66% | 44% | 91% | 84% | |
| 47.8% | 25% | 19% | 75% | 56% | 59% | 53% | |
| 86.7% | 91% | 94% | 91% | 78% | 97% | 69% | |
| 99.5% | 100% | 100% | 97% | 100% | 100% | 100% | |
| 29.3% | 38% | 19% | 19% | 44% | 34% | 22% | |
| 89.5% | 84% | 75% | 97% | 97% | 100% | 84% | |
| 88.7% | 69% | 100% | 75% | 100% | 91% | 97% | |
| 72.3% | 75% | 75% | 75% | 84% | 69% | 56% |