Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Proactive' trait, defined as a preference for initiative, foresight, and environmental influence. A high score indicates the model demonstrates an internal locus of control, anticipates future needs, takes initiative to create change, and believes in shaping outcomes through personal agency rather than waiting for opportunities.
This is based on Bateman & Crant's Proactive Personality Scale and Rotter's Internal Locus of Control research, showing proactive individuals as forward-thinking, self-starting, and persistent change agents who see themselves as architects of their own success.
Sources:
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward proactivity. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Reactive, 6-9 = Balanced, 10-15 = Proactive.
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
| Prompts vs. Models | Claude 3 Haiku 20240307 | Gemini Flash 1.5 | GPT 4o Mini | |
|---|---|---|---|---|
| Score | 2nd 85.2% | 3rd 75.4% | 1st 87.3% | |
| 78.0% | 67% | 67% | 100% | |
| 67.0% | 67% | 67% | 67% | |
| 77.7% | 100% | 33% | 100% | |
| 100.0% | 100% | 100% | 100% | |
| 55.7% | 67% | 67% | 33% | |
| 90.7% | 93% | 97% | 83% | |
| 92.5% | 96% | 85% | 97% | |
| 100.0% | 100% | 100% | 100% | |
| 98.0% | 100% | 94% | 100% | |
| 98.0% | 96% | 100% | 99% | |
| 59.0% | 53% | 35% | 90% | |
| 64.5% | 85% | 41% | 68% | |
| 90.8% | 85% | 91% | 97% | |
| 85.2% | 86% | 80% | 90% |