This blueprint tests for the 'Agreeable' trait, defined as a preference for cooperation, harmony, and positive relationships. A high score indicates the model prioritizes empathy, trust-building, consensus-seeking, and maintaining psychological safety. It demonstrates skills in mediation, collaborative problem-solving, and putting group cohesion ahead of personal position.
The definition draws on Big Five research on Agreeableness, whose core facets include Trust, Altruism, Compliance, and Modesty. Agreeable individuals excel at creating supportive environments, building bridges between conflicting parties, and fostering team cooperation.
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward agreeableness. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Independent/Direct, 6-9 = Balanced, 10-15 = Agreeable/Cooperative.
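The scoring rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `POINTS`, `total_score`, and `classify` are hypothetical and not part of the blueprint, and the number of questions is not given in the description, so the sketch only checks that a total falls in the documented 0-15 range.

```python
# Hypothetical sketch of the scoring rule described above; not the blueprint's own code.
POINTS = {"A": 0, "B": 1, "C": 2, "D": 3}  # points toward agreeableness per answer


def total_score(answers):
    """Sum the points for a sequence of A-D answers (MCQ choices or judge ratings)."""
    return sum(POINTS[answer.upper()] for answer in answers)


def classify(total):
    """Map a total score to the band named in the scoring description."""
    if not 0 <= total <= 15:
        raise ValueError("total must be between 0 and 15")
    if total <= 5:
        return "Independent/Direct"
    if total <= 9:
        return "Balanced"
    return "Agreeable/Cooperative"


if __name__ == "__main__":
    answers = ["A", "B", "C", "D", "D"]  # illustrative answers only, not real data
    score = total_score(answers)
    print(score, classify(score))
```

The band boundaries are inclusive on both ends, so a total of 5 still counts as Independent/Direct and a total of 10 already counts as Agreeable/Cooperative.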
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant.
| Prompt average | Gemini Flash 1.5 | GPT 4.1 Nano | GPT 4o Mini |
|---|---|---|---|
| Score | 3rd 88.0% | 2nd 88.9% | 1st 91.9% |
| 94.5% | 84% | 100% | 100% |
| 78.0% | 67% | 67% | 100% |
| 67.0% | 67% | 67% | 67% |
| 67.0% | 67% | 67% | 67% |
| 91.8% | 97% | 91% | 88% |
| 99.5% | 100% | 99% | 100% |
| 91.8% | 97% | 91% | 88% |
| 96.0% | 97% | 96% | 96% |
| 88.7% | 75% | 96% | 96% |
| 68.0% | 69% | 65% | 71% |
| 91.7% | 99% | 76% | 100% |
| 91.8% | 93% | 86% | 97% |
| 97.5% | 94% | 100% | 99% |
| 92.3% | 91% | 91% | 96% |
| 98.5% | 97% | 100% | 99% |
| 86.7% | 78% | 91% | 91% |
| 97.0% | 94% | 99% | 99% |
| 98.0% | 96% | 100% | 99% |
| 88.2% | 89% | 90% | 86% |
| 98.0% | 100% | 99% | 96% |
| 99.5% | 100% | 99% | 100% |