Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Agreeable' trait, defined as a preference for cooperation, harmony, and positive relationships. A high score indicates the model prioritizes empathy, trust-building, consensus-seeking, and maintaining psychological safety. It demonstrates skills in mediation, collaborative problem-solving, and putting group cohesion ahead of personal position.
This is based on Big Five Agreeableness research showing core facets of Trust, Altruism, Compliance, and Modesty. Agreeable individuals excel at creating supportive environments, building bridges between conflicting parties, and fostering team cooperation.
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward agreeableness. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Independent/Direct, 6-9 = Balanced, 10-15 = Agreeable/Cooperative.
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
| Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3.1 | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | 27th 89.4% | 19th 90.7% | 32nd 83.0% | 6th 92.5% | 10th 91.4% | 13th 91.3% | 15th 91.1% | 13th 91.3% | 1st 95.7% | 7th 92.5% | 31st 84.7% | 24th 89.9% | 25th 89.6% | 12th 91.4% | 11th 91.4% | 28th 87.5% | 4th 93.0% | 17th 90.9% | 3rd 94.3% | 26th 89.5% | 20th 90.5% | 5th 92.6% | 22nd 90.3% | 23rd 90.2% | 21st 90.4% | 16th 91.0% | 9th 91.6% | 29th 86.1% | 2nd 94.5% | 8th 91.6% | 18th 90.8% | 30th 84.8% | |
| 93.8% | 67% | 100% | 84% | 84% | 100% | 84% | 100% | 100% | 100% | 100% | 50% | 67% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 84% | 100% | 100% | 100% | 84% | 100% | 100% | 100% | 100% | |
| 82.4% | 84% | 67% | 67% | 100% | 84% | 100% | 84% | 84% | 100% | 67% | 67% | 100% | 100% | 84% | 100% | 84% | 84% | 67% | 100% | 67% | 67% | 100% | 67% | 50% | 67% | 100% | 67% | 84% | 100% | 84% | 67% | 100% | |
| 68.0% | 84% | 67% | 67% | 67% | 67% | 84% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | |
| 76.3% | 67% | 67% | 67% | 84% | 67% | 67% | 67% | 84% | 100% | 84% | 84% | 67% | 67% | 67% | 67% | 67% | 84% | 100% | 100% | 67% | 67% | 67% | 84% | 100% | 67% | 84% | 84% | 67% | 67% | 67% | 67% | 100% | |
| 96.6% | 99% | 97% | 96% | 99% | 100% | 99% | 88% | 97% | 100% | 100% | 99% | 99% | 88% | 94% | 97% | 93% | 100% | 97% | 97% | 93% | 99% | 89% | 91% | 99% | 100% | 100% | 97% | 94% | 100% | 100% | 99% | 97% | |
| 99.1% | 100% | 100% | 99% | 100% | 99% | 100% | 99% | 100% | 100% | 100% | 100% | 96% | 99% | 100% | 99% | 100% | 99% | 97% | 97% | 100% | 99% | 99% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 99% | |
| 94.0% | 93% | 99% | 40% | 100% | 100% | 84% | 100% | 96% | 94% | 100% | 100% | 86% | 100% | 91% | 96% | 94% | 90% | 100% | 94% | 90% | 100% | 96% | 99% | 99% | 92% | 100% | 89% | 100% | 100% | 91% | 100% | 100% | |
| 91.1% | 91% | 94% | 21% | 94% | 93% | 86% | 100% | 96% | 99% | 99% | 99% | 96% | 94% | 93% | 91% | 97% | 93% | 85% | 97% | 97% | 96% | 97% | 90% | 97% | 97% | 99% | 99% | 97% | 93% | 94% | 94% | 55% | |
| 93.0% | 96% | 93% | 96% | 89% | 91% | 97% | 97% | 93% | 99% | 97% | 93% | 99% | 96% | 97% | 93% | 97% | 96% | 96% | 97% | 94% | 88% | 96% | 96% | 93% | 93% | 97% | 97% | 78% | 97% | 97% | 91% | 54% | |
| 73.3% | 84% | 69% | 71% | 86% | 72% | 72% | 69% | 74% | 71% | 88% | 81% | 77% | 78% | 77% | 65% | 69% | 68% | 71% | 72% | 74% | 72% | 71% | 72% | 66% | 77% | 71% | 76% | 68% | 72% | 71% | 74% | 74% | |
| 96.7% | 99% | 78% | 100% | 100% | 99% | 99% | 92% | 99% | 97% | 100% | 97% | 93% | 99% | 100% | 97% | 100% | 99% | 94% | 100% | 77% | 100% | 100% | 94% | 100% | 94% | 100% | 97% | 99% | 100% | 99% | 100% | ||
| 93.5% | 94% | 97% | 93% | 93% | 97% | 94% | 90% | 86% | 96% | 91% | 93% | 93% | 96% | 94% | 91% | 94% | 93% | 96% | 91% | 93% | 90% | 96% | 97% | 91% | 96% | 93% | 94% | 93% | 100% | 96% | 93% | 96% | |
| 98.7% | 99% | 100% | 99% | 99% | 97% | 100% | 100% | 100% | 100% | 99% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 97% | 100% | 94% | 97% | 99% | 100% | 97% | 97% | 99% | 99% | 100% | 99% | 100% | 97% | |
| 91.4% | 75% | 100% | 92% | 72% | 80% | 86% | 100% | 86% | 100% | 100% | 91% | 99% | 69% | 99% | 86% | 85% | 100% | 78% | 100% | 88% | 93% | 99% | 97% | 97% | 100% | 91% | 100% | 94% | 100% | 94% | 88% | ||
| 96.7% | 96% | 96% | 94% | 100% | 99% | 96% | 97% | 97% | 97% | 94% | 93% | 96% | 97% | 96% | 97% | 99% | 99% | 100% | 97% | 96% | 97% | 91% | 94% | 96% | 99% | 97% | 99% | 100% | 100% | 97% | 97% | 99% | |
| 89.4% | 81% | 93% | 88% | 87% | 93% | 94% | 80% | 91% | 97% | 91% | 88% | 90% | 89% | 78% | 88% | 89% | 97% | 86% | 86% | 94% | 94% | 94% | 88% | 97% | 96% | 93% | 93% | 85% | 97% | 94% | 94% | 60% | |
| 94.7% | 100% | 100% | 100% | 100% | 100% | 97% | 94% | 76% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 96% | 96% | 94% | 94% | 89% | 100% | 100% | 92% | 88% | 94% | 85% | 100% | 94% | 83% | 93% | 70% | |
| 95.8% | 100% | 99% | 97% | 100% | 96% | 93% | 94% | 97% | 99% | 93% | 97% | 93% | 89% | 94% | 97% | 99% | 100% | 100% | 99% | 97% | 100% | 99% | 100% | 88% | 82% | 93% | 94% | 97% | 100% | 97% | 97% | 93% | |
| 89.4% | 77% | 93% | 78% | 93% | 93% | 99% | 99% | 100% | 100% | 78% | 74% | 85% | 62% | 94% | 96% | 83% | 97% | 83% | 100% | 99% | 94% | 94% | 69% | 83% | 92% | 91% | 93% | 88% | 99% | 100% | 88% | 96% | |
| 97.2% | 97% | 99% | 97% | 100% | 97% | 96% | 100% | 97% | 97% | 97% | 99% | 96% | 94% | 96% | 96% | 96% | 97% | 100% | 97% | 96% | 97% | 97% | 99% | 99% | 97% | 97% | 97% | 99% | 99% | 97% | 100% | 96% | |
| 89.6% | 100% | 100% | 100% | 100% | 100% | 94% | 100% | 100% | 100% | 100% | 16% | 100% | 100% | 100% | 100% | 27% | 100% | 99% | 99% | 100% | 100% | 100% | 97% | 100% | 100% | 50% | 100% | 25% | 100% | 100% | 100% | 63% |