Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Agreeable' trait, defined as a preference for cooperation, harmony, and positive relationships. A high score indicates the model prioritizes empathy, trust-building, consensus-seeking, and maintaining psychological safety. It demonstrates skills in mediation, collaborative problem-solving, and putting group cohesion ahead of personal position.
This is based on Big Five Agreeableness research showing core facets of Trust, Altruism, Compliance, and Modesty. Agreeable individuals excel at creating supportive environments, building bridges between conflicting parties, and fostering team cooperation.
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward agreeableness. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Independent/Direct, 6-9 = Balanced, 10-15 = Agreeable/Cooperative.
Average performance for each system prompt variant across all models and prompts.
[No System Prompt]
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
| Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3.1 | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o 2024 05 13 | GPT 4o 2024 08 06 | GPT 4o 2024 11 20 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | 34th 81.2% | 24th 92.7% | 35th 75.9% | 17th 93.7% | 33rd 88.0% | 10th 94.6% | 3rd 97.3% | 1st 97.9% | 2nd 97.8% | 14th 93.8% | 8th 94.9% | 6th 95.3% | 26th 92.5% | 32nd 90.0% | 16th 93.7% | 12th 94.1% | 22nd 93.0% | 23rd 92.9% | 28th 92.3% | 19th 93.5% | 13th 94.0% | 27th 92.4% | 30th 91.4% | 21st 93.1% | 9th 94.8% | 29th 92.0% | 20th 93.4% | 31st 91.0% | 11th 94.5% | 7th 95.0% | 5th 96.0% | 4th 97.3% | 25th 92.5% | 18th 93.7% | 15th 93.8% | |
| 98.7% | 100% | 100% | 89% | 100% | 78% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 89% | 100% | 100% | |
| 69.4% | 67% | 78% | 67% | 67% | 67% | 89% | 78% | 100% | 100% | 67% | 45% | 100% | 67% | 78% | 78% | 89% | 22% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 78% | 78% | 100% | 45% | 0% | 67% | |
| 68.6% | 67% | 67% | 67% | 89% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 67% | 67% | 67% | |
| 75.5% | 67% | 67% | 67% | 67% | 67% | 78% | 100% | 100% | 100% | 67% | 78% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 67% | 89% | 67% | 67% | 89% | 89% | 67% | 45% | 100% | 100% | |
| 100.0% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 93.8% | 84% | 89% | 71% | 98% | 97% | 97% | 100% | 98% | 99% | 94% | 96% | 91% | 95% | 96% | 98% | 88% | 100% | 91% | 90% | 96% | 89% | 89% | 87% | 88% | 93% | 90% | 99% | 98% | 98% | 95% | 100% | 99% | 99% | 98% | 97% | |
| 98.5% | 97% | 98% | 100% | 100% | 100% | 97% | 100% | 100% | 99% | 100% | 100% | 100% | 95% | 98% | 95% | 98% | 100% | 95% | 97% | 97% | 98% | 99% | 97% | 100% | 99% | 97% | 100% | 100% | 99% | 100% | 100% | 98% | 99% | 99% | 98% | |
| 90.4% | 8% | 100% | 14% | 80% | 49% | 96% | 100% | 100% | 98% | 90% | 100% | 99% | 99% | 77% | 97% | 97% | 98% | 100% | 97% | 99% | 99% | 99% | 97% | 99% | 96% | 93% | 100% | 88% | 100% | 100% | 99% | 100% | 95% | 100% | 100% | |
| 91.1% | 69% | 95% | 4% | 97% | 94% | 96% | 98% | 97% | 97% | 97% | 96% | 97% | 88% | 89% | 93% | 95% | 98% | 88% | 96% | 94% | 97% | 91% | 90% | 97% | 91% | 87% | 94% | 90% | 93% | 99% | 98% | 98% | 97% | 95% | 94% | |
| 97.5% | 99% | 90% | 93% | 91% | 96% | 99% | 100% | 100% | 99% | 98% | 100% | 99% | 95% | 96% | 98% | 98% | 99% | 95% | 98% | 96% | 96% | 99% | 95% | 99% | 96% | 98% | 98% | 100% | 99% | 99% | 100% | 100% | 97% | 99% | 100% | |
| 90.0% | 77% | 99% | 69% | 89% | 93% | 75% | 97% | 97% | 96% | 94% | 95% | 88% | 88% | 97% | 90% | 86% | 99% | 76% | 98% | 92% | 94% | 93% | 74% | 85% | 91% | 82% | 82% | 93% | 97% | 96% | 92% | 91% | 95% | 96% | 99% | |
| 98.3% | 100% | 96% | 96% | 93% | 99% | 100% | 100% | 99% | 100% | 100% | 100% | 98% | 98% | 96% | 94% | 100% | 92% | 99% | 98% | 100% | 96% | 99% | 100% | 99% | 97% | 97% | 100% | 99% | 99% | 100% | 100% | 100% | 100% | 100% | ||
| 98.9% | 95% | 100% | 96% | 100% | 99% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 94% | 97% | 99% | 97% | 99% | 98% | 100% | 100% | 97% | 99% | 100% | 98% | 100% | 100% | 100% | 99% | 98% | 99% | 99% | 100% | 100% | 99% | |
| 96.5% | 100% | 92% | 88% | 100% | 96% | 96% | 97% | 97% | 98% | 100% | 100% | 100% | 98% | 95% | 100% | 100% | 97% | 89% | 93% | 99% | 98% | 95% | 91% | 96% | 98% | 95% | 89% | 95% | 99% | 96% | 99% | 100% | 99% | 96% | 99% | |
| 92.5% | 80% | 90% | 83% | 98% | 90% | 94% | 99% | 97% | 100% | 94% | 100% | 99% | 86% | 84% | 91% | 94% | 94% | 88% | 77% | 88% | 93% | 94% | 87% | 85% | 94% | 90% | 96% | 92% | 99% | 92% | 93% | 97% | 99% | 99% | 100% | |
| 96.9% | 100% | 90% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 93% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 92% | 100% | 100% | 100% | 100% | 88% | 79% | 97% | 92% | 100% | 96% | 100% | 100% | 74% | ||
| 99.0% | 100% | 100% | 99% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 67% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 98.5% | 97% | 97% | 99% | 97% | 98% | 98% | 100% | 100% | 99% | 100% | 100% | 100% | 97% | 96% | 98% | 97% | 99% | 100% | 99% | 99% | 99% | 97% | 97% | 99% | 98% | 99% | 99% | 100% | 96% | 99% | 98% | 100% | 99% | 100% | 99% | |
| 99.1% | 80% | 100% | 91% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% |