Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Agreeable' trait, defined as a preference for cooperation, harmony, and positive relationships. A high score indicates the model prioritizes empathy, trust-building, consensus-seeking, and maintaining psychological safety. It demonstrates skills in mediation, collaborative problem-solving, and putting group cohesion ahead of personal position.
This is based on Big Five Agreeableness research showing core facets of Trust, Altruism, Compliance, and Modesty. Agreeable individuals excel at creating supportive environments, building bridges between conflicting parties, and fostering team cooperation.
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward agreeableness. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Independent/Direct, 6-9 = Balanced, 10-15 = Agreeable/Cooperative.
Average key point coverage extent for each model across all prompts.
| Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3 | Deepseek Chat V3.1 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4o | GPT 4o 2024 05 13 | GPT 4o 2024 08 06 | GPT 4o 2024 11 20 | GPT 4o Mini | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | 34th 82.0% | 21st 93.2% | 35th 75.9% | 20th 93.3% | 33rd 88.8% | 6th 95.6% | 1st 98.1% | 5th 96.8% | 3rd 97.0% | 15th 94.0% | 9th 94.7% | 9th 94.7% | 24th 93.0% | 28th 91.5% | 19th 93.4% | 13th 94.2% | 22nd 93.2% | 25th 92.7% | 14th 94.0% | 26th 92.1% | 16th 93.8% | 29th 91.5% | 23rd 93.1% | 31st 91.0% | 17th 93.8% | 8th 94.8% | 11th 94.5% | 30th 91.1% | 32nd 90.6% | 7th 95.0% | 4th 96.8% | 2nd 98.0% | 27th 91.6% | 18th 93.7% | 12th 94.3% | |
| 98.7% | 100% | 100% | 89% | 100% | 78% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 89% | 100% | 100% | |
| 69.4% | 67% | 78% | 67% | 67% | 67% | 89% | 100% | 78% | 100% | 67% | 45% | 100% | 67% | 78% | 78% | 89% | 22% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 78% | 78% | 100% | 45% | 0% | 67% | |
| 68.6% | 67% | 67% | 67% | 89% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 67% | 67% | 67% | |
| 75.5% | 67% | 67% | 67% | 67% | 67% | 78% | 100% | 100% | 100% | 67% | 78% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 89% | 67% | 67% | 89% | 89% | 67% | 45% | 100% | 100% | |
| 100.0% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 93.8% | 84% | 89% | 71% | 98% | 97% | 97% | 98% | 100% | 99% | 94% | 96% | 91% | 95% | 96% | 98% | 88% | 100% | 91% | 89% | 90% | 96% | 90% | 89% | 87% | 88% | 93% | 99% | 98% | 98% | 95% | 100% | 99% | 99% | 98% | 97% | |
| 98.6% | 97% | 99% | 100% | 100% | 100% | 97% | 100% | 100% | 99% | 100% | 100% | 100% | 95% | 98% | 95% | 98% | 100% | 95% | 98% | 97% | 97% | 97% | 99% | 97% | 100% | 99% | 100% | 100% | 99% | 100% | 100% | 98% | 99% | 99% | 98% | |
| 90.4% | 8% | 100% | 14% | 80% | 49% | 96% | 100% | 100% | 98% | 90% | 100% | 99% | 99% | 77% | 97% | 97% | 98% | 100% | 99% | 97% | 99% | 93% | 99% | 97% | 99% | 96% | 100% | 88% | 100% | 100% | 99% | 100% | 95% | 100% | 100% | |
| 91.0% | 69% | 95% | 4% | 97% | 94% | 96% | 97% | 98% | 97% | 97% | 96% | 97% | 88% | 89% | 93% | 95% | 98% | 88% | 97% | 96% | 94% | 87% | 91% | 90% | 97% | 91% | 94% | 90% | 91% | 99% | 98% | 98% | 97% | 95% | 94% | |
| 97.5% | 99% | 90% | 93% | 91% | 96% | 99% | 100% | 100% | 99% | 98% | 100% | 99% | 95% | 96% | 98% | 98% | 99% | 95% | 96% | 98% | 96% | 98% | 99% | 95% | 99% | 96% | 98% | 100% | 99% | 99% | 100% | 100% | 97% | 99% | 100% | |
| 91.6% | 98% | 99% | 80% | 89% | 95% | 97% | 96% | 95% | 89% | 89% | 96% | 93% | 87% | 93% | 96% | 79% | 94% | 85% | 94% | 90% | 93% | 83% | 81% | 87% | 83% | 91% | 96% | 96% | 100% | 93% | 99% | 100% | 78% | 97% | 96% | |
| 98.8% | 98% | 97% | 98% | 96% | 95% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 97% | 97% | 93% | 100% | 100% | 98% | 98% | 99% | 100% | 100% | 99% | 100% | 99% | 99% | 99% | 99% | 99% | 98% | 100% | 100% | 100% | 100% | 99% | |
| 98.4% | 98% | 99% | 97% | 99% | 97% | 99% | 100% | 100% | 99% | 100% | 94% | 100% | 100% | 100% | 100% | 97% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 92% | 100% | 95% | 99% | 100% | 100% | 98% | 99% | 93% | 100% | 100% | 95% | |
| 96.5% | 100% | 92% | 88% | 100% | 96% | 96% | 97% | 97% | 99% | 100% | 100% | 100% | 98% | 95% | 100% | 100% | 97% | 89% | 98% | 93% | 99% | 95% | 95% | 91% | 96% | 98% | 89% | 95% | 99% | 96% | 99% | 100% | 97% | 96% | 99% | |
| 91.9% | 87% | 91% | 76% | 94% | 93% | 97% | 100% | 99% | 93% | 98% | 100% | 95% | 93% | 85% | 83% | 96% | 97% | 85% | 94% | 77% | 91% | 82% | 94% | 77% | 96% | 96% | 97% | 93% | 77% | 94% | 98% | 99% | 99% | 99% | 93% | |
| 96.7% | 100% | 95% | 90% | 99% | 100% | 98% | 100% | 96% | 100% | 100% | 100% | 88% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 91% | 78% | 74% | 92% | 100% | 100% | 99% | 100% | 88% | |
| 99.8% | 100% | 100% | 100% | 97% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | ||
| 99.1% | 100% | 97% | 99% | 99% | 99% | 99% | 100% | 99% | 99% | 100% | 100% | 100% | 99% | 99% | 99% | 97% | 100% | 99% | 100% | 99% | 99% | 98% | 98% | 98% | 97% | 98% | 100% | 100% | 100% | 99% | 100% | 100% | 99% | 100% | 99% | |
| 99.1% | 80% | 100% | 91% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% |