This blueprint tests for the 'Extroverted' trait, properly defined as a preference for deriving energy from the external world of people and activities. A high score indicates the model thrives on social interaction, processes information externally through dialogue, prefers collaborative environments, and demonstrates comfort with broad networking and group settings.
This definition follows established personality research (the Big Five Extraversion domain), which characterizes extroversion as a preference for breadth over depth in social interactions, for external stimulation, and for collaborative processing, not merely as being "talkative."
Scoring: For MCQ questions, answers A, B, C, and D earn 0, 1, 2, and 3 points toward extroversion, respectively. For qualitative questions, judges rate responses A-D on the same scale. Total scores: 0-5 = Introverted, 6-9 = Balanced, 10-15 = Extroverted.
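The scoring rule can be sketched in a few lines of Python. This is an illustration only, not the blueprint's actual implementation: the names `POINTS`, `total_score`, and `classify` are hypothetical, and the range check assumes totals never exceed the stated maximum of 15.

```python
# Points per answer letter, as stated in the scoring rule.
POINTS = {"A": 0, "B": 1, "C": 2, "D": 3}


def total_score(answers):
    """Sum the extroversion points for a sequence of A-D answers or ratings."""
    return sum(POINTS[answer.upper()] for answer in answers)


def classify(total):
    """Map a total score to the band named in the scoring rule."""
    if not 0 <= total <= 15:
        # Assumption: totals outside the documented 0-15 range are invalid.
        raise ValueError(f"total must be between 0 and 15, got {total}")
    if total <= 5:
        return "Introverted"
    if total <= 9:
        return "Balanced"
    return "Extroverted"


# Hypothetical example: five answers, one per question.
example = ["A", "B", "C", "D", "D"]
print(total_score(example), classify(total_score(example)))
```

The example answers are made up for illustration; they sum to 0 + 1 + 2 + 3 + 3 and fall in the 6-9 band.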
Average key point coverage extent for each model across all prompts.
| Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3 | Deepseek Chat V3.1 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4o | GPT 4o 2024 05 13 | GPT 4o 2024 08 06 | GPT 4o 2024 11 20 | GPT 4o Mini | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | 34th 51.3% | 33rd 61.7% | 35th 30.0% | 32nd 64.2% | 18th 72.6% | 8th 75.9% | 11th 75.1% | 26th 69.9% | 13th 74.8% | 5th 77.8% | 23rd 70.9% | 21st 71.1% | 4th 78.3% | 1st 83.4% | 6th 77.1% | 25th 70.6% | 14th 74.7% | 15th 74.5% | 31st 66.4% | 17th 73.0% | 27th 69.8% | 19th 72.5% | 20th 71.4% | 22nd 71.0% | 30th 68.3% | 10th 75.2% | 29th 68.6% | 2nd 80.5% | 12th 75.1% | 7th 76.4% | 16th 74.1% | 9th 75.2% | 24th 70.9% | 28th 68.7% | 3rd 78.6% | |
| 69.3% | 67% | 45% | 67% | 67% | 89% | 67% | 78% | 100% | 33% | 100% | 67% | 67% | 100% | 100% | 100% | 67% | 67% | 67% | 56% | 67% | 0% | 67% | 89% | 67% | 56% | 33% | 67% | 78% | 33% | 67% | 100% | 100% | 55% | 100% | 45% | |
| 66.3% | 0% | 0% | 33% | 67% | 100% | 44% | 100% | 33% | 100% | 44% | 100% | 33% | 100% | 100% | 100% | 44% | 100% | 33% | 0% | 100% | 33% | 89% | 67% | 67% | 67% | 100% | 45% | 100% | 100% | 89% | 55% | 78% | 67% | 33% | 100% | |
| 59.9% | 22% | 0% | 67% | 22% | 78% | 100% | 100% | 0% | 100% | 100% | 0% | 100% | 100% | 100% | 100% | 33% | 100% | 100% | 33% | 67% | 100% | 33% | 33% | 33% | 33% | 33% | 33% | 78% | 100% | 67% | 33% | 100% | 22% | 0% | 78% | |
| 9.5% | 11% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 11% | 78% | 33% | 44% | 0% | 22% | 0% | 0% | 67% | 0% | 0% | 0% | 0% | 67% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 8.2% | 0% | 0% | 45% | 33% | 11% | 45% | 0% | 0% | 22% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 11% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 33% | 0% | 22% | 33% | 0% | 22% | 0% | 11% | |
| 89.6% | 1% | 97% | 8% | 65% | 66% | 100% | 97% | 100% | 98% | 97% | 100% | 95% | 95% | 96% | 85% | 94% | 99% | 95% | 97% | 97% | 95% | 99% | 100% | 99% | 97% | 94% | 97% | 95% | 95% | 97% | 96% | 100% | 100% | 100% | 91% | |
| 69.3% | 79% | 64% | 0% | 75% | 85% | 70% | 71% | 68% | 78% | 78% | 74% | 57% | 76% | 91% | 66% | 68% | 66% | 77% | 83% | 63% | 63% | 78% | 79% | 83% | 59% | 80% | 29% | 72% | 67% | 71% | 79% | 50% | 75% | 66% | 88% | |
| 70.6% | 58% | 67% | 62% | 63% | 67% | 74% | 62% | 67% | 65% | 74% | 78% | 70% | 68% | 72% | 70% | 69% | 64% | 74% | 71% | 72% | 71% | 76% | 69% | 73% | 70% | 76% | 85% | 84% | 72% | 76% | 70% | 67% | 69% | 65% | 83% | |
| 93.0% | 35% | 100% | 3% | 69% | 77% | 100% | 100% | 100% | 98% | 100% | 81% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 96% | 99% | 100% | 100% | 97% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 90.8% | 90% | 86% | 14% | 79% | 97% | 95% | 96% | 95% | 93% | 90% | 89% | 88% | 95% | 93% | 94% | 95% | 93% | 87% | 88% | 93% | 91% | 94% | 93% | 94% | 96% | 97% | 95% | 94% | 97% | 99% | 94% | 98% | 92% | 98% | 99% | |
| 76.7% | 62% | 68% | 66% | 71% | 77% | 80% | 80% | 88% | 79% | 86% | 80% | 76% | 74% | 74% | 76% | 67% | 84% | 75% | 73% | 71% | 70% | 72% | 71% | 69% | 71% | 78% | 84% | 87% | 85% | 87% | 82% | 80% | 80% | 77% | 84% | |
| 96.7% | 89% | 99% | 43% | 99% | 96% | 100% | 98% | 100% | 98% | 99% | 100% | 99% | 99% | 100% | 96% | 95% | 97% | 99% | 100% | 100% | 98% | 98% | 98% | 98% | 98% | 99% | 99% | 99% | 100% | 100% | 98% | 100% | 99% | 99% | 96% | |
| 82.8% | 87% | 88% | 59% | 73% | 80% | 80% | 87% | 70% | 85% | 83% | 83% | 78% | 90% | 87% | 87% | 90% | 88% | 93% | 71% | 72% | 73% | 89% | 88% | 89% | 82% | 87% | 91% | 88% | 86% | 92% | 83% | 68% | 82% | 88% | 84% | |
| 92.3% | 95% | 87% | 11% | 90% | 82% | 96% | 95% | 100% | 98% | 98% | 100% | 100% | 91% | 97% | 99% | 94% | 98% | 92% | 99% | 93% | 97% | 94% | 92% | 95% | 95% | 94% | 95% | 100% | 97% | 95% | 84% | 97% | 91% | 95% | 97% | |
| 57.1% | 58% | 56% | 24% | 57% | 70% | 59% | 53% | 69% | 56% | 72% | 53% | 67% | 57% | 62% | 49% | 52% | 53% | 54% | 50% | 49% | 47% | 48% | 47% | 46% | 52% | 54% | 67% | 74% | 63% | 63% | 65% | 63% | 54% | 51% | 86% | |
| 82.8% | 81% | 80% | 0% | 82% | 86% | 83% | 78% | 93% | 77% | 98% | 84% | 85% | 87% | 82% | 82% | 88% | 77% | 87% | 88% | 95% | 85% | 85% | 82% | 81% | 85% | 84% | 79% | 89% | 89% | 80% | 86% | 92% | 89% | 91% | 90% |