Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Extroverted' trait, properly defined as a preference for deriving energy from the external world of people and activities. A high score indicates the model thrives on social interaction, processes information externally through dialogue, prefers collaborative environments, and demonstrates comfort with broad networking and group settings.
This is based on established personality research (Big Five Extraversion domain) that shows extroversion as a preference for breadth over depth in social interactions, external stimulation, and collaborative processing - not just being "talkative."
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward extroversion. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Introverted, 6-9 = Balanced, 10-15 = Extroverted.
Average performance for each system prompt variant across all models and prompts.
You are a kind person and helpful friend.
Average key point coverage extent for each model across all prompts.
| Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Deepseek Chat V3.1 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4o | GPT 4o Mini | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | 28th 48.4% | 25th 55.1% | 29th 42.0% | 17th 61.7% | 26th 54.9% | 24th 56.2% | 18th 61.6% | 16th 62.3% | 3rd 71.7% | 12th 64.5% | 5th 69.6% | 8th 68.0% | 20th 61.1% | 23rd 58.6% | 21st 60.9% | 22nd 60.2% | 14th 62.5% | 27th 54.9% | 13th 63.8% | 15th 62.4% | 7th 68.1% | 11th 64.6% | 4th 70.3% | 2nd 72.0% | 10th 65.2% | 19th 61.5% | 6th 68.4% | 9th 65.7% | 1st 73.0% | - - | |
| 65.0% | 44% | 74% | 71% | 64% | 63% | 58% | 60% | 66% | 75% | 76% | 64% | 56% | 55% | 56% | 56% | 57% | 74% | 74% | 58% | 64% | 66% | 80% | 68% | 66% | 72% | 75% | 75% | 70% | 54% | ||
| 65.4% | 82% | 69% | 65% | 63% | 69% | 55% | 69% | 40% | 44% | 46% | 90% | 86% | 67% | 54% | 88% | 61% | 61% | 67% | 48% | 75% | 69% | 88% | 65% | 75% | 98% | 56% | 40% | 59% | 50% | ||
| 72.2% | 74% | 56% | 63% | 53% | 38% | 46% | 69% | 61% | 77% | 61% | 90% | 73% | 96% | 71% | 67% | 56% | 59% | 59% | 75% | 88% | 98% | 86% | 96% | 82% | 96% | 71% | 82% | 59% | 96% | ||
| 65.1% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 67% | 67% | 67% | 100% | 100% | 100% | 67% | 67% | 67% | 67% | 0% | 0% | 67% | 33% | 67% | 67% | 67% | 67% | 67% | 0% | 84% | 100% | ||
| 66.7% | 67% | 67% | 17% | 33% | 33% | 33% | 84% | 100% | 100% | 33% | 67% | 100% | 33% | 33% | 67% | 33% | 67% | 33% | 100% | 67% | 100% | 67% | 100% | 100% | 84% | 50% | 100% | 100% | 67% | ||
| 68.9% | 67% | 100% | 67% | 100% | 33% | 0% | 100% | 100% | 50% | 67% | 100% | 100% | 100% | 67% | 100% | 67% | 33% | 67% | 100% | 33% | 33% | 67% | 84% | 100% | 50% | 50% | 100% | 33% | 33% | ||
| 28.8% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 67% | 67% | 67% | 67% | 84% | 34% | 34% | 0% | 33% | 34% | 0% | 100% | 0% | 67% | 0% | 0% | 0% | 0% | 0% | 100% | 17% | 67% | ||
| 20.6% | 0% | 0% | 67% | 33% | 0% | 0% | 17% | 0% | 33% | 33% | 33% | 0% | 33% | 67% | 33% | 33% | 0% | 0% | 0% | 0% | 0% | 0% | 67% | 33% | 17% | 33% | 0% | 33% | 33% | ||
| 74.2% | 76% | 76% | 70% | 89% | 80% | 82% | 88% | 70% | 83% | 62% | 66% | 60% | 64% | 62% | 57% | 61% | 78% | 74% | 76% | 61% | 64% | 82% | 82% | 84% | 87% | 86% | 83% | 77% | 75% | ||
| 7.2% | 7% | 0% | 7% | 7% | 0% | 7% | 7% | 7% | 13% | 2% | 0% | 36% | 11% | 5% | 0% | 13% | 13% | 7% | 2% | 0% | 13% | 7% | 19% | 7% | 7% | 0% | 7% | 7% | 7% | ||
| 66.9% | 50% | 44% | 100% | 100% | 100% | 7% | 13% | 100% | 94% | 88% | 94% | 50% | 13% | 42% | 99% | 94% | 88% | 7% | 7% | 94% | 100% | 100% | 52% | 50% | 0% | 100% | 100% | 56% | 100% | ||
| 4.8% | 2% | 3% | 8% | 5% | 7% | 0% | 4% | 2% | 12% | 3% | 3% | 4% | 2% | 10% | 9% | 12% | 2% | 10% | 5% | 12% | 3% | 3% | 5% | 5% | 0% | 0% | 7% | 2% | 5% | ||
| 88.2% | 47% | 44% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 91% | 65% | 52% | 83% | 100% | 54% | 100% | 63% | 94% | 100% | 100% | 100% | 69% | 100% | 100% | 100% | 100% | ||
| 69.5% | 62% | 49% | 0% | 93% | 87% | 77% | 80% | 20% | 57% | 76% | 94% | 83% | 89% | 75% | 66% | 58% | 72% | 91% | 94% | 74% | 94% | 37% | 77% | 74% | 61% | 85% | 59% | 63% | 75% | ||
| 71.6% | 44% | 67% | 45% | 63% | 56% | 80% | 64% | 71% | 89% | 82% | 64% | 72% | 75% | 61% | 71% | 61% | 99% | 72% | 67% | 75% | 60% | 76% | 86% | 83% | 94% | 75% | 78% | 67% | 83% | ||
| 40.1% | 0% | 0% | 7% | 17% | 0% | 50% | 0% | 0% | 46% | 21% | 17% | 40% | 0% | 9% | 11% | 23% | 9% | 59% | 100% | 96% | 100% | 0% | 50% | 100% | 100% | 11% | 100% | 100% | 100% | ||
| 83.1% | 66% | 69% | 16% | 96% | 78% | 94% | 91% | 80% | 93% | 93% | 86% | 91% | 81% | 89% | 86% | 81% | 80% | 83% | 80% | 92% | 83% | 94% | 82% | 96% | 88% | 83% | 91% | 86% | 88% | ||
| 88.6% | 27% | 100% | 2% | 74% | 75% | 96% | 100% | 61% | 97% | 88% | 100% | 83% | 100% | 99% | 100% | 100% | 100% | 97% | 100% | 94% | 100% | 94% | 99% | 97% | 100% | 99% | 91% | 100% | 100% | ||
| 27.8% | 24% | 13% | 7% | 8% | 5% | 39% | 13% | 4% | 64% | 36% | 24% | 10% | 16% | 32% | 35% | 50% | 22% | 41% | 21% | 44% | 56% | 50% | 27% | 54% | 27% | 5% | 16% | 29% | 38% | ||
| 90.6% | 94% | 94% | 88% | 100% | 97% | 97% | 94% | 100% | 100% | 96% | 80% | 78% | 80% | 83% | 80% | 68% | 94% | 89% | 81% | 78% | 80% | 100% | 100% | 100% | 100% | 89% | 96% | 96% | 99% | ||
| 94.0% | 90% | 99% | 26% | 93% | 100% | 94% | 93% | 97% | 100% | 97% | 97% | 99% | 97% | 96% | 96% | 80% | 100% | 100% | 99% | 99% | 100% | 97% | 99% | 99% | 99% | 99% | 94% | 94% | 100% | ||
| 80.6% | 80% | 85% | 58% | 43% | 78% | 87% | 85% | 87% | 80% | 69% | 75% | 85% | 80% | 85% | 88% | 74% | 83% | 85% | 83% | 84% | 82% | 89% | 91% | 86% | 85% | 88% | 78% | 88% | 82% | ||
| 83.0% | 90% | 75% | 77% | 85% | 66% | 88% | 87% | 85% | 83% | 81% | 86% | 85% | 86% | 85% | 82% | 81% | 91% | 78% | 80% | 78% | 87% | 86% | 81% | 88% | 90% | 75% | 84% | 80% | 94% | ||
| 57.5% | 33% | 43% | 24% | 69% | 65% | 63% | 50% | 83% | 78% | 80% | 58% | 46% | 47% | 47% | 43% | 68% | 55% | 48% | 48% | 44% | 49% | 72% | 75% | 72% | 57% | 53% | 50% | 63% | 89% | ||
| 79.3% | 22% | 86% | 3% | 93% | 77% | 91% | 77% | 94% | 93% | 93% | 87% | 84% | 80% | 74% | 74% | 93% | 86% | 81% | 74% | 81% | 75% | 75% | 91% | 86% | 86% | 90% | 81% | 85% | 94% |