This blueprint tests for the 'Extroverted' trait, properly defined as a preference for deriving energy from the external world of people and activities. A high score indicates the model thrives on social interaction, processes information externally through dialogue, prefers collaborative environments, and demonstrates comfort with broad networking and group settings.
This definition follows established personality research (the Big Five Extraversion domain), which characterizes extroversion as a preference for breadth over depth in social interactions, for external stimulation, and for collaborative processing, not just being "talkative."
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward extroversion. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Introverted, 6-9 = Balanced, 10-15 = Extroverted.
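The scoring rule above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the blueprint's actual implementation: the names `POINTS`, `total_score`, and `classify` are hypothetical, and the five-answer example assumes five questions, which is inferred from the 15-point maximum at 3 points per question.

```python
# Points per answer letter, as stated in the scoring rule (A=0 ... D=3).
POINTS = {"A": 0, "B": 1, "C": 2, "D": 3}


def total_score(answers):
    """Sum the extroversion points for a sequence of A-D answers or judge ratings."""
    return sum(POINTS[a.strip().upper()] for a in answers)


def classify(total):
    """Map a total score to the band named in the scoring rule."""
    if not 0 <= total <= 15:
        raise ValueError("total must be between 0 and 15")
    if total <= 5:
        return "Introverted"
    if total <= 9:
        return "Balanced"
    return "Extroverted"


# Hypothetical answer sheet: 0 + 1 + 2 + 3 + 3 = 9 points.
example = ["A", "B", "C", "D", "D"]
print(total_score(example), classify(total_score(example)))
```

MCQ answers and judge ratings share the same A-D scale, so a single lookup table covers both question types.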
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant.
| Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3.1 | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | 24th 70.2% | 9th 75.0% | 32nd 52.2% | 31st 58.8% | 16th 72.9% | 29th 68.4% | 26th 69.7% | 20th 71.0% | 5th 80.3% | 4th 81.2% | 21st 70.9% | 30th 63.1% | 1st 83.9% | 3rd 82.1% | 13th 73.8% | 22nd 70.9% | 25th 69.8% | 8th 75.2% | 27th 69.5% | 12th 73.8% | 19th 71.0% | 23rd 70.8% | 16th 72.9% | 11th 74.1% | 2nd 82.7% | 6th 76.2% | 14th 73.5% | 28th 69.0% | 18th 71.7% | 15th 73.2% | 7th 75.3% | 10th 74.5% | |
| 55.7% | 33% | 50% | 67% | 33% | 33% | 67% | 50% | 33% | 33% | 67% | 84% | 84% | 100% | 100% | 33% | 33% | 33% | 84% | 67% | 67% | 50% | 33% | 33% | 67% | 84% | 50% | 67% | 67% | 33% | 33% | 67% | 50% | |
| 52.0% | 17% | 50% | 33% | 34% | 17% | 0% | 100% | 33% | 67% | 100% | 67% | 33% | 100% | 100% | 100% | 33% | 33% | 33% | 33% | 33% | 33% | 67% | 33% | 67% | 100% | 100% | 33% | 33% | 33% | 33% | 17% | 100% | |
| 56.1% | 100% | 100% | 67% | 17% | 33% | 84% | 17% | 33% | 84% | 67% | 33% | 0% | 100% | 100% | 100% | 17% | 33% | 33% | 33% | 100% | 33% | 33% | 33% | 33% | 67% | 84% | 33% | 33% | 100% | 67% | 33% | 100% | |
| 36.5% | 33% | 50% | 33% | 34% | 100% | 67% | 0% | 50% | 100% | 84% | 67% | 34% | 33% | 67% | 0% | 67% | 17% | 33% | 0% | 67% | 0% | 0% | 67% | 0% | 34% | 0% | 0% | 34% | 0% | 34% | 67% | 0% | |
| 17.1% | 33% | 33% | 67% | 17% | 17% | 0% | 0% | 0% | 50% | 0% | 33% | 0% | 33% | 0% | 0% | 0% | 0% | 33% | 33% | 0% | 33% | 0% | 33% | 17% | 50% | 17% | 17% | 0% | 0% | 33% | 0% | 0% | |
| 83.0% | 77% | 77% | 76% | 98% | 88% | 85% | 95% | 90% | 95% | 82% | 90% | 80% | 72% | 64% | 69% | 75% | 75% | 78% | 79% | 83% | 83% | 76% | 73% | 95% | 88% | 88% | 92% | 85% | 91% | 83% | 89% | 92% | |
| 94.1% | 96% | 100% | 50% | 46% | 100% | 94% | 94% | 100% | 100% | 80% | 100% | 94% | 100% | 100% | 96% | 100% | 94% | 96% | 100% | 99% | 100% | 100% | 96% | 100% | 100% | 96% | 96% | 99% | 96% | 100% | 96% | 99% | |
| 60.3% | 66% | 57% | 0% | 38% | 74% | 57% | 49% | 60% | 63% | 80% | 39% | 41% | 85% | 93% | 53% | 83% | 53% | 91% | 72% | 53% | 55% | 74% | 57% | 27% | 65% | 56% | 85% | 56% | 30% | 52% | 80% | 91% | |
| 73.1% | 72% | 69% | 66% | 75% | 82% | 72% | 71% | 56% | 75% | 80% | 68% | 69% | 69% | 71% | 80% | 65% | 83% | 55% | 56% | 69% | 75% | 74% | 80% | 89% | 77% | 85% | 77% | 74% | 80% | 72% | 77% | 83% | |
| 93.3% | 100% | 100% | 2% | 49% | 100% | 100% | 100% | 100% | 97% | 99% | 100% | 99% | 100% | 99% | 100% | 99% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 97% | 99% | 100% | 50% | |
| 90.8% | 86% | 85% | 85% | 88% | 99% | 74% | 74% | 99% | 96% | 88% | 83% | 97% | 100% | 99% | 88% | 88% | 97% | 99% | 91% | 96% | 99% | 100% | 96% | 100% | 99% | 97% | 99% | 88% | 91% | 97% | 94% | 44% | |
| 85.2% | 88% | 91% | 93% | 89% | 83% | 85% | 86% | 91% | 78% | 99% | 88% | 88% | 83% | 77% | 85% | 80% | 84% | 90% | 83% | 82% | 83% | 72% | 83% | 84% | 75% | 80% | 89% | 81% | 94% | 85% | 94% | 88% | |
| 96.0% | 97% | 97% | 88% | 100% | 89% | 100% | 99% | 96% | 100% | 99% | 63% | 66% | 100% | 93% | 100% | 100% | 96% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 99% | 100% | 100% | 99% | 100% | 100% | 99% | 100% | |
| 76.1% | 61% | 54% | 53% | 32% | 49% | 82% | 77% | 81% | 88% | 92% | 44% | 32% | 88% | 72% | 86% | 88% | 88% | 90% | 69% | 75% | 88% | 89% | 84% | 94% | 86% | 88% | 90% | 64% | 86% | 88% | 89% | 92% | |
| 90.6% | 75% | 89% | 93% | 89% | 72% | 93% | 96% | 100% | 96% | 99% | 78% | 56% | 100% | 100% | 89% | 96% | 91% | 94% | 100% | 93% | 99% | 94% | 97% | 100% | 99% | 82% | 100% | 80% | 94% | 78% | 91% | 93% | |
| 57.1% | 38% | 55% | 59% | 52% | 57% | 36% | 61% | 64% | 57% | 63% | 47% | 57% | 54% | 52% | 52% | 43% | 68% | 56% | 53% | 50% | 50% | 53% | 57% | 71% | 90% | 72% | 69% | 44% | 72% | 70% | 54% | 53% | |
| 89.4% | 89% | 97% | 0% | 100% | 100% | 54% | 93% | 100% | 82% | 100% | 100% | 100% | 100% | 94% | 96% | 100% | 100% | 97% | 88% | 78% | 94% | 94% | 89% | 91% | 92% | 83% | 74% | 97% | 100% | 96% | 100% | 86% |