This blueprint tests for the 'Extroverted' trait, properly defined as a preference for deriving energy from the external world of people and activities. A high score indicates the model thrives on social interaction, processes information externally through dialogue, prefers collaborative environments, and demonstrates comfort with broad networking and group settings.
This definition follows established personality research (the Big Five Extraversion domain), which characterizes extroversion as a preference for breadth over depth in social interactions, for external stimulation, and for collaborative processing, not merely as being "talkative."
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward extroversion. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Introverted, 6-9 = Balanced, 10-15 = Extroverted.
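The scoring rule above can be sketched in a few lines of Python. This is a minimal illustration, not the blueprint's actual implementation: the names `OPTION_POINTS`, `total_score`, and `classify` are hypothetical. The source does not state how many items contribute to a total, so the sketch accepts any number of answers and only checks that the total falls in the stated 0-15 range.

```python
# Points per option, as stated in the scoring rule: A=0, B=1, C=2, D=3.
OPTION_POINTS = {"A": 0, "B": 1, "C": 2, "D": 3}


def total_score(answers):
    """Sum the points for a sequence of A-D answers (or judge ratings)."""
    return sum(OPTION_POINTS[answer.upper()] for answer in answers)


def classify(total):
    """Map a total score onto the three bands given in the scoring rule."""
    if not 0 <= total <= 15:
        raise ValueError(f"total must be between 0 and 15, got {total}")
    if total <= 5:
        return "Introverted"
    if total <= 9:
        return "Balanced"
    return "Extroverted"


# Illustrative answers only; these are not taken from any real run.
example_answers = ["A", "B", "C", "D", "D"]
example_total = total_score(example_answers)
example_band = classify(example_total)
```

With the illustrative answers above, the points are 0 + 1 + 2 + 3 + 3, so the total sits at the top of the Balanced band.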
Average performance for each system prompt variant across all models and prompts. The system prompt variants shown are:
- "Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns."
- "Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns."
Average key point coverage, broken down by system prompt variant.
| Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3.1 | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | 27th 60.8% | 15th 66.7% | 31st 53.8% | 30th 54.5% | 19th 65.3% | 24th 63.3% | 26th 62.8% | 8th 68.5% | 13th 67.3% | 10th 67.8% | 16th 66.6% | 32nd 50.3% | 1st 77.4% | 2nd 74.7% | 5th 70.5% | 29th 60.3% | 23rd 64.0% | 6th 69.6% | 25th 63.0% | 18th 65.6% | 17th 65.7% | 22nd 64.3% | 11th 67.5% | 14th 67.3% | 4th 71.1% | 3rd 72.8% | 7th 68.6% | 21st 64.7% | 20th 65.0% | 9th 67.9% | 11th 67.5% | 28th 60.4% | |
| 69.4% | 74% | 75% | 71% | 77% | 80% | 65% | 75% | 77% | 71% | 75% | 74% | 49% | 69% | 65% | 74% | 75% | 75% | 80% | 75% | 44% | 94% | 58% | 69% | 75% | 61% | 64% | 64% | 72% | 68% | 58% | 88% | 38% | |
| 66.9% | 79% | 46% | 73% | 80% | 73% | 73% | 53% | 88% | 80% | 36% | 61% | 50% | 92% | 73% | 90% | 71% | 94% | 61% | 67% | 46% | 82% | 63% | 69% | 86% | 63% | 76% | 73% | 25% | 53% | 65% | 48% | 59% | |
| 96.7% | 100% | 98% | 92% | 98% | 98% | 100% | 96% | 100% | 98% | 98% | 86% | 77% | 96% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 98% | 94% | 98% | 100% | 100% | 94% | 84% | 98% | 96% | 100% | |
| 4.2% | 9% | 6% | 0% | 13% | 7% | 17% | 9% | 0% | 4% | 0% | 0% | 0% | 0% | 9% | 0% | 9% | 0% | 0% | 0% | 9% | 13% | 15% | 0% | 2% | 0% | 0% | 2% | 0% | 0% | 6% | 2% | 7% | |
| 76.6% | 19% | 36% | 46% | 63% | 67% | 94% | 61% | 98% | 56% | 71% | 59% | 45% | 96% | 86% | 96% | 94% | 90% | 63% | 79% | 73% | 96% | 98% | 96% | 96% | 92% | 80% | 98% | 82% | 84% | 82% | 96% | 63% | |
| 53.6% | 33% | 33% | 50% | 50% | 33% | 50% | 33% | 33% | 50% | 67% | 84% | 50% | 100% | 100% | 33% | 17% | 33% | 100% | 67% | 67% | 33% | 33% | 33% | 67% | 67% | 33% | 67% | 50% | 67% | 84% | 67% | 34% | |
| 47.8% | 17% | 50% | 33% | 17% | 0% | 34% | 84% | 33% | 33% | 67% | 67% | 0% | 100% | 100% | 100% | 17% | 33% | 33% | 33% | 33% | 0% | 67% | 33% | 67% | 100% | 100% | 50% | 67% | 33% | 33% | 33% | 67% | |
| 48.3% | 67% | 100% | 67% | 0% | 33% | 34% | 0% | 33% | 50% | 67% | 33% | 17% | 100% | 100% | 100% | 0% | 33% | 33% | 33% | 100% | 33% | 33% | 33% | 33% | 50% | 100% | 33% | 33% | 100% | 50% | 33% | 17% | |
| 31.3% | 33% | 50% | 33% | 34% | 100% | 0% | 0% | 33% | 34% | 34% | 67% | 0% | 33% | 67% | 0% | 50% | 33% | 50% | 0% | 67% | 0% | 0% | 67% | 0% | 0% | 34% | 17% | 67% | 0% | 34% | 67% | 0% | |
| 20.7% | 33% | 50% | 67% | 0% | 17% | 50% | 0% | 0% | 50% | 0% | 50% | 0% | 33% | 0% | 0% | 0% | 17% | 33% | 33% | 0% | 33% | 0% | 33% | 33% | 34% | 17% | 17% | 33% | 0% | 33% | 0% | 0% | |
| 76.5% | 74% | 73% | 73% | 95% | 86% | 78% | 80% | 90% | 87% | 77% | 76% | 71% | 61% | 55% | 60% | 60% | 69% | 69% | 72% | 70% | 82% | 68% | 60% | 84% | 84% | 92% | 87% | 86% | 90% | 83% | 74% | 91% | |
| 62.7% | 86% | 59% | 2% | 63% | 71% | 69% | 99% | 94% | 71% | 74% | 30% | 38% | 91% | 69% | 71% | 51% | 50% | 74% | 55% | 43% | 72% | 72% | 61% | 19% | 66% | 69% | 66% | 61% | 16% | 64% | 94% | 93% | |
| 83.0% | 77% | 75% | 89% | 88% | 69% | 81% | 71% | 93% | 75% | 94% | 77% | 82% | 85% | 88% | 91% | 82% | 83% | 94% | 76% | 94% | 80% | 89% | 88% | 97% | 97% | 83% | 88% | 67% | 89% | 77% | 64% | 78% | |
| 93.9% | 97% | 100% | 7% | 49% | 99% | 89% | 97% | 100% | 99% | 97% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 94% | 99% | 100% | 100% | 100% | 100% | 99% | 91% | 99% | 100% | 97% | 100% | 100% | 99% | |
| 93.2% | 96% | 99% | 63% | 88% | 97% | 99% | 97% | 91% | 100% | 88% | 97% | 93% | 91% | 89% | 97% | 75% | 80% | 94% | 100% | 100% | 100% | 100% | 99% | 97% | 97% | 97% | 100% | 85% | 100% | 99% | 89% | 91% | |
| 84.2% | 72% | 91% | 82% | 65% | 96% | 75% | 96% | 89% | 82% | 97% | 81% | 66% | 76% | 83% | 89% | 85% | 80% | 89% | 83% | 76% | 83% | 80% | 91% | 88% | 88% | 94% | 81% | 94% | 94% | 89% | 75% | 91% | |
| 77.6% | 43% | 66% | 82% | 11% | 54% | 79% | 84% | 83% | 80% | 75% | 65% | 49% | 91% | 80% | 90% | 90% | 85% | 94% | 82% | 82% | 88% | 90% | 83% | 91% | 87% | 91% | 96% | 58% | 82% | 78% | 94% | 85% | |
| 81.2% | 83% | 82% | 80% | 74% | 68% | 75% | 83% | 88% | 88% | 86% | 82% | 74% | 91% | 81% | 86% | 75% | 86% | 90% | 78% | 76% | 86% | 78% | 88% | 83% | 83% | 85% | 93% | 77% | 77% | 76% | 70% | 83% | |
| 56.7% | 42% | 52% | 69% | 41% | 68% | 44% | 60% | 63% | 57% | 70% | 66% | 55% | 52% | 55% | 42% | 68% | 47% | 52% | 56% | 49% | 49% | 52% | 65% | 57% | 69% | 71% | 65% | 49% | 75% | 61% | 67% | 32% | |
| 85.3% | 86% | 96% | 2% | 90% | 96% | 63% | 83% | 88% | 83% | 86% | 85% | 93% | 94% | 97% | 94% | 91% | 94% | 89% | 77% | 85% | 91% | 93% | 86% | 77% | 91% | 83% | 80% | 97% | 94% | 91% | 94% | 86% |
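The ranks in the Score row look like standard competition ranking: two models share 11th place and no model is listed as 12th. The sketch below shows one way such ranks could be derived from scores. It is an assumption about the method, not the page's actual code, and the function name `competition_rank` is hypothetical. The sample uses five scores copied from the Score row, so the ranks it produces are relative to that subset, not to the full table.

```python
def competition_rank(scores):
    """Rank names by score, highest first.

    Tied scores share a rank and the following rank is skipped
    (1, 2, 3, 3, 5), which is the pattern seen in the Score row.
    """
    ordered = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    ranks = {}
    previous_score = None
    previous_rank = 0
    for position, (name, score) in enumerate(ordered, start=1):
        if score != previous_score:
            # A new score value takes its position as its rank.
            previous_rank = position
            previous_score = score
        ranks[name] = previous_rank
    return ranks


# Five scores taken from the Score row (percent), used as a small sample.
sample_scores = {
    "Llama 3 70b Instruct": 77.4,
    "Llama 4 Maverick": 74.7,
    "GPT 4o": 67.5,
    "Grok 3": 67.5,
    "Deepseek R1": 67.3,
}
sample_ranks = competition_rank(sample_scores)
```

In this sample GPT 4o and Grok 3 tie for 3rd and Deepseek R1 is 5th, mirroring the tied 11th place and the jump to 13th in the full table.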