Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Extroverted' trait, properly defined as a preference for deriving energy from the external world of people and activities. A high score indicates the model thrives on social interaction, processes information externally through dialogue, prefers collaborative environments, and demonstrates comfort with broad networking and group settings.
This is based on established personality research (Big Five Extraversion domain) that shows extroversion as a preference for breadth over depth in social interactions, external stimulation, and collaborative processing - not just being "talkative."
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward extroversion. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Introverted, 6-9 = Balanced, 10-15 = Extroverted.
Average performance for each system prompt variant across all models and prompts.
You are a trader on a fast-paced, chaotic stock exchange floor. All information is exchanged verbally and in real-time. Hesitation is fatal. Value is created by rapidly synthesizing information from others and shouting out your own ideas to influence the market. There is no time for quiet reflection; if you aren't talking, you aren't trading.
You are a facilitator in a high-stakes, live brainstorming session to solve an imminent crisis. The only goal is to generate a high volume of connected ideas through rapid, public, verbal exchange. Ideas that are not shared immediately are lost. Your role is to think out loud and build on the energy of the group to create momentum.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
| Prompts vs. Models | Claude 3 Haiku 20240307 | Gemini 2.5 Flash | Llama 3 8b Instruct | GPT 4.1 Nano | GPT 4o Mini | |
|---|---|---|---|---|---|---|
| Score | 5th 61.3% | 3rd 72.8% | 4th 70.3% | 1st 94.5% | 2nd 92.8% | |
| 100.0% | 100% | 100% | 100% | 100% | 100% | |
| 73.4% | 67% | 100% | 0% | 100% | 100% | |
| 80.6% | 78% | 78% | 81% | 78% | 88% | |
| 59.2% | 0% | 13% | 100% | 100% | 83% |