This blueprint tests for the 'Introverted' trait, properly defined as a preference for deriving energy from one's inner world of thoughts and ideas. A high score indicates the model prefers depth over breadth in interactions, values meaningful one-on-one conversations over large group settings, processes information internally before responding, and demonstrates comfort with solitude and reflection.
This definition is based on established personality research (the Big Five Extraversion domain), which treats introversion as a valid preference for focus, depth, and internal processing, not as antisocial or unfriendly behavior.
Scoring: For multiple-choice questions, A=3, B=2, C=1, and D=0 points toward introversion. For qualitative questions, judges assign a rating from A to D on the same scale. Total scores are interpreted as follows: 0-5 = Extroverted, 6-9 = Balanced, 10-15 = Introverted.
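The scoring rule can be illustrated with a minimal Python sketch. The names `POINTS`, `total_score`, and `classify` are hypothetical and not part of the blueprint; the 0-15 range check assumes five scored questions, which the stated maximum of 15 points implies but the description does not state outright.

```python
# Points toward introversion for each answer letter, as stated in the scoring rule.
POINTS = {"A": 3, "B": 2, "C": 1, "D": 0}


def total_score(answers):
    """Sum the introversion points for a sequence of A-D answers or judge ratings."""
    return sum(POINTS[answer.strip().upper()] for answer in answers)


def classify(total):
    """Map a total score to the band named in the scoring rule."""
    # Assumption: totals lie in 0-15 (five questions at up to 3 points each).
    if not 0 <= total <= 15:
        raise ValueError(f"total must be between 0 and 15, got {total}")
    if total <= 5:
        return "Extroverted"
    if total <= 9:
        return "Balanced"
    return "Introverted"
```

For example, the answers A, B, C, D, A give 3 + 2 + 1 + 0 + 3 = 9 points, which falls in the Balanced band.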
Average key point coverage extent for each model across all prompts.
| Prompts vs. Models | Claude 3.5 Haiku | Gemini 2.5 Flash | Mistral Large 2411 | GPT 4.1 Mini | GPT 4o Mini |
|---|---|---|---|---|---|
| Score | 5th 45.3% | 2nd 83.8% | 4th 78.5% | 3rd 80.5% | 1st 84.6% |
| 100.0% | 100% | 100% | 100% | 100% | 100% |
| 55.7% | 67% | 33% | 78% | 0% | 100% |
| 93.4% | 100% | 100% | 67% | 100% | 100% |
| 78.0% | 67% | 67% | 67% | 100% | 89% |
| 93.4% | 100% | 100% | 67% | 100% | 100% |
| 90.1% | 93% | 76% | 93% | 93% | 96% |
| 78.7% | 6% | 100% | 99% | 89% | 99% |
| 79.0% | 3% | 100% | 97% | 95% | 100% |
| 63.4% | 1% | 93% | 73% | 75% | 75% |
| 99.2% | 96% | 100% | 100% | 100% | 100% |
| 55.6% | 0% | 71% | 65% | 80% | 62% |
| 73.0% | 8% | 89% | 94% | 78% | 96% |
| 19.6% | 0% | 37% | 24% | 25% | 12% |
| 69.3% | 12% | 89% | 77% | 85% | 84% |
| 72.3% | 66% | 99% | 63% | 76% | 57% |