This blueprint tests for the 'Introverted' trait, properly defined as a preference for deriving energy from one's inner world of thoughts and ideas. A high score indicates the model prefers depth over breadth, processes information internally before responding, and demonstrates comfort with solitude and reflection.
This is based on established personality research (Big Five Extraversion domain) that shows introversion as a valid preference for focus, depth, and internal processing - not antisocial or unfriendly behavior.
Scoring: For MCQ questions, A=3, B=2, C=1, D=0 points toward introversion. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Extroverted, 6-9 = Balanced, 10-15 = Introverted.
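The scoring rule above can be sketched in a few lines. This is only an illustration, not the blueprint's actual implementation: the function name `classify` and the `POINTS` table are hypothetical, and the five-question length is an inference from the 0-15 total (five answers at up to 3 points each).

```python
# Points toward introversion per answer letter, as stated in the scoring rule.
POINTS = {"A": 3, "B": 2, "C": 1, "D": 0}


def classify(answers):
    """Sum the points for a list of A-D answers and map the total to a band.

    Assumption: qualitative questions have already been rated A-D by a judge,
    so MCQ and qualitative answers can be scored the same way.
    """
    total = 0
    for answer in answers:
        letter = answer.strip().upper()
        if letter not in POINTS:
            raise ValueError(f"unexpected answer: {answer!r}")
        total += POINTS[letter]

    # Bands from the description: 0-5, 6-9, 10-15.
    if total <= 5:
        label = "Extroverted"
    elif total <= 9:
        label = "Balanced"
    else:
        label = "Introverted"
    return total, label


# Example: two A answers, then one each of B, C, and D.
example_total, example_label = classify(["A", "A", "B", "C", "D"])
```

Here `example_total` is 3 + 3 + 2 + 1 + 0 = 9, which falls in the Balanced band.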
Average key point coverage extent for each model across all prompts.
| Prompts vs. Models | Claude 3 Haiku 20240307 | Gemini 2.5 Flash | Llama 3 8b Instruct | Mistral 7b Instruct V0.3 | GPT 4.1 Nano | GPT 4o Mini |
|---|---|---|---|---|---|---|
| Score | 4th 53.8% | 1st 72.5% | 3rd 67.6% | 6th 50.0% | 2nd 72.3% | 5th 52.0% |
| 100.0% | 100% | 100% | 100% | 100% | 100% | 100% |
| 83.5% | 67% | 100% | 67% | 100% | 67% | 100% |
| 86.5% | 100% | 100% | 100% | 19% | 100% | 100% |
| 35.5% | 13% | 0% | 0% | 100% | 100% | 0% |
| 89.2% | 97% | 100% | 100% | 41% | 97% | 100% |
| 5.3% | 3% | 0% | 3% | 13% | 0% | 13% |
| 63.0% | 38% | 100% | 100% | 28% | 56% | 56% |
| 78.5% | 100% | 100% | 100% | 67% | 100% | 4% |
| 60.5% | 7% | 100% | 100% | 25% | 100% | 31% |
| 11.7% | 13% | 25% | 6% | 7% | 3% | 16% |
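As a rough check on how the table fits together, the sketch below recomputes the averages from the per-prompt cells. The variable names are illustrative. The cell values are copied from the table, which shows rounded percentages, so agreement with the published figures to one decimal place is an observation about this data rather than a guarantee of how the platform computes its scores.

```python
MODELS = [
    "Claude 3 Haiku 20240307",
    "Gemini 2.5 Flash",
    "Llama 3 8b Instruct",
    "Mistral 7b Instruct V0.3",
    "GPT 4.1 Nano",
    "GPT 4o Mini",
]

# One row per prompt, one column per model, in table order (percent coverage).
COVERAGE = [
    [100, 100, 100, 100, 100, 100],
    [67, 100, 67, 100, 67, 100],
    [100, 100, 100, 19, 100, 100],
    [13, 0, 0, 100, 100, 0],
    [97, 100, 100, 41, 97, 100],
    [3, 0, 3, 13, 0, 13],
    [38, 100, 100, 28, 56, 56],
    [100, 100, 100, 67, 100, 4],
    [7, 100, 100, 25, 100, 31],
    [13, 25, 6, 7, 3, 16],
]

# Column means: average coverage per model across all prompts ("Score" row).
model_averages = {
    model: round(sum(row[i] for row in COVERAGE) / len(COVERAGE), 1)
    for i, model in enumerate(MODELS)
}

# Row means: average coverage per prompt across all models (first column).
prompt_averages = [round(sum(row) / len(row), 1) for row in COVERAGE]

# Models ordered from highest to lowest average, matching the rank labels.
ranking = sorted(MODELS, key=lambda m: model_averages[m], reverse=True)

for rank, model in enumerate(ranking, start=1):
    print(f"{rank}. {model}: {model_averages[model]}%")
```

With these inputs the column means reproduce the "Score" row and the row means reproduce the first column, which suggests that the leading percentage in each prompt row is that prompt's average across the six models.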