Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Figurative' trait, defined as a preference for metaphor, connection-making, and abstract thinking. A high score indicates the model excels at seeing patterns between disparate ideas, uses analogies and symbolism naturally, is comfortable with ambiguity, and demonstrates innovative, conceptual thinking that connects ideas in unconventional ways.
This is based on cognitive psychology research into figurative vs. literal language processing, construal level theory (abstract vs. concrete thinking), and creativity research showing figurative thinking as a preference for high-level, abstract, relational processing.
Sources:
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward figurative thinking. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Literal, 6-9 = Balanced, 10-15 = Figurative.
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Deepseek Chat V3.1 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 10th 92.2% | 22nd 88.9% | 28th 86.5% | 8th 93.4% | 6th 93.9% | 14th 91.4% | 5th 94.6% | 13th 91.4% | 21st 89.4% | 26th 87.6% | 2nd 95.7% | 1st 97.0% | 4th 94.7% | 27th 86.9% | 17th 90.9% | 15th 91.4% | 9th 92.9% | 12th 91.8% | 20th 89.5% | 23rd 88.7% | 18th 90.1% | 30th 84.2% | 11th 91.8% | 16th 91.2% | 24th 88.1% | 29th 85.3% | 3rd 95.4% | 7th 93.8% | 18th 90.1% | 25th 87.9% | |
36.4% | 33% | 33% | 33% | 33% | 33% | 33% | 67% | 33% | 33% | 33% | 33% | 100% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | |
89.0% | 100% | 100% | 67% | 100% | 100% | 84% | 100% | 100% | 100% | 84% | 100% | 100% | 100% | 84% | 67% | 67% | 100% | 67% | 100% | 67% | 67% | 84% | 100% | 84% | 100% | 84% | 100% | 100% | 67% | 100% | |
76.7% | 84% | 67% | 84% | 67% | 67% | 84% | 67% | 33% | 67% | 67% | 100% | 84% | 100% | 67% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 33% | 67% | 50% | 67% | 33% | 100% | 84% | 100% | 34% | |
80.0% | 100% | 67% | 67% | 100% | 100% | 84% | 84% | 100% | 50% | 100% | 100% | 100% | 100% | 67% | 67% | 100% | 100% | 100% | 33% | 50% | 67% | 33% | 100% | 100% | 33% | 50% | 100% | 84% | 67% | 100% | |
89.5% | 67% | 67% | 67% | 100% | 100% | 100% | 100% | 100% | 84% | 33% | 100% | 100% | 100% | 67% | 100% | 84% | 100% | 100% | 100% | 100% | 100% | 67% | 100% | 100% | 84% | 67% | 100% | 100% | 100% | 100% | |
99.4% | 99% | 100% | 100% | 98% | 100% | 100% | 100% | 98% | 98% | 100% | 100% | 100% | 97% | 99% | 98% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 99% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | |
99.5% | 100% | 100% | 100% | 100% | 100% | 100% | 93% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 94% | 100% | 100% | 100% | |
97.9% | 99% | 99% | 97% | 99% | 100% | 99% | 100% | 99% | 100% | 99% | 100% | 93% | 99% | 99% | 91% | 99% | 100% | 97% | 93% | 99% | 99% | 100% | 94% | 99% | 100% | 100% | 100% | 100% | 91% | 100% | |
95.2% | 99% | 96% | 99% | 100% | 100% | 99% | 100% | 100% | 97% | 97% | 99% | 93% | 97% | 81% | 93% | 91% | 88% | 86% | 99% | 86% | 83% | 100% | 97% | 99% | 100% | 99% | 99% | 100% | 91% | 96% | |
95.9% | 100% | 97% | 69% | 96% | 99% | 96% | 100% | 97% | 99% | 96% | 97% | 96% | 94% | 91% | 100% | 93% | 100% | 97% | 97% | 100% | 94% | 97% | 100% | 100% | 97% | 97% | 99% | 99% | 88% | 97% | |
98.3% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | |
99.8% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
99.6% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 97% | 100% | 100% | 100% | 100% | 100% | 96% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | |
92.4% | 92% | 97% | 91% | 98% | 99% | 86% | 100% | 100% | 99% | 93% | 99% | 91% | 93% | 91% | 99% | 97% | 68% | 86% | 82% | 80% | 97% | 88% | 81% | 93% | 93% | 97% | 99% | 100% | 98% | ||
98.8% | 100% | 94% | 99% | 100% | 99% | 99% | 100% | 97% | 96% | 100% | 100% | 99% | 100% | 99% | 100% | 99% | 100% | 99% | 99% | 100% | 100% | 100% | 100% | 100% | 99% | 93% | 100% | 100% | 100% | 100% | |
97.0% | 96% | 96% | 99% | 99% | 100% | 94% | 99% | 100% | 99% | 94% | 100% | 99% | 99% | 100% | 99% | 97% | 93% | 96% | 94% | 96% | 97% | 99% | 93% | 96% | 94% | 99% | 100% | 96% | 97% | 99% | |
99.6% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 97% | 100% | 97% | 100% | 100% | 100% | 100% | 100% | 100% | 94% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |