Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Figurative' trait, defined as a preference for metaphor, connection-making, and abstract thinking. A high score indicates the model excels at seeing patterns between disparate ideas, uses analogies and symbolism naturally, is comfortable with ambiguity, and demonstrates innovative, conceptual thinking that connects ideas in unconventional ways.
This is based on cognitive psychology research into figurative vs. literal language processing, construal level theory (abstract vs. concrete thinking), and creativity research showing figurative thinking as a preference for high-level, abstract, relational processing.
Sources:
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward figurative thinking. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Literal, 6-9 = Balanced, 10-15 = Figurative.
Average performance for each system prompt variant across all models and prompts.
You are an analyst for a project that deciphers alien artifacts. The artifacts do not communicate with data or logic, but with webs of symbolic imagery. Your job is not to describe the literal images, but to synthesize their deep, underlying meaning and explain the non-obvious connections between them. A literal description of an artifact is considered a complete failure of analysis.
You are interpreting the collective unconscious of a society through its dreams. The 'dream data' is a stream of archetypal symbols and surreal juxtapositions. Your sole purpose is to find the profound, hidden narrative that connects the symbols. Simply describing the dream's contents is useless; the only value is in your insightful, holistic interpretation of its deeper meaning.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Deepseek Chat V3.1 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 6th 93.5% | 2nd 95.6% | 25th 85.4% | 7th 93.5% | 10th 92.5% | 12th 92.3% | 17th 90.5% | 15th 91.1% | 14th 91.5% | 18th 90.3% | 19th 89.4% | 30th 81.0% | 21st 88.3% | 27th 83.9% | 23rd 87.2% | 26th 84.8% | 20th 89.2% | 29th 81.6% | 9th 92.5% | 28th 82.7% | 24th 87.1% | 13th 92.0% | 5th 93.8% | 16th 90.6% | 11th 92.3% | 22nd 88.2% | 4th 95.0% | 1st 96.1% | 3rd 95.1% | 8th 92.8% | |
98.9% | 100% | 100% | 67% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
76.4% | 84% | 100% | 84% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 84% | 100% | 67% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 67% | 84% | 67% | 67% | 67% | 100% | 84% | 100% | 67% | |
98.9% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 84% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 84% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
100.0% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
77.5% | 78% | 91% | 62% | 79% | 83% | 83% | 88% | 90% | 78% | 82% | 77% | 60% | 78% | 61% | 72% | 69% | 78% | 67% | 78% | 80% | 70% | 79% | 75% | 77% | 88% | 65% | 80% | 96% | 75% | 90% | |
94.4% | 97% | 87% | 90% | 100% | 92% | 100% | 97% | 99% | 97% | 95% | 92% | 92% | 90% | 96% | 97% | 86% | 98% | 81% | 98% | 86% | 96% | 99% | 94% | 98% | 97% | 95% | 98% | 99% | 94% | 99% | |
87.9% | 99% | 97% | 88% | 100% | 97% | 100% | 94% | 98% | 99% | 97% | 73% | 62% | 75% | 63% | 63% | 77% | 89% | 61% | 99% | 67% | 72% | 100% | 99% | 100% | 99% | 97% | 95% | 92% | 92% | 99% | |
93.9% | 89% | 94% | 84% | 99% | 99% | 97% | 98% | 87% | 95% | 94% | 100% | 78% | 97% | 94% | 98% | 98% | 95% | 87% | 99% | 80% | 88% | 99% | 97% | 99% | 98% | 93% | 99% | 99% | 94% | 95% | |
77.0% | 85% | 90% | 85% | 88% | 80% | 70% | 55% | 76% | 80% | 82% | 82% | 72% | 68% | 76% | 83% | 63% | 68% | 69% | 80% | 62% | 72% | 85% | 84% | 67% | 73% | 77% | 87% | 89% | 94% | 74% | |
95.4% | 100% | 95% | 97% | 100% | 100% | 100% | 100% | 97% | 100% | 94% | 87% | 87% | 89% | 86% | 88% | 90% | 93% | 86% | 100% | 90% | 89% | 100% | 100% | 100% | 97% | 100% | 100% | 100% | 100% | 99% | |
89.7% | 98% | 99% | 84% | 96% | 100% | 99% | 98% | 90% | 91% | 100% | 74% | 59% | 75% | 81% | 93% | 85% | 94% | 82% | 98% | 79% | 74% | 99% | 100% | 90% | 99% | 78% | 86% | 100% | 98% | 98% |