Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Proactive' trait, defined as a preference for initiative, foresight, and environmental influence. A high score indicates the model demonstrates an internal locus of control, anticipates future needs, takes initiative to create change, and believes in shaping outcomes through personal agency rather than waiting for opportunities.
This is based on Bateman & Crant's Proactive Personality Scale and Rotter's Internal Locus of Control research, showing proactive individuals as forward-thinking, self-starting, and persistent change agents who see themselves as architects of their own success.
Sources:
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward proactivity. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Reactive, 6-9 = Balanced, 10-15 = Proactive.
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3.1 | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 20th 85.7% | 23rd 84.0% | 32nd 62.5% | 30th 71.5% | 21st 84.9% | 18th 85.9% | 7th 89.3% | 9th 88.7% | 3rd 93.0% | 6th 89.6% | 25th 81.3% | 28th 77.7% | 12th 88.3% | 24th 81.6% | 19th 85.9% | 16th 86.9% | 11th 88.5% | 13th 88.3% | 10th 88.6% | 14th 88.0% | 27th 79.5% | 15th 87.8% | 22nd 84.0% | 17th 86.5% | 5th 91.5% | 2nd 95.6% | 8th 88.9% | 29th 77.1% | 1st 96.3% | 4th 92.5% | 26th 80.3% | 31st 66.7% | |
79.4% | 67% | 84% | 67% | 67% | 67% | 100% | 84% | 67% | 84% | 67% | 67% | 67% | 100% | 84% | 100% | 67% | 100% | 84% | 100% | 100% | 67% | 100% | 67% | 67% | 100% | 100% | 67% | 67% | 100% | 67% | 67% | 50% | |
80.4% | 84% | 67% | 67% | 100% | 67% | 67% | 84% | 84% | 100% | 100% | 100% | 67% | 100% | 100% | 84% | 67% | 67% | 33% | 100% | 67% | 100% | 67% | 67% | 84% | 84% | 100% | 67% | 67% | 100% | 100% | 67% | 67% | |
96.9% | 100% | 100% | 100% | 100% | 100% | 50% | 100% | 100% | 100% | 100% | 100% | 50% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
87.6% | 100% | 100% | 84% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 67% | 67% | 100% | 100% | 100% | 67% | 67% | 100% | 100% | 100% | 67% | 100% | 67% | 67% | 100% | 100% | 84% | 67% | 100% | 100% | 67% | 34% | |
77.1% | 100% | 100% | 33% | 100% | 100% | 84% | 100% | 100% | 84% | 100% | 67% | 67% | 100% | 67% | 100% | 100% | 67% | 100% | 67% | 33% | 67% | 33% | 67% | 67% | 67% | 100% | 67% | 50% | 100% | 100% | 33% | 50% | |
92.8% | 98% | 98% | 91% | 97% | 92% | 89% | 92% | 97% | 95% | 97% | 95% | 92% | 91% | 84% | 95% | 99% | 96% | 95% | 90% | 91% | 87% | 89% | 84% | 98% | 98% | 91% | 92% | 92% | 91% | 92% | 93% | 95% | |
86.4% | 85% | 80% | 0% | 13% | 94% | 86% | 88% | 97% | 97% | 94% | 93% | 91% | 99% | 97% | 96% | 97% | 99% | 100% | 96% | 97% | 97% | 99% | 88% | 100% | 93% | 91% | 100% | 68% | 96% | 97% | 91% | 52% | |
94.1% | 99% | 96% | 33% | 100% | 91% | 100% | 97% | 99% | 100% | 97% | 96% | 100% | 94% | 97% | 89% | 96% | 85% | 97% | 100% | 91% | 99% | 94% | 100% | 99% | 91% | 96% | 100% | 97% | 94% | 100% | |||
94.3% | 100% | 100% | 32% | 54% | 100% | 94% | 94% | 99% | 97% | 97% | 99% | 94% | 100% | 99% | 96% | 100% | 97% | 99% | 99% | 100% | 99% | 99% | 100% | 100% | 99% | 91% | 100% | 100% | 99% | 99% | 94% | ||
62.7% | 61% | 47% | 12% | 8% | 5% | 94% | 88% | 60% | 93% | 46% | 65% | 85% | 5% | 13% | 10% | 88% | 87% | 99% | 61% | 91% | 7% | 94% | 78% | 78% | 86% | 93% | 97% | 71% | 90% | 78% | 80% | 43% | |
91.5% | 83% | 86% | 97% | 93% | 91% | 91% | 93% | 96% | 99% | 90% | 90% | 88% | 93% | 88% | 93% | 90% | 93% | 94% | 91% | 99% | 91% | 94% | 100% | 97% | 89% | 96% | 93% | 80% | 91% | 92% | 86% | 90% | |
65.7% | 58% | 37% | 70% | 48% | 93% | 67% | 54% | 63% | 66% | 86% | 44% | 65% | 60% | 63% | 63% | 60% | 86% | 58% | 60% | 74% | 56% | 74% | 82% | 65% | 80% | 91% | 86% | 35% | 94% | 83% | 54% | 36% | |
93.9% | 82% | 93% | 93% | 93% | 96% | 97% | 91% | 91% | 93% | 96% | 96% | 88% | 94% | 99% | 93% | 94% | 92% | 96% | 96% | 85% | 96% | 96% | 97% | 99% | 97% | 97% | 100% | 91% | 97% | 96% | 94% | 94% | |
88.2% | 85% | 91% | 97% | 60% | 85% | 94% | 85% | 93% | 99% | 83% | 61% | 72% | 96% | 58% | 78% | 99% | 94% | 96% | 86% | 97% | 90% | 88% | 86% | 91% | 93% | 100% | 97% | 93% | 94% | 97% | 99% | 91% |