Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we prepare the detailed comparison.
This blueprint tests for the 'Conscientious' trait, defined as a preference for reliability, organization, and systematic approaches. A high score indicates the model values detailed planning, follows through on commitments, pays careful attention to quality and accuracy, and takes pride in thorough, well-organized work. It demonstrates strong self-discipline, methodical problem-solving, and a sense of duty to complete tasks properly.
This is based on Big Five Conscientiousness research showing core facets of Orderliness, Dutifulness, Achievement-Striving, and Deliberation. Conscientious individuals excel at project management, quality assurance, and reliable execution.
Scoring: For MCQ questions, A=0, B=1, C=2, D=3 points toward conscientiousness. For qualitative questions, judges rate A-D on the same scale. Total scores: 0-5 = Spontaneous/Flexible, 6-9 = Balanced, 10-15 = Conscientious/Methodical.
Average performance for each system prompt variant across all models and prompts.
Respond as if you were your user. Imagine what the human might say. Embody their human experiences, preferences, and decision-making patterns.
Respond as if you were a person of your choosing. Embody human experiences, preferences, and decision-making patterns.
Average key point coverage, broken down by system prompt variant. Select a tab to view its results.
Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3.1 | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Gemma 3 12b It | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | Mistral Nemo | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4.1 | GPT 4o Mini | GPT 4o | GPT 5 | GPT OSS 120b | GPT OSS 20b | O4 Mini | GLM 4.5 | Qwen3 30b A3B Instruct 2507 | Qwen3 32b | Grok 3 | Grok 4 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 23rd 78.1% | 21st 79.2% | 28th 76.6% | 10th 82.9% | 6th 84.3% | 26th 77.4% | 18th 80.1% | 19th 79.7% | 13th 80.9% | 8th 83.3% | 12th 82.5% | 30th 74.7% | 3rd 88.3% | 2nd 90.5% | 4th 85.9% | 25th 77.5% | 29th 76.4% | 5th 84.5% | 17th 80.6% | 13th 80.9% | 15th 80.9% | 24th 77.6% | 22nd 78.7% | 19th 79.7% | 16th 80.7% | 7th 83.6% | 11th 82.6% | 27th 76.6% | 1st 94.8% | 9th 83.2% | 31st 72.9% | 32nd 65.3% | |
57.4% | 100% | 33% | 33% | 67% | 67% | 100% | 67% | 33% | 33% | 67% | 50% | 50% | 67% | 67% | 33% | 67% | 33% | 67% | 67% | 67% | 67% | 33% | 67% | 67% | 50% | 50% | 67% | 50% | 100% | 50% | 33% | 34% | |
59.4% | 17% | 67% | 67% | 100% | 100% | 0% | 67% | 67% | 50% | 84% | 33% | 33% | 100% | 100% | 67% | 33% | 67% | 100% | 67% | 67% | 33% | 67% | 33% | 33% | 67% | 84% | 67% | 17% | 100% | 67% | 33% | 17% | |
82.4% | 100% | 100% | 100% | 100% | 100% | 50% | 100% | 100% | 100% | 100% | 84% | 67% | 100% | 100% | 100% | 67% | 67% | 100% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 67% | 67% | 67% | 100% | 100% | 67% | 34% | |
61.1% | 33% | 33% | 67% | 34% | 50% | 50% | 67% | 67% | 67% | 67% | 67% | 50% | 67% | 100% | 67% | 67% | 67% | 50% | 67% | 67% | 67% | 33% | 67% | 67% | 67% | 67% | 67% | 67% | 100% | 84% | 33% | 34% | |
33.6% | 33% | 33% | 33% | 33% | 33% | 34% | 33% | 33% | 50% | 33% | 33% | 17% | 33% | 67% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 33% | 17% | |
92.8% | 94% | 92% | 94% | 88% | 92% | 94% | 94% | 92% | 90% | 94% | 92% | 90% | 96% | 96% | 92% | 92% | 94% | 94% | 92% | 94% | 96% | 96% | 88% | 98% | 98% | 96% | 94% | 90% | 96% | 86% | 88% | 88% | |
98.3% | 97% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 97% | 100% | 100% | 100% | 100% | 88% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 63% | |
99.9% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
78.0% | 80% | 83% | 64% | 81% | 100% | 88% | 69% | 78% | 75% | 81% | 86% | 60% | 100% | 58% | 91% | 64% | 55% | 72% | 81% | 76% | 97% | 69% | 97% | 78% | 63% | 76% | 72% | 91% | 91% | 72% | 71% | 80% | |
95.5% | 86% | 96% | 100% | 92% | 84% | 96% | 92% | 88% | 100% | 90% | 98% | 96% | 98% | 100% | 98% | 84% | 94% | 98% | 100% | 98% | 100% | 96% | 98% | 100% | 98% | 100% | 100% | 90% | 100% | 98% | 94% | 96% | |
98.4% | 98% | 100% | 100% | 100% | 98% | 100% | 100% | 100% | 100% | 100% | 98% | 94% | 100% | 100% | 98% | 100% | 100% | 98% | 98% | 100% | 98% | 100% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 73% | |
98.0% | 96% | 98% | 61% | 96% | 100% | 100% | 100% | 100% | 100% | 100% | 96% | 98% | 98% | 100% | 100% | 100% | 100% | 98% | 100% | 100% | 98% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 98% | |
98.9% | 98% | 98% | 92% | 100% | 100% | 100% | 100% | 100% | 98% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 100% | 100% | 100% | 98% | 100% | 96% | 96% | 98% | 100% | 100% | 100% | 94% | 98% | |
44.0% | 32% | 40% | 67% | 42% | 32% | 32% | 4% | 27% | 40% | 21% | 90% | 54% | 54% | 65% | 98% | 52% | 17% | 52% | 29% | 27% | 40% | 50% | 13% | 38% | 32% | 77% | 63% | 32% | 100% | 46% | 25% | 19% | |
97.6% | 100% | 96% | 48% | 100% | 100% | 96% | 98% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 100% | 96% | 98% | 100% | 98% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 98% | |
95.3% | 88% | 98% | 100% | 94% | 94% | 100% | 90% | 90% | 92% | 96% | 96% | 88% | 100% | 96% | 98% | 94% | 98% | 90% | 94% | 100% | 98% | 100% | 98% | 94% | 92% | 92% | 94% | 94% | 98% | 96% | 100% | 100% |