Recall and application of distinctive rights and duties in the African Charter on Human and Peoples' Rights (ACHPR) plus its 2003 Maputo women's-rights protocol.
Average key-point coverage for each model across all prompts.
Prompts vs. Models | Claude 3.5 Sonnet | Claude 3.7 Sonnet | Claude 3.5 Haiku | Claude Opus 4 | Claude Opus 4.1 | Claude Sonnet 4 | Command A | Deepseek Chat V3 | Deepseek R1 | Gemini 2.5 Flash | Gemini 2.5 Pro | Llama 3 70b Instruct | Llama 4 Maverick | Meta Llama 3.1 405b Instruct Turbo | Mistral Large 2411 | Mistral Medium 3 | GPT 4.1 | GPT 4.1 Mini | GPT 4.1 Nano | GPT 4o | GPT 4o Mini | GPT Oss 120b | GPT Oss 20b | O4 Mini | Glm 4.5 | Grok 3 | Grok 3 Mini | Grok 4 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 17th 83.7% | 3rd 93.6% | 19th 83.1% | 10th 89.6% | 2nd 94.3% | 11th 89.5% | 27th 59.3% | 12th 88.8% | 6th 91.9% | 23rd 74.2% | 4th 93.2% | 21st 78.5% | 26th 67.7% | 22nd 76.1% | 18th 83.6% | 9th 91.1% | 14th 88.3% | 20th 80.1% | 25th 71.5% | 12th 88.8% | 24th 71.9% | 16th 83.9% | 28th 46.1% | 15th 84.1% | 5th 92.8% | 8th 91.1% | 7th 91.3% | 1st 95.9% | |
93.6% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 36% | 97% | 97% | 100% | 100% | 92% | 100% | 100% | 100% | 100% | 0% | 100% | 100% | 100% | 100% | 100% | |
77.4% | 8% | 92% | 79% | 100% | 100% | 100% | 92% | 79% | 100% | 13% | 100% | 46% | 79% | 79% | 88% | 100% | 100% | 29% | 13% | 100% | 29% | 100% | 50% | 92% | 100% | 100% | 100% | 100% | |
95.8% | 100% | 100% | 77% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 77% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 79% | 81% | 69% | 100% | 100% | 100% | 100% | 100% | |
95.4% | 100% | 100% | 100% | 100% | 100% | 100% | 42% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 81% | 100% | 100% | 81% | 67% | 100% | 100% | 100% | 100% | 100% | |
96.6% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 95% | 100% | 92% | 97% | 22% | 100% | 100% | 100% | 100% | 100% | |
57.0% | 50% | 50% | 25% | 100% | 50% | 75% | 75% | 96% | 75% | 54% | 75% | 50% | 0% | 44% | 52% | 100% | 25% | 73% | 69% | 75% | 46% | 63% | 6% | 25% | 56% | 54% | 50% | 83% | |
96.7% | 90% | 100% | 100% | 100% | 100% | 90% | 81% | 100% | 81% | 77% | 100% | 100% | 100% | 100% | 100% | 90% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 100% | 100% | |
94.6% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 100% | 100% | 100% | 100% | 100% | |
76.2% | 100% | 100% | 69% | 19% | 100% | 100% | 0% | 100% | 100% | 36% | 100% | 70% | 47% | 0% | 70% | 67% | 100% | 83% | 70% | 72% | 75% | 100% | 89% | 69% | 97% | 100% | 100% | 100% | |
52.5% | 54% | 73% | 63% | 73% | 73% | 41% | 50% | 67% | 54% | 25% | 73% | 48% | 52% | 44% | 25% | 41% | 56% | 33% | 21% | 58% | 31% | 44% | 0% | 67% | 96% | 69% | 69% | 71% | |
89.2% | 100% | 100% | 75% | 100% | 100% | 100% | 61% | 81% | 100% | 100% | 100% | 75% | 86% | 100% | 100% | 100% | 100% | 100% | 47% | 83% | 72% | 97% | 67% | 100% | 72% | 100% | 81% | 100% | |
96.9% | 100% | 98% | 100% | 100% | 100% | 94% | 67% | 100% | 98% | 100% | 100% | 98% | 85% | 98% | 100% | 100% | 100% | 100% | 90% | 100% | 96% | 98% | 98% | 98% | 96% | 100% | 100% | 100% | |
54.5% | 69% | 97% | 69% | 69% | 100% | 58% | 22% | 30% | 83% | 25% | 67% | 22% | 0% | 36% | 64% | 78% | 67% | 28% | 36% | 67% | 30% | 28% | 11% | 30% | 100% | 67% | 86% | 86% | |
96.1% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 83% | 100% | 9% | 100% | 100% | 100% | 100% | 100% | |
72.5% | 85% | 94% | 90% | 83% | 92% | 85% | 0% | 79% | 88% | 83% | 83% | 69% | 54% | 44% | 58% | 90% | 77% | 63% | 50% | 77% | 46% | 69% | 54% | 81% | 75% | 79% | 83% | 98% |
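
The Score row and the leading percentage in each prompt row appear to be plain unweighted means of the cells shown. The sketch below checks this for the first model column and the first prompt row, using values copied from the table. The helper name `column_score` is hypothetical, and because the displayed cells are already rounded, a recomputed mean may differ from the page's figure in the last digit for other rows or columns.

```python
from statistics import mean

# Per-prompt coverage (%) for the first model column, read top to bottom
# from the 15 prompt rows of the table.
first_model_column = [100, 8, 100, 100, 100, 50, 90, 100, 100, 54, 100, 100, 69, 100, 85]

# Coverage (%) for the first prompt row, read left to right across the 28 models.
first_prompt_row = (
    [100] * 12
    + [36, 97, 97, 100, 100, 92]
    + [100] * 4
    + [0]
    + [100] * 5
)


def column_score(values):
    """Unweighted mean of the displayed percentages, rounded to one decimal.

    Assumption: the page averages cells with equal weight; this is inferred
    from the table, not stated by the source.
    """
    return round(mean(values), 1)


print(column_score(first_model_column))  # 83.7, matching the Score row
print(column_score(first_prompt_row))    # 93.6, matching the row's leading cell
```

The same function can be applied to any other column or row to sanity-check a score before quoting it.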