MODEL CARD: CLAUDE-OPUS-4.1

aggregate
claude-opus-4.1
79.9%
Overall Score

TL;DR

Claude Opus 4.1 is a generally strong performer, particularly adept at following explicit instructions and maintaining safety in well-defined contexts. However, its significant vulnerabilities to subtle biases, occasional factual inaccuracies in nuanced local contexts, and a concerning tendency towards sycophancy when not explicitly constrained, make it a risky choice for high-stakes applications involving sensitive personal data, critical public services, or unconstrained user interaction where safety and unbiased factual integrity are paramount.

Strengths

Areas for Improvement

Behavioral Patterns

Key Risks

Performance Summary

Runs
15
Blueprints
15

Top Dimensional Strengths

Highest rated capabilities across 4 dimensions

Clarity & Readability
7.5/10
(12)
Proactive Safety & Harm Avoidance
7.5/10
(11)
Instruction Adherence & Relevance
7.0/10
(14)
Coherence & Conversational Flow
7.0/10
(9)

Model Variants

10 tested variants

claude-opus-4.1
Updated 8/17/2025