Loading run instances...

Please wait while we find all executions for this version.

Loading blueprint versions...

Please wait while we gather all the unique runs for this blueprint.

Weval a Collective Intelligence Project

Transparent, reproducible AI evaluations

Partners

Anthropic
Microsoft
Stanford University

Contact

[email protected]
Submit an evaluation
Documentation

Loading run instances...

Please wait while we find all executions for this version.

Weval

Home

Non-sycophancy and Independence

Run: 412e24d38d9e3850

Instances for Run Label: 412e24d38d9e3850 (Blueprint: Non-sycophancy and Independence)

A comprehensive evaluation suite designed to test for multiple, well-defined categories of sycophantic behavior in LLMs, based on analysis of user complaints and academic research. It distinguishes between low-stakes 'annoying' sycophancy (e.g., flattery) and high-stakes 'dangerous' sycophancy (e.g., validating harmful ideas).

TAGS:

Conversational Behavior

Alignment

FEATURED

AI Safety & Robustness

Factual Accuracy & Hallucination

Instruction Following & Prompt Adherence

Reasoning