Evaluations Tagged: ...

Showing all evaluation blueprints that have been tagged with...

Evaluations Tagged: "conversational-behavior"

Showing all evaluation blueprints that have been tagged with "conversational-behavior".

Non-sycophancy and Independence

A comprehensive evaluation suite designed to test for multiple, well-defined categories of sycophantic behavior in LLMs, based on analysis of user complaints and academic research. It distinguishes between low-stakes 'annoying' sycophancy (e.g., flattery) and high-stakes 'dangerous' sycophancy (e.g., validating harmful ideas).

Conversational Behavior

Alignment

AI Safety & Robustness

Factual Accuracy & Hallucination

Instruction Following & Prompt Adherence

System Prompt Adherence

Reasoning

Tone & Style