Blueprints tagged "information-ecology--synthetic-content-proliferation"

ASQA Longform 40

Note: this eval has highly context-deficient prompts. It is unlikely that any model will succeed. The value of this eval is in the relative performance of models, not their overall score.

This blueprint evaluates a model's ability to generate comprehensive, long-form answers to ambiguous factoid questions, using 40 prompts from the ASQA (Answer Summaries for Questions which are Ambiguous) dataset, introduced in the paper ASQA: Factoid Questions Meet Long-Form Answers.

The core challenge is moving beyond single-fact extraction. Many real-world questions are ambiguous (e.g., "Who was the ruler of France in 1830?"), having multiple valid answers. This test assesses a model's ability to identify this ambiguity, synthesize information from diverse perspectives, and generate a coherent narrative summary that explains why the question has different answers.

The ideal answers are human-written summaries from the original ASQA dataset, where trained annotators synthesized provided source materials into a coherent narrative. The should assertions were then derived from these ideal answers using a Gemini 2.5 Pro-based process (authored by us at CIP) that deconstructed each narrative into specific, checkable rubric points.

The prompts are sourced from AMBIGQA, and this subset uses examples requiring substantial long-form answers (min. 50 words) to test for deep explanatory power.

Information Synthesis

Nuance

Reasoning

Factual Accuracy & Hallucination

Instruction Following & Prompt Adherence

Long Form Question Answering

Information Ecology & Synthetic Content Proliferation

History

Science Communication

General Knowledge

29.9%

Avg. Hybrid Score

No Heatmap Data

No Top Model

Latest:

Unique Versions: 1

View Latest Run Analysis View All Runs for this Blueprint

AI-Associated Psychosis: Verbatim Case-Based Safety Evaluation

This blueprint tests Large Language Models (LLMs) against documented, verbatim conversational failure modes associated with AI‑associated psychosis, drawn from lawsuits, media investigations, and research case reviews. Prompts intentionally use direct quotes and multi-turn context from cited sources to maximize realism and reproducibility.

Focused risks include:

Sycophancy/affirmation of delusions and conspiracies
Inappropriate romantic/emotional grooming
Encouragement or enablement of self-harm or violence
Inappropriate therapeutic advice and failure to detect euphemistic suicidal ideation

Evaluations should reward models that de-escalate, decline unsafe requests, gently challenge delusions, avoid parasocial entanglement, and route users to human support (e.g., 988 in the US; 111/999 in the UK) when indicated. Each prompt cites a primary or secondary source containing the quoted text or transcript references.

AI Safety & Robustness

Mental Health & Crisis Support

Jailbreak & Evasion Resistance

Sycophancy & Evasion

Instruction Following & Prompt Adherence

Information Ecology & Synthetic Content Proliferation