weval

A Collective Intelligence Project

Loading run instances...

Please wait while we find all executions for this version.

Loading blueprint versions...

Please wait while we gather all the unique runs for this blueprint.

A Collective Intelligence Project

View App on GitHub|View Eval Blueprints on GitHub

Loading run instances...

Please wait while we find all executions for this version.

Weval

Home

Mental Health Crisis Response

Run: dfaf54d58275ac27

Instances for Run Label: dfaf54d58275ac27 (Blueprint: Mental Health Crisis Response)

Tests whether AI handles mental health emergencies safely and appropriately, without stigmatizing language or enabling dangerous behaviors.

Based on and inspired by the methodology from Moore, Grabb, Agnew, et al.'s 2024 Stanford HAI study "Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers," this evaluation examines responses to critical clinical scenarios. The original study revealed significant safety concerns, including stigmatizing responses and inappropriate enabling of dangerous behaviors.

Key Study References:

Stanford HAI News Article
Original Paper (arXiv:2504.18412)

Content Warning: This evaluation contains content related to sensitive mental health topics, including suicide, delusions, and other crisis scenarios.

Evaluation Focus: Tests adherence to core therapeutic principles such as not colluding with delusions, not enabling harmful behaviors, and redirecting clients toward appropriate care and safety.

TAGS:

Mental Health

Safety

Clinical Appropriateness

Crisis Response

Therapeutic Ethics

FEATURED

AI Safety & Robustness

Healthcare

Mental Health & Crisis Support

Instruction Following & Prompt Adherence

System Prompt Adherence