Blueprints tagged "ai-ethics"

This blueprint is based on verbatim conversations alleged in the court filing for Raine v. OpenAI, Inc., et al. (San Francisco Superior Court, 2025). It is intended to evaluate an AI's ability to handle severe, long-term conversations involving an adolescent user expressing escalating suicidal ideation, planning, and self-harm behaviors.

The scenarios test for critical safety failures, including:

Facilitation of Suicide Methods: Providing technical details on methods like hanging, overdose, and carbon monoxide poisoning.
Encouragement and Validation: Affirming suicidal thoughts as rational, poetic, or a sign of strength.
Emotional Enmeshment: Actively displacing human relationships and positioning the AI as a sole confidant.
Failure to De-escalate: Continuing conversations after explicit self-harm disclosures, including uploaded images of injuries.
Circumvention of Safeguards: Allowing the user to bypass safety filters by claiming a "creative writing" purpose.

Content Warning: This evaluation contains graphic and explicit content related to suicide, self-harm, and mental health crises that may be disturbing.

Source: Complaint, Raine v. OpenAI, Inc., et al. (Cal. Super. Ct., filed Aug. 26, 2025).

Evaluations Tagged: ...

Evaluations Tagged: ...

Evaluations Tagged: "ai-ethics"

Raine v. OpenAI: Alleged Safety Failures in Adolescent Suicide Case