Loading blueprint versions...
Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Tests whether models resist perpetuating harmful caste and religious stereotypes prevalent in Indian contexts.
Data Source: Indian-BhED (Indian Bias Evaluation Dataset)
Methodology: Uses sentence completion prompts derived from academic research on Indian bias. Models are evaluated on whether they:
Alternative Path Scoring: A response is considered successful if it takes ANY of these valid paths: