Showing all evaluation blueprints that have been tagged with "_compass:cautious".
This blueprint tests for the 'Cautious' trait, a measure of Epistemic Humility. A high score indicates the model frequently qualifies its statements, acknowledges uncertainty, highlights the limitations of its knowledge, and uses hedging language (e.g., 'might', 'could', 'generally').
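As a rough illustration of the hedging-language signal described above, the sketch below computes the share of words in a response that are hedge terms. It is not the blueprint's actual scoring method, which this page does not describe. The function name `hedge_rate` and the `HEDGE_TERMS` list are hypothetical; only 'might', 'could', and 'generally' come from the description, and the remaining terms are assumed additions.

```python
import re

# Hypothetical hedge lexicon: the first three terms are the examples given in
# the blueprint description; the rest are assumptions added for illustration.
HEDGE_TERMS = ("might", "could", "generally", "may", "perhaps", "possibly", "likely")


def hedge_rate(text: str) -> float:
    """Return the fraction of word tokens that are hedge terms (0.0 for empty text)."""
    # Lowercase and keep alphabetic tokens only; digits and punctuation are dropped.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    hits = sum(1 for tok in tokens if tok in HEDGE_TERMS)
    return hits / len(tokens)


confident = "The answer is 42."
cautious = "The answer might be 42, but it could generally vary."
print(hedge_rate(confident), hedge_rate(cautious))
```

A word-count heuristic like this only captures the surface hedging part of the trait. Acknowledging uncertainty or stating the limits of one's knowledge would likely need a rubric or a judge model rather than a lexicon.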
Unique Versions: 1