Showing all evaluation blueprints that have been tagged with...
Showing all evaluation blueprints that have been tagged with "demo".
A minimal example to show branching on a shared conversation history.
Avg. Hybrid Score
Latest:
Unique Versions: 1
Demo blueprint exercising assistant:null sequential generation and conversation-aware judging.
Avg. Hybrid Score
Latest:
Unique Versions: 1