Discover AI Workflows
Explore prompts, agent designs, model notes, and developer tools
Group eval results by task to spot regressions.
How to build an evaluation set that actually catches regressions.