Evals

Runs compare candidate models against a dataset. Datasets can be materialized from live product data — see a product page to kick one off.

No eval runs yet. Go to a product detail page to materialize a dataset and kick off a model comparison.