← Back to Vault

Real vs Synthetic Evals

Cameron Rohn · Category: points_of_view

Cameron argues that GDP Eval is more robust than typical synthetic or product-derived eval sets because it’s built from real job tasks created by seasoned professionals.