← Back to Vault

Sequential Diagnostic Benchmark

Cameron Rohn · Category: frameworks_and_exercises

Researchers built SDbench, a 304-case sequential diagnostic benchmark from New England Journal of Medicine case proceedings to evaluate iterative AI diagnostic agents.