Cameron Rohn · Episode: EP 6 - Agentic Medical AI, Claude’s Desktop Tools & The OpenRouter Mystery Model · Category: frameworks_and_exercises
Researchers built SDBench, a 304-case sequential diagnosis benchmark drawn from New England Journal of Medicine case proceedings, to evaluate AI agents that reach a diagnosis iteratively.
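
To make "sequential" concrete, here is a minimal sketch of how one case in a benchmark of this kind might be scored. It is not the researchers' implementation. The names `Case`, `Result`, `run_case`, and `toy_agent` are hypothetical, the example case is invented for illustration, and exact-match scoring of the final diagnosis is an assumption made to keep the sketch short.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Case:
    opening: str               # short initial presentation shown to the agent
    findings: dict[str, str]   # hidden details, revealed only when requested
    diagnosis: str             # ground-truth final diagnosis

@dataclass
class Result:
    correct: bool
    steps: int

# Hypothetical agent interface: given the transcript so far, return
# ("ask", <finding name>) to request information or
# ("diagnose", <label>) to commit to a final answer.
Agent = Callable[[list[str]], tuple[str, str]]

def run_case(case: Case, agent: Agent, max_steps: int = 10) -> Result:
    """Step through one case until the agent diagnoses or runs out of steps."""
    transcript = [case.opening]
    for step in range(1, max_steps + 1):
        kind, value = agent(transcript)
        if kind == "diagnose":
            # Assumption: exact, case-insensitive match counts as correct.
            return Result(value.lower() == case.diagnosis.lower(), step)
        # Reveal the requested finding, or say it is unavailable.
        answer = case.findings.get(value, "not available")
        transcript.append(f"{value}: {answer}")
    return Result(False, max_steps)  # never committed to a diagnosis

def toy_agent(transcript: list[str]) -> tuple[str, str]:
    """Invented agent: orders one test, then commits to a diagnosis."""
    if len(transcript) == 1:
        return ("ask", "chest x-ray")
    return ("diagnose", "pneumonia")

toy_case = Case(
    opening="Fever and cough for three days.",
    findings={"chest x-ray": "right lower lobe consolidation"},
    diagnosis="Pneumonia",
)
result = run_case(toy_case, toy_agent)
```

In this toy run the agent requests one finding and then diagnoses, so the result records a correct answer reached in two steps. Tracking the step count alongside correctness is what separates an iterative evaluation from a single-shot question-answer test.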