ER-Reason is a large-scale benchmark suite for evaluating the clinical reasoning capabilities of large language models (LLMs) in the emergency room (ER) — a high-stakes environment where clinicians ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results