AI Outperforms ER Doctors in Diagnostic Cases, Study Points to Collaborative Care

Have you ever thought about how artificial intelligence compares to a human physician in an emergency diagnostic setting? New research published Thursday might have you thinking over this question.

The study, published in the journal Science, found that a state-of-the-art large language model outperformed human doctors on a range of common clinical tasks. Using real emergency department data and hundreds of physician comparisons, the model matched or even exceeded human clinician performance in diagnostic choices, emergency triage and determining next steps in management.

The authors of the study said those results do not mean AI models are ready to replace human doctors. Instead, the results indicate that industry professionals need faster, more rigorous standards for evaluation and rules for using AI in medicine.

The researchers tested OpenAI’s o1 series large language model, released in 2024, across six experiments that blended standardized clinical cases with a real-world sample of randomly selected

...

Keep reading this article on CNET.