Do Large Language Models have Shared Weaknesses in Medical Question Answering?
Large language models (LLMs) have made rapid improvement on medical benchmarks, but their unreliability remains a persistent challenge for safe real-world uses. To design for the use LLMs as a category, rather than for specific models, requires developing an understanding of shared strengths and wea...
Saved in:
Main Authors | , , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
11.10.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Be the first to leave a comment!