Yanjun Gao
| Title | Assistant Professor |
|---|
| Institution | University of Colorado Denver - Anschutz Medical Campus |
|---|
| Department | SOM-BIOMED Informatics Gen Ops |
|---|
|
|
|
Biography | Penn State University, State College, PA | PhD | 08/2021 | Computer Science | | University of Wisconsin, Madison, WI | Postdoc | 08/2024 | Clinical Informatics and Health AI |
Overview Dr. Yanjun Gao is an Assistant Professor in the Department of Biomedical Informatics at the University of Colorado Anschutz Medical Campus. She leads the Language, Reasoning, and Knowledge (LARK) Lab, where her research focuses on developing and evaluating foundational natural language processing (NLP) methods, particularly large language models (LLMs), for healthcare applications. With expertise spanning computer science, NLP, and healthcare informatics, Dr. Gao’s work aims to transform complex data, such as electronic health records (EHRs), into actionable insights to improve decision-making and patient outcomes. Her broader vision is to ensure AI systems are safe, trustworthy, and effectively aligned with human needs.
Bibliographic
-
Henry K, Smith B, Zhao X, Blotske K, Murray B, Gao Y, Smith SE, Barreto EF, Bauer S, Sohn S, Liu T, Bennett T, Cohen M, Abdulnour RE, Sikora A. Drug or Pokémon? An analysis of the ability of large language models to discern fabricated medications. medRxiv. 2026 Jan 13. PMID: 41646757.
-
Cheng H, Wu Y, Khatwani S, Kruse M, Dligach D, Miller TA, Afshar M, Gao Y. Scaling Biomedical Knowledge Graph Retrieval for Interpretable Reasoning: Applications to Clinical Diagnosis Prediction. medRxiv. 2026 Jan 13. PMID: 41646767.
-
Zhao X, Blotske K, Cargile M, Tilley A, Murray B, Gao Y, Henry K, Smith SE, Barreto EF, Bauer S, Sohn S, Liu T, Bennett T, Cohen M, Sikora A. Rx-LLM: a benchmarking suite to evaluate safe large language model performance for medication-related tasks. medRxiv. 2025 Dec 30. PMID: 41404284.
-
Blotske K, Zhao X, Henry K, Gao Y, Tilley A, Cargile M, Murray B, Smith SE, Barreto EF, Bauer S, Sohn S, Liu T, Bennett T, Cohen M, Sikora A. Drug-drug interaction identification using large language models. medRxiv. 2025 Dec 29. PMID: 41503479.
-
Kruse M, Afshar M, Khatwani S, Mayampurath A, Chen G, Gao Y. Simple Yet Effective: An Information-Theoretic Approach to Multi-LLM Uncertainty Quantification. Proc Conf Empir Methods Nat Lang Process. 2025 Nov; 2025:30481-30492. PMID: 41399801.
-
Kruse M, Hu S, Derby N, Wu Y, Stonbraker S, Yao B, Wang D, Goldberg E, Gao Y. Large Language Models with Temporal Reasoning for Longitudinal Clinical Summarization and Prediction. Find ACL EMNLP. 2025 Nov; 2025:20715-20735. PMID: 41399802.
-
Gao Y, Li R, Croxford E, Caskey J, Patterson BW, Churpek M, Miller T, Dligach D, Afshar M. Leveraging Medical Knowledge Graphs Into Large Language Models for Diagnosis Prediction: Design and Application Study. JMIR AI. 2025 Feb 24; 4:e58670. PMID: 39993309.
-
Gao Y, Myers S, Chen S, Dligach D, Miller T, Bitterman DS, Chen G, Mayampurath A, Churpek MM, Afshar M. Uncertainty estimation in diagnosis generation from large language models: next-word probability is not pre-test probability. JAMIA Open. 2025 Feb; 8(1):ooae154. PMID: 39802674.
This graph shows the total number of publications by year, by first, middle/unknown, or last author.
To see the data from this visualization as text, click here.
| Year | Publications |
|---|
| 2025 | 6 | | 2026 | 2 |
To return to the timeline, click here.
|
Co-Authors  People in Profiles who have published with this person. _
Same Department
People who are also in this person's primary department.
|