UCSF AI Publications
MedEvalArena: A Self-Generated, Peer-Judged Benchmark for Medical Reasoning
medRxiv,
Kie Shidara, Esmé Wheeler, Danilo Bernardo [last], and 4 more from outside UCSF
Altmetric score: 1Text-Driven Tumor Synthesis
IEEE Transactions on Medical Imaging,
Kang Wang, Yang Yang, and 12 more from outside UCSF
Related to: Biomedical Imaging, Cancer
Evaluation of large language models for diagnostic impression generation from brain MRI report findings: a multicenter benchmark and reader study
npj Digital Medicine,
Xue Wu, and 11 more from outside UCSF
Related to: Clinical Research, Neurosciences, Biomedical Imaging
Large Language Models in Radiologic Numerical Tasks: A Thorough Evaluation and Error Analysis
Journal of Imaging Informatics in Medicine,
Ali Nowroozi [first], Masha Bondarenko, Adrian Serapio, Tician Schnitzler, Jae Ho Sohn [last], and 1 more from outside UCSF
Related to: Biomedical Imaging
Large Language Models in Radiologic Numerical Tasks: A Thorough Evaluation and Error Analysis
Journal of Imaging Informatics in Medicine,
Ali Nowroozi [first], Masha Bondarenko, Adrian Serapio, Tician Schnitzler, Jae Ho Sohn [last], and 1 more from outside UCSF
Related to: Biomedical Imaging
The Importance of Artificial Intelligence Literacy in Nursing and Midwifery to Achieve the Sustainable Development Goals: Perspectives From the 80th Session of the United Nations General Assembly
Journal of Nursing Scholarship,
Jerry John Ouner, and 8 more from outside UCSF
Related to: Machine Learning and Artificial Intelligence
Agentic AI in Radiology: Evolution from Large Language Models to Future Clinical Integration.
Radiology Artificial Intelligence,
Ali Tejani, and 8 more from outside UCSF
Altmetric score: 6Related to: Biomedical Imaging, Bioengineering, Networking and Information Technology R&D (NITRD), Clinical Research
Quantifying the impact of a computer‐aided diagnostic score on the clinical diagnosis of functional seizures
Epilepsia,
Adriana Y. Koek, Di Sun, and 37 more from outside UCSF
Altmetric score: 16Related to: Machine Learning and Artificial Intelligence, Neurosciences, Brain Disorders, Neurodegenerative, Clinical Research, Epilepsy
Application and validation of AI-driven methods to explore patient experiences of pre-cervical cancer
European Journal of Obstetrics & Gynecology and Reproductive Biology,
Christopher Y K Williams [last], and 1 more from outside UCSF
Related to: Cervical Cancer, Women's Health, Cancer, Networking and Information Technology R&D (NITRD), Clinical Research, Machine Learning and Artificial Intelligence
Knowledge graph representation of the mappings between seizure semiology and epileptogenic zones
Scientific Reports,
Danilo Bernardo, and 14 more from outside UCSF
Altmetric score: 3Related to: Brain Disorders, Networking and Information Technology R&D (NITRD), Neurodegenerative, Epilepsy, Neurosciences
Generative Artificial Intelligence Successfully Automates Data Extraction From Unstructured Magnetic Resonance Imaging Reports: Feasibility in Prostate Cancer Care
JCO Clinical Cancer Informatics,
Anobel Y. Odisho, Andrew W. Liu, William A. Pace, Marvin N. Carlisle, Robert Krumm, Janet E. Cowan, Peter R. Carroll, Matthew R. Cooperberg
Altmetric score: 3Related to: Biomedical Imaging, Cancer, Bioengineering, Aging, Urologic Diseases, Machine Learning and Artificial Intelligence, Prostate Cancer
Can large language models extract operative standards from narrative operative reports in rectal cancer?
Surgery,
Karen Trang [first], Beiqun Zhao, Colleen P Flanagan, Logan Pierce, Elizabeth Wick [last], and 3 more from outside UCSF
Related to: Colo-Rectal Cancer, Cancer, Digestive Diseases, Rare Diseases
The Utility of Artificial Intelligence Platforms for Post‐Operative Mohs Micrographic Surgery Questions: A Blinded Expert Panel Evaluation
International Journal of Dermatology,
Siegrid S. Yu, and 9 more from outside UCSF
Altmetric score: 11Related to: Bioengineering, Networking and Information Technology R&D (NITRD), Machine Learning and Artificial Intelligence, Patient Safety
Are ChatGPT Answers to Patient Questions Regarding Fecal Incontinence Accurate, Complete, and Consistent With the American Society of Colorectal Surgeons Clinical Practice Guidelines?
Diseases of the Colon & Rectum,
Karen Trang, Elizabeth C Wick, and 10 more from outside UCSF
Related to: Colo-Rectal Cancer, Clinical Trials and Supportive Activities, Clinical Research, Health Disparities, Digestive Diseases, Cancer, Health Disparities and Racial or Ethnic Minority Health Research
Evaluating AI-based comprehensive clinical decision support for sepsis and ARDS: protocol for a Clinician Turing Test
BMJ Open,
Romain Pirracchio, and 12 more from outside UCSF
Altmetric score: 2Related to: Acute Respiratory Distress Syndrome, Clinical Research, Clinical Trials and Supportive Activities, Rare Diseases, Networking and Information Technology R&D (NITRD), Hematology, Infectious Diseases, Sepsis, Lung, Patient Safety, Machine Learning and Artificial Intelligence
Evaluating AI-based comprehensive clinical decision support for sepsis and ARDS: protocol for a Clinician Turing Test
BMJ Open,
Romain Pirracchio, and 12 more from outside UCSF
Altmetric score: 2Related to: Sepsis, Acute Respiratory Distress Syndrome, Clinical Trials and Supportive Activities, Networking and Information Technology R&D (NITRD), Clinical Research, Machine Learning and Artificial Intelligence, Infectious Diseases, Hematology, Rare Diseases, Lung, Patient Safety
Comparing computable structured phenotype- versus large language model-identification of opioid use disorder using electronic health record data
medRxiv,
Melanie Molina [first], Cynthia Fenton, Kathy T. LeSaint, Aaron E. Kornblith [last], and 1 more from outside UCSF
Altmetric score: 1Related to: Brain Disorders, Opioid Misuse and Addiction, Opioids, Drug Abuse (NIDA only), Substance Misuse, Emergency Care
Integrating a host biomarker with a large language model for diagnosis of lower respiratory tract infection
Nature Communications,
Hoang Van Phan, Natasha Spottiswoode, Emily C. Lydon, Victoria T. Chu, Adolfo Cuesta, Alexander D. Kazberouk, Natalie L. Richmond, Padmini Deosthale, Carolyn S. Calfee, Charles R. Langelier
Altmetric score: 65Related to: Rare Diseases, Lung, Infectious Diseases, Clinical Research
Integrating a host biomarker with a large language model for diagnosis of lower respiratory tract infection
Nature Communications,
Hoang Van Phan, Natasha Spottiswoode, Emily C. Lydon, Victoria T. Chu, Adolfo Cuesta, Alexander D. Kazberouk, Natalie L. Richmond, Padmini Deosthale, Carolyn S. Calfee, Charles R. Langelier
Altmetric score: 65Related to: Rare Diseases, Clinical Research, Infectious Diseases, Lung
Evaluating the reliability of large language models in answering FAQs for cataract surgery
Digital Health,
Hassan Asadigandomani, and 4 more from outside UCSF
Related to: Clinical Research, Patient Safety
Evaluating the reliability of large language models in answering FAQs for cataract surgery
Digital Health,
Hassan Asadigandomani, and 4 more from outside UCSF
Related to: Patient Safety, Clinical Research
Machine learning-based mortality prediction in critically ill patients with hypertension: comparative analysis, fairness, and interpretability
Frontiers in Artificial Intelligence,
Sirui Ding, and 3 more from outside UCSF
Cited 1 timesRelated to: Machine Learning and Artificial Intelligence, Bioengineering, Hypertension, Patient Safety, Cardiovascular, Brain Disorders, Networking and Information Technology R&D (NITRD), Data Science
Machine learning-based mortality prediction in critically ill patients with hypertension: comparative analysis, fairness, and interpretability
Frontiers in Artificial Intelligence,
Sirui Ding, and 3 more from outside UCSF
Altmetric score: 1
Cited 3 timesRelated to: Networking and Information Technology R&D (NITRD), Data Science, Brain Disorders, Cardiovascular, Bioengineering, Hypertension, Patient Safety, Machine Learning and Artificial Intelligence
All That Shines Is Not Gold: Maintaining Scientific Rigor When Evaluating, Interpreting, and Reviewing Studies Using Large Language Models
Anesthesiology,
Tyler Law, Teva Brender, Hunter Mills, Edie Espejo, Arthur W. Wallace, Julien Cobert [last], and 2 more from outside UCSF
Altmetric score: 1Automating expert-level medical reasoning evaluation of large language models
npj Digital Medicine,
Yuen-Hei Chung, and 18 more from outside UCSF
Altmetric score: 1
Cited 1 timesZero-Shot PI-RADS Version 2.1 Scoring with ChatGPT-4 Turbo and Llama 3: Diagnostic Performance and Agreement with Abdominal Radiologists.
Radiology Imaging Cancer,
Spencer Behr, and 4 more from outside UCSF
Altmetric score: 1
Cited 1 timesRelated to: Biomedical Imaging, Prostate Cancer, Urologic Diseases, Cancer
Review of Artificial Intelligence for Clinical Use in Alzheimer's Disease and Related Dementias
Seminars in Neurology,
Andrew G. Breithaupt [first], Alice Tang, Emily W. Paolillo, Rowan Saloner, Katherine L. Possin, Charles C. Windon, Tanisha G. Hill-Jarrett, Andreas M. Rauschecker, Jet M. J. Vonk, Pedro Pinheiro-Chagas [last], and 4 more from outside UCSF
Altmetric score: 7Related to: Alzheimer's Disease including Alzheimer's Disease Related Dementias (AD/ADRD), Acquired Cognitive Impairment, Neurosciences, Aging, Networking and Information Technology R&D (NITRD), Dementia, Brain Disorders, Machine Learning and Artificial Intelligence, Neurodegenerative, Prevention, Alzheimer's Disease, Clinical Research, Bioengineering, Behavioral and Social Science, Health Services
Human level information extraction from clinical reports with finetuned language models
Scientific Reports,
Aidan Pace, Elaine Kim, Peter R. Carroll, Anobel Y. Odisho, Maggie Chung, Adam Yala [last], and 9 more from outside UCSF
Altmetric score: 3
Cited 1 timesRelated to: Clinical Research, Networking and Information Technology R&D (NITRD), Bioengineering
Health Equity Considerations in the Age of Artificial Intelligence
Neurology,
Noriko Anderson, and 9 more from outside UCSF
Altmetric score: 48Related to: Behavioral and Social Science, Machine Learning and Artificial Intelligence, Health Disparities and Racial or Ethnic Minority Health Research, Minority Health, Networking and Information Technology R&D (NITRD), Clinical Research, Data Science, Health Disparities, Patient Safety, Bioengineering, Social Determinants of Health
Amplifying signal-to-noise: Responsible use of large language models in radiology publishing
Clinical Imaging,
Ali S Tejani, and 3 more from outside UCSF
Uncertainty-aware large language models for explainable disease diagnosis
npj Digital Medicine,
Yuen-Hei Chung, and 13 more from outside UCSF
Altmetric score: 2
Cited 3 timesRelated to: Machine Learning and Artificial Intelligence, Bioengineering
Integrating expert knowledge into large language models improves performance for psychiatric reasoning and diagnosis
Psychiatry Research,
Karthik V Sarma, Kaitlin E Hanss, Andrew J M Halls, Andrew Krystal, Daniel F Becker, Anne L Glowinski, Atul J Butte
Altmetric score: 27Related to: Clinical Research
A large language model-based approach to quantifying the effects of social determinants in liver transplant decisions
npj Digital Medicine,
Emily Robitschek [first], Asal Bastani, Kathryn Horwath, Savyon Sordean, Mark J. Pletcher, Jennifer C. Lai, Jin Ge, Irene Y. Chen [last], and 2 more from outside UCSF
Cited 1 timesRelated to: Transplantation, Minority Health, Health Disparities, Health Disparities and Racial or Ethnic Minority Health Research, Liver Disease, Digestive Diseases, Mental Health, Basic Behavioral and Social Science, Social Determinants of Health, Substance Misuse, Prevention, Clinical Research, Behavioral and Social Science, Health Services
Extracting TNFi switching reasons and trajectories from real-world data using large language models
JAMIA Open,
Brenda Y Miao [first], Marie Binvignat, Augusto Garcia-Agundez, Christopher Yk Williams, Claire Q Miao, Ahmed Alaa, Vivek Rudrapatna, Atul J Butte, Gabriela Schmajuk, Jinoos Yazdany [last], and 1 more from outside UCSF
Related to: Clinical Research, Patient Safety
Limitations of large language models in clinical problem-solving arising from inflexible reasoning
Scientific Reports,
Kie Shidara, Danilo Bernardo [last], and 4 more from outside UCSF
Altmetric score: 4
Cited 8 timesLarge Language Models can Identify the Presence of MASH and Extract VCTE Measurements from Unstructured Documentation
Digestive Diseases and Sciences,
Aryana T. Far, Aryan Ayati, Jordan Guillot, Shadera Azzam, Vivek A. Rudrapatna, Jin Ge
Related to: Clinical Research
BioAgents: Bridging the gap in bioinformatics analysis with multi-agent systems
Scientific Reports,
Ahmed Alaa, and 8 more from outside UCSF
Altmetric score: 4
Cited 5 timesRelated to: Genetics, Biotechnology, Networking and Information Technology R&D (NITRD), Human Genome
BioAgents: Bridging the gap in bioinformatics analysis with multi-agent systems
Scientific Reports,
Ahmed Alaa, and 8 more from outside UCSF
Altmetric score:Related to: Human Genome, Genetics, Networking and Information Technology R&D (NITRD), Biotechnology
Adolescent Health and Generative AI—Risks and Benefits
JAMA Pediatrics,
Jason M. Nagata [first], Zain Memon, Oliver Huang, and 1 more from outside UCSF
Altmetric score: 24Related to: Behavioral and Social Science, Sleep Research, Basic Behavioral and Social Science, Machine Learning and Artificial Intelligence, Mental Health, Pediatric Research Initiative, Physical Activity, Networking and Information Technology R&D (NITRD)
Large language models accurately extract aortic information from abdominal imaging reports in a large, real-world database
Journal of Vascular Surgery,
Colleen P Flanagan [first], Myra McLenon, Elizabeth M Lancaster, and 6 more from outside UCSF
Related to: Machine Learning and Artificial Intelligence, Cardiovascular, Biomedical Imaging, Bioengineering, Clinical Research, Rare Diseases, Networking and Information Technology R&D (NITRD)
Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM): 2025 Updates
Korean Journal of Radiology,
Ali S. Tejani, and 6 more from outside UCSF
Altmetric score: 4
Cited 2 timesAgentic Generative Artificial Intelligence System for Classification of Pathology-Confirmed Primary Progressive Aphasia Variants
medRxiv,
Chiara Gallingani, Zachary A Miller, Maria Luisa Mandelli, Howard J Rosen, Zoe Ezzes, Mia Lin, Diana Rodriguez, Lea T Grinberg, Salvatore Spina, William W Seeley, Bruce Miller, Maria Luisa Gorno-Tempini, Pedro Pinheiro-Chagas
Altmetric score: 3Related to: Rare Diseases, Frontotemporal Dementia (FTD), Alzheimer's Disease, Neurosciences, Acquired Cognitive Impairment, Biomedical Imaging, Bioengineering, Aging, Neurodegenerative, Alzheimer's Disease Related Dementias (ADRD), Brain Disorders, Dementia, Clinical Research, Alzheimer's Disease including Alzheimer's Disease Related Dementias (AD/ADRD), Machine Learning and Artificial Intelligence, Aphasia, Networking and Information Technology R&D (NITRD)
Artificial Intelligence Applications in Musculoskeletal Imaging
Current Reviews in Musculoskeletal Medicine,
Atlas Haddadi Avval, and 7 more from outside UCSF
Cited 1 timesRelated to: Bioengineering, Machine Learning and Artificial Intelligence, Biomedical Imaging, Networking and Information Technology R&D (NITRD)
Building an analytical framework for tobacco-related information on social media: an exploratory analysis with generative AI assistance
BMC Public Health,
Eileen Han [first], Pamela Ling [last], and 1 more from outside UCSF
Cited 1 timesRelated to: Tobacco, Cancer, Tobacco Smoke and Health, Substance Misuse, Behavioral and Social Science, Networking and Information Technology R&D (NITRD), Machine Learning and Artificial Intelligence
An Exploratory Typology of Tobacco-Related Misleading Content on Social Media: Qualitative Analysis of Instagram and TikTok
Journal of Medical Internet Research,
Eileen Han, Joanne Chen Lyu, Pamela M Ling
Altmetric score: 1Related to: Networking and Information Technology R&D (NITRD), Prevention, Machine Learning and Artificial Intelligence, Tobacco Smoke and Health, Behavioral and Social Science, Tobacco
Artificial Intelligence for Response Assessment in Pediatric Neuro-Oncology (AI-RAPNO), part 2: challenges, opportunities, and recommendations for clinical translation
The Lancet Oncology,
Michael Prados, Susan M Chang, Sabine Mueller, Javier E Villanueva-Meyer, and 19 more from outside UCSF
Altmetric score: 37
Cited 2 timesRelated to: Machine Learning and Artificial Intelligence, Data Science, Brain Cancer, Cancer, Pediatric Research Initiative, Bioengineering, Brain Disorders, Clinical Research, Networking and Information Technology R&D (NITRD), Pediatric Cancer, Neurosciences, Rare Diseases, Radiation Oncology, Clinical Trials and Supportive Activities
A community‐driven vision for a new knowledge resource for AI
AI Magazine,
Sharat Israni, and 31 more from outside UCSF
A community‐driven vision for a new knowledge resource for AI
AI Magazine,
Sharat Israni, and 31 more from outside UCSF
Altmetric score:A community‐driven vision for a new knowledge resource for AI
AI Magazine,
Sharat Israni, and 31 more from outside UCSF
Large Language Models in Population Oncology: A Contemporary Review on the Use of Large Language Models to Support Data Collection, Aggregation, and Analysis in Cancer Care and Research
JCO Clinical Cancer Informatics,
Ryzen Benson, Clodagh Kenny, Amir Ashraf Ganjouei, Michelle Zhao, Rami Darawsheh, Alexander Qian, Julian C. Hong
Altmetric score: 6Related to: Prevention, Cancer, Data Science, Networking and Information Technology R&D (NITRD)