RefAI: a GPT-powered retrieval-augmented generative tool for biomedical literature recommendation and summarization

Yiming Li, Jeff Zhao, Manqi Li, Yifang Dang, Evan Yu, Jianfu Li, Zenan Sun, Usama Hussein, Jianguo Wen, Ahmed M Abdelhameed, Junhua Mai, Shenduo Li, Yue Yu, Xinyue Hu, Daowei Yang, Jingna Feng, Zehan Li, Jianping He, Wei Tao, Tiehang DuanYanyan Lou, Fang Li, Cui Tao

Research output: Contribution to journalArticlepeer-review

Abstract

OBJECTIVES: Precise literature recommendation and summarization are crucial for biomedical professionals. While the latest iteration of generative pretrained transformer (GPT) incorporates 2 distinct modes-real-time search and pretrained model utilization-it encounters challenges in dealing with these tasks. Specifically, the real-time search can pinpoint some relevant articles but occasionally provides fabricated papers, whereas the pretrained model excels in generating well-structured summaries but struggles to cite specific sources. In response, this study introduces RefAI, an innovative retrieval-augmented generative tool designed to synergize the strengths of large language models (LLMs) while overcoming their limitations.

MATERIALS AND METHODS: RefAI utilized PubMed for systematic literature retrieval, employed a novel multivariable algorithm for article recommendation, and leveraged GPT-4 turbo for summarization. Ten queries under 2 prevalent topics ("cancer immunotherapy and target therapy" and "LLMs in medicine") were chosen as use cases and 3 established counterparts (ChatGPT-4, ScholarAI, and Gemini) as our baselines. The evaluation was conducted by 10 domain experts through standard statistical analyses for performance comparison.

RESULTS: The overall performance of RefAI surpassed that of the baselines across 5 evaluated dimensions-relevance and quality for literature recommendation, accuracy, comprehensiveness, and reference integration for summarization, with the majority exhibiting statistically significant improvements (P-values <.05).

DISCUSSION: RefAI demonstrated substantial improvements in literature recommendation and summarization over existing tools, addressing issues like fabricated papers, metadata inaccuracies, restricted recommendations, and poor reference integration.

CONCLUSION: By augmenting LLM with external resources and a novel ranking algorithm, RefAI is uniquely capable of recommending high-quality literature and generating well-structured summaries, holding the potential to meet the critical needs of biomedical professionals in navigating and synthesizing vast amounts of scientific literature.

Original languageEnglish (US)
Pages (from-to)2030-2039
Number of pages10
JournalJournal of the American Medical Informatics Association : JAMIA
Volume31
Issue number9
DOIs
StatePublished - Sep 1 2024

Keywords

  • Algorithms
  • Information Storage and Retrieval/methods
  • PubMed
  • Natural Language Processing

Fingerprint

Dive into the research topics of 'RefAI: a GPT-powered retrieval-augmented generative tool for biomedical literature recommendation and summarization'. Together they form a unique fingerprint.

Cite this