LLM-Driven Literature Review Generation in Medicine: A Rapid and Reproducible Pipeline for CRSwNP Research

Arash Ziaee; Mehdi Bakhshaee; Mohammad Salehi-Marzijarani

Published in: 2nd International Congress on Artificial Intelligence in Medical Sciences

Year of publication: 1404 (Solar Hijri)

Document type: Conference paper

Language: English

The full text of this paper has not been published; only the abstract or extended abstract is available in the database.

Permanent link to this paper:

https://civilica.com/doc/2311369

National scientific document ID:

AIMS02_650

Indexing date: 29 Tir 1404

Abstract:

Background and Aims: Synthesizing extensive biomedical literature into comprehensive reviews is traditionally time-consuming and labor-intensive. Large Language Models (LLMs) with million-token context windows offer a way to automate information extraction and drafting, but standardized, reproducible protocols are needed. We aimed to develop and demonstrate a reproducible, human-supervised, end-to-end LLM-driven pipeline capable of generating a comprehensive, citation-accurate academic literature review on the research question: the role of vitamin D in the pathogenesis of chronic rhinosinusitis with nasal polyps (CRSwNP).

Methods: A corpus of 60 relevant English-language full-text research papers was manually curated by an otolaryngology faculty panel. Because significant format variability hindered direct LLM ingestion, the PDFs were batch-converted to Markdown using the open-source Marker package, which internally uses Gemini 2.0 Flash for layout-aware extraction. The resulting concatenated Markdown corpus (~700 pages; ~700,000 tokens) was processed within a single context window using Gemini 2.5 Pro (1M-token capacity) via Google AI Studio. All prompts were formatted according to Google's Prompt Engineering book. A single prompt was deployed in sequential stages: the LLM first generated a hierarchical outline, which the authors reviewed and approved. Then, using the approved structure and the corpus, the LLM drafted the manuscript with deterministic settings (Temperature=0, Top-P=0.95) to ensure reproducibility and mitigate hallucination, drawing strictly on the provided information and generating inline author-date citations. Human oversight occurred at three critical checkpoints: outline approval, first-draft appraisal, and targeted gap-filling. Supplementary literature identified during gap-filling underwent the same PDF-to-Markdown processing and was used for LLM-assisted section-level revisions. Final reference management was handled manually in EndNote, followed by scientific proofreading.

Results: The pipeline yielded a cohesive, structured, 4400-word draft literature review manuscript containing 38 correctly formatted inline citations derived solely from the provided corpus. This was achieved in less than two working days with no direct API cost.

Conclusion: This proof-of-concept establishes a scalable methodology that integrates high-fidelity PDF-to-Markdown conversion, large-context LLM synthesis within a single call, deterministic prompting, and staged human oversight. It demonstrates the potential for rapid, reproducible generation of domain-specific literature reviews directly from primary source material, significantly accelerating evidence synthesis in specialized biomedical
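The corpus-assembly and citation-checking steps described in the Methods could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the `Paper` class, the `<<<SOURCE: ...>>>` delimiter, the four-characters-per-token heuristic, the 50,000-token reserve, and all function names are hypothetical. Only the 1M-token limit and the sampling settings come from the abstract. The sketch does not call Marker or Gemini; it assumes the Markdown has already been produced.

```python
import re
from dataclasses import dataclass

# Assumption: ~4 characters per token is a rough rule of thumb for English
# text. It is NOT the tokenizer the model actually uses.
CHARS_PER_TOKEN = 4
CONTEXT_LIMIT = 1_000_000  # 1M-token capacity reported in the abstract

# Sampling settings reported in the abstract, recorded here for reference.
GENERATION_SETTINGS = {"temperature": 0.0, "top_p": 0.95}


@dataclass
class Paper:
    citation_key: str  # hypothetical author-date label, e.g. "Smith 2020"
    markdown: str      # Markdown text from the PDF conversion step


def build_corpus(papers):
    """Concatenate papers into one string, each under a labelled delimiter."""
    parts = [
        f"<<<SOURCE: {p.citation_key}>>>\n{p.markdown.strip()}\n"
        for p in papers
    ]
    return "\n".join(parts)


def estimate_tokens(text):
    """Rough token estimate from character count (heuristic, see above)."""
    return len(text) // CHARS_PER_TOKEN


def fits_in_context(text, reserve=50_000):
    """True if the estimate leaves `reserve` tokens for prompt and output."""
    return estimate_tokens(text) + reserve <= CONTEXT_LIMIT


# Matches a parenthesised group that ends in a four-digit year,
# e.g. "(Smith, 2020)" or "(Smith, 2020; Lee 2019)".
CITATION_RE = re.compile(r"\(([^()]*?\d{4}[a-z]?)\)")


def unknown_citations(draft, papers):
    """Return inline author-date citations that match no paper in the corpus."""
    known = {p.citation_key for p in papers}
    found = set()
    for group in CITATION_RE.findall(draft):
        for item in group.split(";"):
            found.add(item.strip().replace(",", ""))
    return sorted(found - known)
```

A check of this kind could support the first-draft appraisal checkpoint: any citation returned by `unknown_citations` would point to a reference that does not map back to the supplied corpus and would need human review.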

Authors

Arash Ziaee

Sinus and Surgical Endoscopic Research Center, Mashhad University of Medical Sciences, Mashhad, Iran

Mehdi Bakhshaee

Sinus and Surgical Endoscopic Research Center, Mashhad University of Medical Sciences, Mashhad, Iran

Mohammad Salehi-Marzijarani

Cancer Epidemiology Research Center and Clinical Trial Center, AJA University of Medical Sciences, Tehran, Iran