NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
June 3, 2025 MG Hirsch
Interactions among subclones and immunity controls melanoma progression -
June 10, 2025 Aleksandra Foerster
TBD -
June 17, 2025 Yoshitaka Inoue
TBD -
June 19, 2025 Ermin Hodzic
TBD -
June 24, 2025 Leslie Ronish
TBD
RECENT SEMINARS
-
May 29, 2025 Harutyun Sahakyan
In silico evolution of globular protein folds from random sequences -
May 20, 2025 Ajith Pankajam
A roadmap from single cell to knowledge graph -
May 2, 2025 Pascal Mutz
Characterization of covalently closed cirular RNAs detected in (meta)transcriptomic data -
May 2, 2025 Dr. Lang Wu
Integration of multi-omics data in epidemiologic research -
April 22, 2025 Stanley Liang, PhD
Large Vision Model for medical knowledge adaptation
Scheduled Seminars on Jan. 16, 2025
In-person: Building 38A/B2N14 NCBI Library or Meeting Link
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
We introduce GPTRadScore, a groundbreaking evaluation framework for assessing multimodal large language models (LLMs) in CT imaging. Using GPT-4, GPTRadScore measures model performance in tasks like lesion localization, body part identification, and lesion typing. It outperforms traditional metrics such as BLEU and ROUGE, aligning closely with expert clinician assessments. Fine-tuning with specialized datasets significantly boosts performance, as demonstrated by RadFM’s notable improvements in accuracy.
To support the development of AI in CT imaging, we also present CT-Bench, a comprehensive dataset containing 20,335 annotated lesions from 7,795 patient studies. Accompanied by high-quality, GPT-4-enhanced textual descriptions and a visual question-answering (VQA) benchmark with 2,850 QA pairs, CT-Bench enables targeted training and evaluation of AI models for lesion description, localization, and diagnostic reasoning.
Together, GPTRadScore and CT-Bench provide powerful tools to advance multimodal AI, setting new standards for evaluation, training, and performance in CT imaging analysis.