NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
April 8, 2025 Jaya Srivastava
Leveraging a deep learning model to assess the impact of regulatory variants on traits and diseases -
April 15, 2025 Pascal Mutz
TBD -
April 18, 2025 Valentina Boeva, Department of Computer Science, ETH Zurich
Decoding tumor heterogeneity: computational methods for scRNA-seq and spatial omics -
April 22, 2025 Stanley Liang
TBD -
April 29, 2025 MG Hirsch
TBD
RECENT SEMINARS
-
April 1, 2025 Roman Kogay
Horizontal transfer of bacterial operons into eukaryote genomes -
March 25, 2025 Yifan Yang
Adversarial Manipulation and Data Memorization in Large Language Models for Medicine -
March 11, 2025 Sofya Garushyants
Tmn – bacterial anti-phage defense system -
March 4, 2025 Sanasar Babajanyan
Evolution of antivirus defense in prokaryotes depending on the environmental virus load -
Feb. 25, 2025 Zhizheng Wang
GeneAgent: Self-verification Language Agent for Gene Set Analysis using Domain Databases
Scheduled Seminars on Jan. 16, 2025
In-person: Building 38A/B2N14 NCBI Library or Meeting Link
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
We introduce GPTRadScore, a groundbreaking evaluation framework for assessing multimodal large language models (LLMs) in CT imaging. Using GPT-4, GPTRadScore measures model performance in tasks like lesion localization, body part identification, and lesion typing. It outperforms traditional metrics such as BLEU and ROUGE, aligning closely with expert clinician assessments. Fine-tuning with specialized datasets significantly boosts performance, as demonstrated by RadFM’s notable improvements in accuracy.
To support the development of AI in CT imaging, we also present CT-Bench, a comprehensive dataset containing 20,335 annotated lesions from 7,795 patient studies. Accompanied by high-quality, GPT-4-enhanced textual descriptions and a visual question-answering (VQA) benchmark with 2,850 QA pairs, CT-Bench enables targeted training and evaluation of AI models for lesion description, localization, and diagnostic reasoning.
Together, GPTRadScore and CT-Bench provide powerful tools to advance multimodal AI, setting new standards for evaluation, training, and performance in CT imaging analysis.