NLM DIR Seminar Schedule
UPCOMING SEMINARS
RECENT SEMINARS
-
April 7, 2026 Henry Secaira Morocho
Toward a systematic method of database enrichment for reference-based metagenomics -
March 17, 2026 Roman Kogay
Diversification vs Streamlining: Selection Landscapes of Prokaryotic Genome Evolution -
March 10, 2026 Zhizheng Wang
Large Language Models for Gene Set Analysis -
March 5, 2026 Hasan Balci
From Sketch to SBGN: An AI-Assisted and Interactive Workflow for Generating Pathway Maps -
March 3, 2026 Gianlucca Goncalves Nicastro
Systematic identification of Salmonella T6SS effectors uncovers a lipid-targeting family.
Scheduled Seminars on Nov. 25, 2025
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Early detection and timely treatment are critical in medicine. For example, surgical excision of skin lesions can cure early-stage skin cancer, but once metastasis occurs, even the most advanced therapies often fail. In this work, we introduce MIMIC-EXT-TE, a large-scale dataset provides a structured timeline of over a million clinical events from MIMIC-IV-Note. It is the first dataset with temporal information of events in patient level. To achieve the dataset, we propose to integrate retrieval-augmented generation with large language models to capture the temporal trajectories of patient events. To evaluate the dataset, we introduce TimeLife, a temporal-aware medical question answering system by fine-tuning the Qwen3-4B-Base language model on our dataset. TimeLife achieves an 18% overall accuracy boost on MedMCQA dataset compared with the base model. By fine-tuning TimeLife with downstream tasks such as PubMedQA and MedMCQA, TimeLife achieves the superiority most of the time compared with fine-tuning only on the base model without our dataset.