NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
April 8, 2025 Jaya Srivastava
Leveraging a deep learning model to assess the impact of regulatory variants on traits and diseases -
April 15, 2025 Pascal Mutz
TBD -
April 18, 2025 Valentina Boeva, Department of Computer Science, ETH Zurich
Decoding tumor heterogeneity: computational methods for scRNA-seq and spatial omics -
April 22, 2025 Stanley Liang
TBD -
April 29, 2025 MG Hirsch
TBD
RECENT SEMINARS
-
April 1, 2025 Roman Kogay
Horizontal transfer of bacterial operons into eukaryote genomes -
March 25, 2025 Yifan Yang
Adversarial Manipulation and Data Memorization in Large Language Models for Medicine -
March 11, 2025 Sofya Garushyants
Tmn – bacterial anti-phage defense system -
March 4, 2025 Sanasar Babajanyan
Evolution of antivirus defense in prokaryotes depending on the environmental virus load -
Feb. 25, 2025 Zhizheng Wang
GeneAgent: Self-verification Language Agent for Gene Set Analysis using Domain Databases
Scheduled Seminars on March 30, 2023
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Medical imaging AI has demonstrated its potential to deliver efficiencies and improvements to healthcare through many studies in the literature and a growing number of applications seeking FDA approval. However, significant hurdles and challenges remain for effective, trustworthy, and general real-world implementation and adoption. In this talk, I will present several of our recent works for addressing some of the challenges hindering wide use. I will focus on quality control and data preprocessing which are key early components in the medical imaging AI pipeline and play a vital role in the overall AI performance. The research was mostly motivated by issues that we observed in some datasets for automating the visual evaluation in the screening of cervical cancer, oral cancer, and pulmonary diseases. Specifically, I will introduce our research in identifying low-quality relating to images (such as blur, noise), existence of unrelated data (such as non-medical, atypical images), and mislabeled images. I will also briefly describe methods to extract information (such as anatomical site, ruler) from images that would be useful in the subsequent pipeline tasks of disease detection and classification. In addition, I will touch on issues such as cross-dataset variance, data imbalance, domain shift, and catastrophic forgetting that are commonly encountered when applying AI to medical datasets.