NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
Jan. 14, 2025 Ryan Bell
Comprehensive analysis of the YprA-like helicase family provides deep insight into the evolution and potential mechanisms of widespread and largely uncharacterized prokaryotic antiviral defense systems -
Jan. 16, 2025 Qingqing Zhu
GPTRadScore and CT-Bench: Advancing Multimodal AI Evaluation and Benchmarking in CT Imaging -
Jan. 17, 2025 Xuegong Zhang
Using Large Cellular Models to Understand Cell Transcriptomics Language -
Jan. 21, 2025 Qiao Jin
Artificial Intelligence for Evidence-based Medicine -
Jan. 28, 2025 Kaleb Abram
TBD
RECENT SEMINARS
-
Jan. 14, 2025 Ryan Bell
Comprehensive analysis of the YprA-like helicase family provides deep insight into the evolution and potential mechanisms of widespread and largely uncharacterized prokaryotic antiviral defense systems -
Dec. 17, 2024 Joey Thole
Training set associations drive AlphaFold initial predictions of fold-switching proteins -
Dec. 10, 2024 Amr Elsawy
AI for Age-Related Macular Degeneration on Optical Coherence Tomography -
Dec. 3, 2024 Sarvesh Soni
Toward Relieving Clinician Burden by Automatically Generating Progress Notes -
Nov. 19, 2024 Benjamin Lee
Reiterative Translation in Stop-Free Circular RNAs
Scheduled Seminars on June 22, 2023
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Gene regulation in eukaryotes mainly involves transcription factors (TFs). These proteins bind to regulatory DNA elements such as enhancers and determine the amount and timing of target gene expression. Mutations at TF binding sites (TFBSs) are associated with complex human diseases and traits. Consequently, accurately identifying TFBSs is crucial to pinpoint causal variants.
Computational state-of-the-art algorithms typically use position weight matrices (PWMs) to identify TFBSs in the human genome; however, these algorithms produce too many false positives. Here, we use TREDNet—a deep learning model developed in our research group—to identify TFBSs in HepG2 cell line enhancers accurately. We identify TFBSs at enhancer regions that would damage the enhancer upon mutation, called positive active regions (PARs), and that would strengthen the enhancer upon mutation, called negative active regions (NARs). We found that the NARs are more GC enriched than the PARs. Clustering analysis of the TFBSs at PARs revealed ~10 groups of binding sites. In addition, analysis of TF pair co-occurrence revealed that the forkhead box (FOX) family of TFs is prevalent in PAR regions.