NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
July 3, 2025 Matthew Diller
Using Ontologies to Make Knowledge Computable -
July 15, 2025 Noam Rotenberg
Cell phenotypes in the biomedical literature: a systematic analysis and the NLM CellLink text mining corpus
RECENT SEMINARS
-
July 1, 2025 Yoshitaka Inoue
Graph-Aware Interpretable Drug Response Prediction and LLM-Driven Multi-Agent Drug-Target Interaction Prediction -
June 10, 2025 Aleksandra Foerster
Interactions at pre-bonding distances and bond formation for open p-shell atoms: a step toward biomolecular interaction modeling using electrostatics -
June 3, 2025 MG Hirsch
Interactions among subclones and immunity controls melanoma progression -
May 29, 2025 Harutyun Sahakyan
In silico evolution of globular protein folds from random sequences -
May 20, 2025 Ajith Pankajam
A roadmap from single cell to knowledge graph
Scheduled Seminars on June 22, 2023
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Gene regulation in eukaryotes mainly involves transcription factors (TFs). These proteins bind to regulatory DNA elements such as enhancers and determine the amount and timing of target gene expression. Mutations at TF binding sites (TFBSs) are associated with complex human diseases and traits. Consequently, accurately identifying TFBSs is crucial to pinpoint causal variants.
Computational state-of-the-art algorithms typically use position weight matrices (PWMs) to identify TFBSs in the human genome; however, these algorithms produce too many false positives. Here, we use TREDNet—a deep learning model developed in our research group—to identify TFBSs in HepG2 cell line enhancers accurately. We identify TFBSs at enhancer regions that would damage the enhancer upon mutation, called positive active regions (PARs), and that would strengthen the enhancer upon mutation, called negative active regions (NARs). We found that the NARs are more GC enriched than the PARs. Clustering analysis of the TFBSs at PARs revealed ~10 groups of binding sites. In addition, analysis of TF pair co-occurrence revealed that the forkhead box (FOX) family of TFs is prevalent in PAR regions.