NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
Sept. 9, 2025 Chih-Hsuan Wei
No Data Left Behind: FAIR-SMart Enables FAIR Access to Supplementary Materials for Research Transparency -
Sept. 16, 2025 James Leaman JR.
TBD -
Sept. 23, 2025 Martha Nelson
TBD -
Sept. 30, 2025 Erez Persi
TBD -
Oct. 7, 2025 Liana Yeganova
TBD
RECENT SEMINARS
-
July 15, 2025 Noam Rotenberg
Cell phenotypes in the biomedical literature: a systematic analysis and the NLM CellLink text mining corpus -
July 3, 2025 Matthew Diller
Using Ontologies to Make Knowledge Computable -
July 1, 2025 Yoshitaka Inoue
Graph-Aware Interpretable Drug Response Prediction and LLM-Driven Multi-Agent Drug-Target Interaction Prediction -
June 10, 2025 Aleksandra Foerster
Interactions at pre-bonding distances and bond formation for open p-shell atoms: a step toward biomolecular interaction modeling using electrostatics -
June 3, 2025 MG Hirsch
Interactions among subclones and immunity controls melanoma progression
Scheduled Seminars on March 28, 2023
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Viruses with large double-stranded DNA genomes appear to have captured the majority of their genes from the hosts at different stages of evolution. The origin of many virus genes is readily detected through highly significant sequence similarity with cellular homologs. This is the case, in particular, for virus enzymes, such as DNA and RNA polymerases or nucleotide kinases, that retain their catalytic activity after capture by an ancestral virus. However, a large fraction of virus genes have no readily detectable cellular homologs so that their origin remains enigmatic. We sought to explore potential origins of proteins of unknown provenance encoded in the genomes of orthopoxviruses, a thoroughly studied virus genus which includes major human pathogens. To this end, we used AlphaFold2, to predict the structures of all 214 proteins encoded by orthopoxviruses. Among the proteins of unknown provenance, structure prediction yielded a clear indication of origin for 14, along with validating several inferences previously made by sequence analysis. A notable trend that emerges from these findings is the exaptation of enzymes from cellular organisms for non-enzymatic, structural roles in virus reproduction which is accompanied by disruption of catalytic sites and overall drastic divergence which precludes detection of homology at the sequence level. Among the 16 orthopoxvirus proteins found to be inactivated enzyme derivatives, are the poxvirus replication processivity factor A20, an inactivated derivative of NAD-dependent DNA ligase; major core protein A3, an inactivated deubiquitinase; F11, an inactivated prolyl hydroxylase; and more similar cases. However, for nearly one third of the orthopoxvirus virion proteins, no significantly similar structures were identified, suggesting exaptation with subsequent major structural rearrangement, yielding unique protein folds.