NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
April 8, 2025 Jaya Srivastava
Leveraging a deep learning model to assess the impact of regulatory variants on traits and diseases -
April 15, 2025 Pascal Mutz
TBD -
April 18, 2025 Valentina Boeva, Department of Computer Science, ETH Zurich
Decoding tumor heterogeneity: computational methods for scRNA-seq and spatial omics -
April 22, 2025 Stanley Liang
TBD -
April 29, 2025 MG Hirsch
TBD
RECENT SEMINARS
-
April 1, 2025 Roman Kogay
Horizontal transfer of bacterial operons into eukaryote genomes -
March 25, 2025 Yifan Yang
Adversarial Manipulation and Data Memorization in Large Language Models for Medicine -
March 11, 2025 Sofya Garushyants
Tmn – bacterial anti-phage defense system -
March 4, 2025 Sanasar Babajanyan
Evolution of antivirus defense in prokaryotes depending on the environmental virus load -
Feb. 25, 2025 Zhizheng Wang
GeneAgent: Self-verification Language Agent for Gene Set Analysis using Domain Databases
Scheduled Seminars on May 9, 2024
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Metatranscriptomics as well as targeted approaches uncover more and more diverse RNA virus families, yet more to be expected. Thorough protein annotation and comparison is essential to get insights into function and evolution of the viruses and their proteins. In addition to sequence and protein profile based methods, protein structure comparison adds a powerful tool to uncover protein function and relationships. In this study, we used protein structure modeling and subsequent structure comparison searches to illuminate the remaining ‘dark matter’ in hundreds of thousands of previously discovered RNA viruses. Only a few domains and small proteins within this ‘dark matter’ could be confidently assigned a distinct fold and function. The vast majority of the domains showed either ‘generic’ folds (e.g. single alpha-helices) or no high confidence structure prediction. Thus, it appears that notwithstanding the continuing discovery of new RNA viruses by metatranscriptomics, all the protein domains shared by large groups of these viruses have already been identified. The rest of the viral proteome appears to consist of poorly structured domains including intrinsically disordered ones that likely mediate interactions between viral and host proteins. In the course of this work, a Riboviria ‘structurome’ was compiled from already annotated and initially non-annotated (‘dark matter’) proteins and domains encoded in viral genomes. Comparing structures within this ’structurome’ helps to understand protein relationship across virus families.