NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
Feb. 18, 2025 Samuel Lee
Efficient predictions of alternative protein conformations by AlphaFold2-based sequence association -
Feb. 25, 2025 Zhizheng Wang
GeneAgent: Self-verification Language Agent for Gene Set Analysis using Domain Databases -
March 4, 2025 Sofya Garushyants
TBD -
March 11, 2025 Sanasar Babajanyan
TBD -
March 18, 2025 MG Hirsch
TBD
RECENT SEMINARS
-
Feb. 11, 2025 Po-Ting Lai
Enhancing Biomedical Relation Extraction with Directionality -
Feb. 4, 2025 Victor Tobiasson
On the dominance of Asgard contributions to Eukaryogenesis -
Jan. 28, 2025 Kaleb Abram
Leveraging metagenomics to investigate the co-occurrence of virome and defensome elements at large scale -
Jan. 21, 2025 Qiao Jin
Artificial Intelligence for Evidence-based Medicine -
Jan. 17, 2025 Xuegong Zhang
Using Large Cellular Models to Understand Cell Transcriptomics Language
The NLM DIR holds a public weekly seminar series for NLM trainees, staff scientists, and investigators to share details on current and exciting research projects at NLM. Seminars take place on Tuesdays at 11:00 AM, EST and some Thursdays at 3:00 PM, EST. Seminars are held in the B2 Library of Building 38A on the main NIH campus in Bethesda, MD. Due to the Covid-19 pandemic, all seminars are currently held virtually.
To schedule a seminar, click the “Schedule Seminar” button to the right, select an appropriate date on the calendar to sign up, and then complete the form. You will need an NIH PIV card to access the “Schedule Seminar” page.
Please include seminars by invited visiting scientists in the NLM DIR seminar series. These need not be on a Tuesday or Thursday.
If you would like to schedule a seminar by a visiting scientist, click the “Schedule Seminar” and complete the form. Contact NLMDIRSeminarScheduling@mail.nih.gov with questions. Please follow this link to subscribe/unsubscribe to/from the NLM DIR seminar mailing list.
Titles and Abstracts for Upcoming Seminars
(based on the current date)
Efficient predictions of alternative protein conformations by AlphaFold2-based sequence association
The many successes of AlphaFold2 (AF2) have inspired methods to predict multiple protein conformations, many of which have biological significance. These methods assume that AF2 uses coevolutionary information to predict alternative protein conformations, but they perform poorly on fold-switching proteins, which remodel their secondary structures and modulate their functions in response to cellular stimuli. Here, we present a method designed to leverage AF2’s learning of protein structure more than coevolutionary inference. This method–called CF-random–outperforms other methods for predicting alternative conformations of not only fold switchers but also dozens of other proteins that undergo rigid body motions and local conformational rearrangements. CF-random captures multiple conformations more frequently and requires 3-8x less sampling than all other methods. It also enabled predictions of fold-switched assemblies unpredicted by AlphaFold3. Several lines of evidence indicate that CF-random works by sequence association, suggesting that training-set structures and sequences play an important role in which conformations can be predicted readily. This observation inspired a blind prediction mode for alternative protein conformations. We release CF-random for community use, specifying its strengths and limitations.
GeneAgent: Self-verification Language Agent for Gene Set Analysis using Domain Databases
Gene set analysis allows researchers to explore groups of genes that likely act together in specific biological processes or molecular functions. Recent work in gene set analysis has shown promising performance utilizing large language models (LLMs). Nonetheless, their results are subject to limitations common in LLMs, such as hallucinations. In response, we develop GeneAgent, the first language agent for gene set analysis that self-verifies by autonomously interacting with biological databases, reducing hallucinations and enhancing accuracy. GeneAgent generates novel function names or aligns with notable enriched terms for input gene sets. Benchmarking on 1,106 gene sets from different sources, GeneAgent consistently outperforms vanilla GPT-4 by a significant margin. A detailed manual review confirms the effectiveness of the self-verification module in minimizing hallucinations and generating a more reliable explanatory analysis. We also apply GeneAgent to seven novel gene sets derived from mouse B2905 melanoma cell lines, with expert evaluations showing that GeneAgent offers novel insights into gene functions and expediting knowledge discovery.
TBD