NLM DIR Seminar Schedule

UPCOMING SEMINARS

RECENT SEMINARS


The NLM DIR holds a public weekly seminar series for NLM trainees, staff scientists, and investigators to share details on current and exciting research projects at NLM. Seminars take place on Tuesdays at 11:00 AM, EST and some Thursdays at 3:00 PM, EST. Seminars are held in the B2 Library of Building 38A on the main NIH campus in Bethesda, MD. Due to the Covid-19 pandemic, all seminars are currently held virtually.

To schedule a seminar, click the “Schedule Seminar” button to the right, select an appropriate date on the calendar to sign up, and then complete the form. You will need an NIH PIV card to access the “Schedule Seminar” page.

Please include seminars by invited visiting scientists in the NLM DIR seminar series. These need not be on a Tuesday or Thursday.

If you would like to schedule a seminar by a visiting scientist, click the “Schedule Seminar” and complete the form. Contact NLMDIRSeminarScheduling@mail.nih.gov with questions. Please follow this link to subscribe/unsubscribe to/from the NLM DIR seminar mailing list.

Titles and Abstracts for Upcoming Seminars


(based on the current date)

Samuel Lee
Feb. 18, 2025 at 11 a.m.

Efficient predictions of alternative protein conformations by AlphaFold2-based sequence association

The many successes of AlphaFold2 (AF2) have inspired methods to predict multiple protein conformations, many of which have biological significance. These methods assume that AF2 uses coevolutionary information to predict alternative protein conformations, but they perform poorly on fold-switching proteins, which remodel their secondary structures and modulate their functions in response to cellular stimuli. Here, we present a method designed to leverage AF2’s learning of protein structure more than coevolutionary inference. This method–called CF-random–outperforms other methods for predicting alternative conformations of not only fold switchers but also dozens of other proteins that undergo rigid body motions and local conformational rearrangements. CF-random captures multiple conformations more frequently and requires 3-8x less sampling than all other methods. It also enabled predictions of fold-switched assemblies unpredicted by AlphaFold3. Several lines of evidence indicate that CF-random works by sequence association, suggesting that training-set structures and sequences play an important role in which conformations can be predicted readily. This observation inspired a blind prediction mode for alternative protein conformations. We release CF-random for community use, specifying its strengths and limitations.

Zhizheng Wang
Feb. 25, 2025 at 11 a.m.

GeneAgent: Self-verification Language Agent for Gene Set Analysis using Domain Databases

Gene set analysis allows researchers to explore groups of genes that likely act together in specific biological processes or molecular functions. Recent work in gene set analysis has shown promising performance utilizing large language models (LLMs). Nonetheless, their results are subject to limitations common in LLMs, such as hallucinations. In response, we develop GeneAgent, the first language agent for gene set analysis that self-verifies by autonomously interacting with biological databases, reducing hallucinations and enhancing accuracy. GeneAgent generates novel function names or aligns with notable enriched terms for input gene sets. Benchmarking on 1,106 gene sets from different sources, GeneAgent consistently outperforms vanilla GPT-4 by a significant margin. A detailed manual review confirms the effectiveness of the self-verification module in minimizing hallucinations and generating a more reliable explanatory analysis. We also apply GeneAgent to seven novel gene sets derived from mouse B2905 melanoma cell lines, with expert evaluations showing that GeneAgent offers novel insights into gene functions and expediting knowledge discovery.

Sofya Garushyants
March 4, 2025 at 11 a.m.

TBD