NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
May 26, 2026 Harutyun Saakyan
Emergence of ribonucleoproteins in molecular evolution simulations -
May 27, 2026 Brian Abraham
Cis-Regulatory Organization and Transcription Factor Control of Cell Identity and Disease -
June 4, 2026 Yin Fang
TBD -
June 9, 2026 Pascal Mutz
TBD -
June 11, 2026 Angela Jiang
TBD
RECENT SEMINARS
-
May 19, 2026 Leann Lindsey
Are Genomic Language Models Learning? Insights from Tokenization Analysis and Prophage Detection in Bacterial Genomes -
May 14, 2026 Brandon Colelough
Biomedical LLM Hallucinations: Detection, Taxonomy, and Mechanistic Knowledge Localization -
May 12, 2026 John Bridgers
A bi-partition function algorithm to evaluate inferred subclonal structures in single-cell sequencing data -
May 5, 2026 Benjamin Hou
Machine Learning for Craniofacial Malocclusion Prediction -
April 28, 2026 Niccolo Marini
From Unimodal Datasets to Multimodal Foundation Models: Synthetic Clinical Notes for Dermatology AI
Scheduled Seminars on Feb. 18, 2025
In-person: Building 38A/B2N14 NCBI Library or Meeting Link
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
The many successes of AlphaFold2 (AF2) have inspired methods to predict multiple protein conformations, many of which have biological significance. These methods assume that AF2 uses coevolutionary information to predict alternative protein conformations, but they perform poorly on fold-switching proteins, which remodel their secondary structures and modulate their functions in response to cellular stimuli. Here, we present a method designed to leverage AF2’s learning of protein structure more than coevolutionary inference. This method–called CF-random–outperforms other methods for predicting alternative conformations of not only fold switchers but also dozens of other proteins that undergo rigid body motions and local conformational rearrangements. CF-random captures multiple conformations more frequently and requires 3-8x less sampling than all other methods. It also enabled predictions of fold-switched assemblies unpredicted by AlphaFold3. Several lines of evidence indicate that CF-random works by sequence association, suggesting that training-set structures and sequences play an important role in which conformations can be predicted readily. This observation inspired a blind prediction mode for alternative protein conformations. We release CF-random for community use, specifying its strengths and limitations.