NLM DIR Seminar Schedule
UPCOMING SEMINARS
RECENT SEMINARS
-
Dec. 17, 2024 Joey Thole
Training set associations drive AlphaFold initial predictions of fold-switching proteins -
Dec. 10, 2024 Amr Elsawy
AI for Age-Related Macular Degeneration on Optical Coherence Tomography -
Dec. 3, 2024 Sarvesh Soni
Toward Relieving Clinician Burden by Automatically Generating Progress Notes -
Nov. 19, 2024 Benjamin Lee
Reiterative Translation in Stop-Free Circular RNAs -
Nov. 12, 2024 Devlina Chakravarty
Fold-switching reveals blind spots in AlphaFold predictions
Scheduled Seminars on Jan. 20, 2022
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Since a genome is essentially a document written in the alphabet of nucleotides, the field of Computational Biology has been informed by Natural Language Processing techniques since its inception. In this talk I will describe how "MinHash", a relatively obscure algorithm developed for searching the web, has been transformative for the task of genomic similarity estimation. I will go into how and why the algorithm works for sequences of nucleotides and amino acids rather than natural language documents, and I will discuss the creation and validation of tools employing the algorithm, variations for different kinds of searches, and the range of applications it can help with.