NLM DIR Seminar Schedule
UPCOMING SEMINARS
RECENT SEMINARS
-
Dec. 2, 2025 Qingqing Zhu
CT-Bench & CARE-CT: Building Reliable Multimodal AI for Lesion Analysis in Computed Tomography -
Nov. 25, 2025 Jing Wang
MIMIC-EXT-TE: Millions Clinical Temporal Event Time-Series Dataset -
Oct. 21, 2025 Yifan Yang
TBD -
Oct. 14, 2025 Devlina Chakravarty
TBD -
Oct. 9, 2025 Ziynet Nesibe Kesimoglu
TBD
Scheduled Seminars on April 21, 2022
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Long-read sequencing technologies have substantially improved our ability to study large and complex genomes. However, de novo assembly of complex genomic and metagenomic datasets remains difficult. In this talk, I will give an algorithmic overview of the genome assembly problem. I will also highlight our Flye assembler that uses repeat graphs to generate accurate and complete assemblies. Finally, I will also present our new metagenomic assembler metaFlye, which addresses important long-read metagenomic assembly challenges, such as uneven bacterial composition and intra-species heterogeneity. Using metaFlye, we were able to recover complete or nearly-complete bacterial genomes from complex environmental samples, such as human gut or cow rumen. We also showed that long-read assembly of human microbiomes enables the discovery of full-length biosynthetic gene clusters that encode biomedically important natural products.