NLM DIR Seminar Schedule
UPCOMING SEMINARS
-
May 12, 2026 John Bridgers
A bi-partition function algorithm to evaluate inferred subclonal structures in single-cell sequencing data -
May 14, 2026 Brandon Colelough
TBD -
May 19, 2026 Leann Lindsey
Are Genomic Language Models Learning? Insights from Tokenization Analysis and Prophage Detection in Bacterial Genomes -
May 26, 2026 Harutyun Saakyan
TBD -
May 27, 2026 Brian Abraham
Cis-Regulatory Organization and Transcription Factor Control of Cell Identity and Disease
RECENT SEMINARS
-
May 5, 2026 Benjamin Hou
Machine Learning for Craniofacial Malocclusion Prediction -
April 28, 2026 Niccolo Marini
From Unimodal Datasets to Multimodal Foundation Models: Synthetic Clinical Notes for Dermatology AI -
April 21, 2026 Yoshitaka Inoue
Drug Response Prediction: Generalization using Graph Neural Networks & Reasoning over Predictions using LLMs -
April 16, 2026 Matthew Diller
Analyzing Similarity in Common Data Elements in the NIH CDE Repository via Semantic Clustering -
April 7, 2026 Henry Secaira Morocho
Toward a systematic method of database enrichment for reference-based metagenomics
Scheduled Seminars on March 5, 2026
In-person: Building 38A/B2N14 NCBI Library or Meeting Link
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
The Systems Biology Graphical Notation (SBGN) provides standardized visual languages for representing complex biological processes, facilitating communication, reproducibility, and model sharing in systems biology. However, generating high-quality SBGN maps from scratch can be challenging, particularly for new users, due to the steep learning curve of advanced editors and the difficulty of translating informal ideas into structured diagrams. To address these challenges, we present a workflow that streamlines SBGN map creation and refinement through three key steps, supporting both Process Description (PD) and Activity Flow (AF) languages. The first step enables automatic conversion of hand-drawn SBGN sketches into SBGN-ML format using large language models with in-context learning. Quick correction of small recognition errors is supported through an interactive interface, while biological identifiers are mapped automatically using an external library. Second, the workflow supports flexible merging and splitting of maps. Digitized maps can be merged with existing ones to create larger networks in incremental steps by identifying nodes and edges with common attributes, or reorganized into smaller components as needed, while respecting existing layouts in both cases. Finally, we introduce layout refinement methods. A user-guided layout algorithm allows sketch-based hints to influence the arrangement of the entire network or selected subgraphs, while a polishing step improves readability by aligning edges orthogonally or diagonally and organizing nodes by functional role (input, output, modifier). Together, these features provide an end-to-end solution for transforming informal sketches into structured, publication-ready SBGN maps, lowering the entry barrier for new users while offering flexible control for experts.