NLM DIR Seminar Schedule

Seminars Home

Schedule Seminar

UPCOMING SEMINARS

Sept. 9, 2025 Chih-Hsuan Wei
No Data Left Behind: FAIR-SMart Enables FAIR Access to Supplementary Materials for Research Transparency
Sept. 16, 2025 James Leaman JR.
TBD
Sept. 23, 2025 Martha Nelson
TBD
Sept. 30, 2025 Erez Persi
TBD
Oct. 7, 2025 Liana Yeganova
TBD

RECENT SEMINARS

July 15, 2025 Noam Rotenberg
Cell phenotypes in the biomedical literature: a systematic analysis and the NLM CellLink text mining corpus
July 3, 2025 Matthew Diller
Using Ontologies to Make Knowledge Computable
July 1, 2025 Yoshitaka Inoue
Graph-Aware Interpretable Drug Response Prediction and LLM-Driven Multi-Agent Drug-Target Interaction Prediction
June 10, 2025 Aleksandra Foerster
Interactions at pre-bonding distances and bond formation for open p-shell atoms: a step toward biomolecular interaction modeling using electrostatics
June 3, 2025 MG Hirsch
Interactions among subclones and immunity controls melanoma progression

Scheduled Seminars on Feb. 8, 2022

Speaker

Po-Ting Lai

PI/Lab

Time

11 a.m.

Presentation Title

Moving from Sentence-level to Document-level Relation Extraction

Location

Building 38A - B2 NCBI Library

Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.

Abstract:

Previous studies on biomedical relation extraction (RE) typically focus on extracting binary relations between two entities from a single sentence. However, complex inter-sentence relations involving multiple entity pairs, such as drug-protein and protein-disease, are commonly seen in the biomedical literature. In this talk, I will first introduce the characteristics of sentence-level RE and use the BioCreative VII DrugProt task to showcase a general text classification framework for sentence-level RE. The second part will introduce a new document-level dataset called BioRED, which covers six concept types (cell line, chemical, disease, gene, species, and variant) and eight relation pairs (e.g., chemical-disease, chemical-gene, chemical-chemical) in 600 MEDLINE abstracts. In total, BioRED consists of 20,000 entity and 6,000 relation annotations. The BioRED dataset is currently being used for developing and evaluating state-of-the-art relation extraction methods at the LitCoin natural language processing (NLP) challenge.

NLM DIR Seminar Schedule

UPCOMING SEMINARS

RECENT SEMINARS

Scheduled Seminars on Feb. 8, 2022

Abstract:

ARCHIVES