NLM DIR Seminar Schedule
UPCOMING SEMINARS
RECENT SEMINARS
- Dec. 2, 2025 Qingqing Zhu
CT-Bench & CARE-CT: Building Reliable Multimodal AI for Lesion Analysis in Computed Tomography
- Nov. 25, 2025 Jing Wang
MIMIC-EXT-TE: Millions Clinical Temporal Event Time-Series Dataset
- Oct. 21, 2025 Yifan Yang
TBD
- Oct. 14, 2025 Devlina Chakravarty
TBD
- Oct. 9, 2025 Ziynet Nesibe Kesimoglu
TBD
Scheduled Seminars on Feb. 8, 2022
Contact NLMDIRSeminarScheduling@mail.nih.gov with questions about this seminar.
Abstract:
Previous studies on biomedical relation extraction (RE) typically focus on extracting binary relations between two entities from a single sentence. However, complex inter-sentence relations involving multiple entity pairs, such as drug-protein and protein-disease, are commonly seen in the biomedical literature. In this talk, I will first introduce the characteristics of sentence-level RE and use the BioCreative VII DrugProt task to showcase a general text classification framework for sentence-level RE. The second part will introduce a new document-level dataset called BioRED, which covers six concept types (cell line, chemical, disease, gene, species, and variant) and eight relation pairs (e.g., chemical-disease, chemical-gene, chemical-chemical) in 600 MEDLINE abstracts. In total, BioRED consists of 20,000 entity and 6,000 relation annotations. The BioRED dataset is currently being used for developing and evaluating state-of-the-art relation extraction methods at the LitCoin natural language processing (NLP) challenge.