Path-based reasoning in biomedical knowledge graphs

Publication date: Jun 17, 2024

Understanding complex interactions in biomedical networks is crucial for advancements in biomedicine. Traditional link prediction (LP) methods, using similarity metrics like Personalized PageRank, are limited in capturing the complexity of biological networks. Recently, representation-based learning techniques have emerged, mapping nodes to low-dimensional embeddings to enhance prediction accuracy. However, these methods often face challenges with interpretability and scalability in large, complex networks. Based on a representation of biological systems as knowledge graphs (KGs), which encode entities and their relationships as triplets, we propose here BioKGC, a novel graph neural network framework which builds upon the Neural Bellman-Ford Network (NBFNet). It addresses the limitations of previous methods by utilizing path-based reasoning for LP in biomedical knowledge graphs (KGs). Unlike node-embedding learning frameworks that optimize the embedding space based on single triplets, BioKGC learns representations between nodes by considering all relations along paths. This approach enhances prediction accuracy and interpretability, allowing for the visualization of influential paths and facilitating the validation of biological plausibility. BioKGC leverages a background regulatory graph (BRG) for enhanced message passing and implements a stringent negative sampling strategy to improve learning precision. In evaluations across various LP tasks, gene function annotation, drug-disease interaction prediction, synthetic lethality prediction, and lncRNA-mRNA regulatory relationship inference, BioKGC consistently outperformed state-of-the art methods. BioKGC outperformed knowledge graph embedding and GNN-based methods in gene function prediction, especially with BRG information. We demonstrated that BioKGC effectively predicts drug-disease interactions in zero-shot learning scenarios, surpassing state-of-the-art models like TxGNN. Additionally, BioKGC demonstrated robust performance in synthetic lethality prediction and the capacity for scoring novel lncRNA-mRNA interactions, showcasing its versatility in diverse biomedical applications. One of BioKGC’s key advantages is its interpretability, enabling researchers to trace prediction paths and gain insights into molecular mechanisms. Combined with its use of regulatory information for message passing, BioKGC is a powerful tool for predicting complex biological interactions, making it valuable for drug discovery and personalized medicine.

PDF

Concepts Keywords
Biomedi Based
Dementia Biokgc
June Biological
Outperformed Biorxiv
Planegg Brg
Drug
Embedding
Figure
Graph
Network
Node
Nodes
Path
Paths
Preprint

Semantics

Type Source Name
gene GENE ELOVL6
gene GENE FANCE
gene GENE LARGE1
disease MESH synthetic lethality
gene GENE AGRP
gene GENE ARTN
gene GENE TTC41P
gene GENE SHOX2
drug DRUGBANK Aspartame
gene GENE ZBED1
gene GENE H3-4
gene GENE STATH
gene GENE NDC80
drug DRUGBANK Coenzyme M
gene GENE EPHB1
gene GENE SLC6A2
gene GENE ELK3
gene GENE PHB2
drug DRUGBANK Ademetionine
gene GENE ABCC8
gene GENE FBLIM1
gene GENE GOPC
gene GENE TAMALIN
gene GENE LAT2
drug DRUGBANK Spinosad
disease MESH adverse drug reaction
disease MESH COVID 19
drug DRUGBANK Sulpiride
gene GENE SNW1
gene GENE SPHKAP
gene GENE PLEKHM2
gene GENE INPP5K
disease MESH morbidities
disease MESH drug interactions
gene GENE LPO
gene GENE SYNPR
gene GENE MAPK1IP1L
gene GENE TNFRSF11A
gene GENE NEU1
gene GENE NEURL1
gene GENE ERBB2
gene GENE RALA
gene GENE FEA
gene GENE KCNK3
gene GENE C3orf62
disease MESH contraindication
gene GENE GEN1
gene GENE SLC2A10
gene GENE SNAP25
gene GENE HERPUD1
gene GENE RNF2
gene GENE STIP1
gene GENE HOPX
gene GENE ST13
disease MESH Alzheimer’s disease
gene GENE DEPP1
gene GENE LITAF
gene GENE CD5L
gene GENE CD69
gene GENE DNMT1
gene GENE MAX
gene GENE RNF128
gene GENE POLR3E
gene GENE EFS
gene GENE RELA
gene GENE FAM107B
gene GENE LINC00273
gene GENE THOP1
gene GENE SSRP1
gene GENE CRY1
gene GENE IK
gene GENE DYRK3
drug DRUGBANK Flunarizine
gene GENE PER3
gene GENE BMAL1
gene GENE CLOCK
gene GENE CRY2
gene GENE SET
disease MESH anemia
gene GENE MKS1
gene GENE ME1
gene GENE SAGE1
drug DRUGBANK Isoxaflutole
drug DRUGBANK Piroxicam
disease MESH Acute Lymphoblastic Leukemia
disease MESH syndromes
disease MESH viral infections
disease MESH chromosomal aberrations
drug DRUGBANK L-Tyrosine
gene GENE RN7SL263P
gene GENE BCR
gene GENE ABL1
disease MESH Philadelphia chromosome
drug DRUGBANK Clofarabine
drug DRUGBANK Teniposide
drug DRUGBANK Methotrexate
drug DRUGBANK Dasatinib
drug DRUGBANK Bosutinib
drug DRUGBANK Pentaerythritol tetranitrate
drug DRUGBANK Lauric Acid
gene GENE DSP-AS1
gene GENE MTTP
disease MESH leukemia
gene GENE AICDA
gene GENE LOC124906461
gene GENE DUX4L1
gene GENE DUX4L2
gene GENE DUX4
gene GENE LOC107987485
gene GENE LOC107987487
gene GENE LOC107987484
gene GENE LOC107987486
disease MESH chronic myelogenous leukemia
disease MESH lymphocytic leukemia
gene GENE SMC1A
gene GENE POLA1
gene GENE ERVK-19
gene GENE ERVK-9
gene GENE ERVK-11
drug DRUGBANK Nebularine
gene GENE PICK1
disease MESH gastric cancer
drug DRUGBANK Acitretin
disease MESH skin diseases
drug DRUGBANK Clarithromycin
disease MESH squamous carcinoma
gene GENE RBP1
gene GENE ARID4A
drug DRUGBANK Vitamin A
drug DRUGBANK Tretinoin
disease MESH neurodegenerative disorder
gene GENE MAPT
disease MESH atrophy
disease MESH dementia
disease MESH neurofibrillary tangles
disease MESH Lewy body dementia
drug DRUGBANK Epicriptine
drug DRUGBANK Acetylcarnitine
drug DRUGBANK Memantine
drug DRUGBANK Aducanumab
drug DRUGBANK Nicotine
drug DRUGBANK Choline
drug DRUGBANK Bupropion

Download Document

(Visited 3 times, 1 visits today)

Leave a Comment

Your email address will not be published. Required fields are marked *