Publication date: Oct 11, 2024
Target discovery is a crucial step in drug development, as it directly affects the success rate of clinical trials. Knowledge graphs (KGs) offer unique advantages in processing complex biological data and inferring new relationships. Existing biomedical KGs primarily focus on tasks such as drug repositioning and drug-target interactions, leaving a gap in the construction of KGs tailored for target discovery. We established a comprehensive biomedical KG focusing on target discovery, termed TarKG, by integrating seven existing biomedical KGs, nine public databases, and traditional Chinese medicine knowledge databases. TarKG consists of 1,143,313 entities and 32,806,467 relations across 15 entity categories and 171 relation types, all centered around three core entity types: Disease, Gene, Compound. TarKG provides specialized knowledges for the core entities including chemical structures, protein sequences or text descriptions. By using different KG embedding algorithms, we assessed the knowledge completion capabilities of TarKG, particularly for disease-target link prediction. In case studies, we further examined TarKG’s ability to predict potential protein targets for Alzheimer’s disease (AD) and to identify diseases potentially associated with the metallo-deubiquitinase CSN5, using literature analysis for validation. Furthermore, we provided a user-friendly web server (https://tarkg. ddtmlab. org) that enables users to perform knowledge retrieval and relation inference using TarKG. TarKG is accessible at https://tarkg. ddtmlab. org. Supplementary data are available at Bioinformatics online.
Open Access PDF
Concepts | Keywords |
---|---|
Alzheimer | Biomedical |
Biomedical | Comprehensive |
Chinese | Core |
Online | Databases |
Discovery | |
Drug | |
Entities | |
Entity | |
Existing | |
Graphs | |
Kgs | |
Relation | |
Target | |
Tarkg | |
Types |
Semantics
Type | Source | Name |
---|---|---|
drug | DRUGBANK | Spinosad |
disease | MESH | Alzheimer’s disease |
drug | DRUGBANK | Guanosine |
drug | DRUGBANK | Coenzyme M |
disease | MESH | Syndrome |
drug | DRUGBANK | Alpha-1-proteinase inhibitor |
disease | MESH | death |
disease | MESH | neurodegenerative diseases |
drug | DRUGBANK | Xanthine |
drug | DRUGBANK | Oxygen |
disease | MESH | myelogenous leukemia |
drug | DRUGBANK | Pentaerythritol tetranitrate |
disease | MESH | Precursor T Cell Lymphoblastic Leukemia Lymphoma |
disease | MESH | cognitive impairment |
drug | DRUGBANK | Pioglitazone |
disease | MESH | lung cancer |
drug | DRUGBANK | Troleandomycin |
disease | MESH | Malignancies |
disease | MESH | Covid 19 |
disease | MESH | Parkinson’s disease |
drug | DRUGBANK | Pilocarpine |
disease | MESH | Seizures |
drug | DRUGBANK | Carboxyamidotriazole |
disease | MESH | Morbidities |