SemRep

Knowledge Graph Exploration with NLM SemRep

Title: A Study of Drug-to-Cardiovascular Disease (CVD) Association with SemRep and Deep Learning


Image courtesy: Wikipedia

Description: Starting with well-defined oxidative stress categories (e.g., Initiation, Regulation, and Outcome of Oxidative Stress) and a list of drugs used in cardiovascular disease (CVD), we will use SemRep to extract all relevant subject-predicate-object (SPO) triplets. We will then build knowledge graphs from these triplets and prepare a multi-order association matrix to represent the graph data structure. Using this graph structure, we will build a sequence prediction model for drug-to-CVD association. The project will provide a detailed analysis of drug-to-CVD associations with both qualitative evidence and quantitative scores.
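To make the idea of a multi-order association matrix concrete, here is a minimal sketch in plain Python. It rests on several assumptions that the project description does not confirm: the triplets are toy placeholders rather than real SemRep output, the function names (`build_index`, `adjacency`, `multi_order`) are hypothetical, and "multi-order" is interpreted as successive powers of the first-order adjacency matrix, where entry (i, j) of the k-th power counts directed walks of length k from node i to node j.

```python
# Toy SPO triplets. Entities and predicates are illustrative placeholders,
# not actual SemRep output.
triplets = [
    ("drug_A", "INHIBITS", "molecule_X"),
    ("molecule_X", "CAUSES", "oxidative_stress"),
    ("oxidative_stress", "ASSOCIATED_WITH", "CVD"),
    ("drug_A", "TREATS", "CVD"),
]


def build_index(triplets):
    """Map every subject and object to a row/column index (sorted by name)."""
    nodes = sorted({t[0] for t in triplets} | {t[2] for t in triplets})
    return {name: i for i, name in enumerate(nodes)}


def adjacency(triplets, index):
    """First-order matrix: A[i][j] = number of triplets from node i to node j."""
    n = len(index)
    A = [[0] * n for _ in range(n)]
    for subj, _pred, obj in triplets:
        A[index[subj]][index[obj]] += 1
    return A


def matmul(X, Y):
    """Multiply two square matrices stored as lists of lists."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def multi_order(A, max_order):
    """Return [A^1, ..., A^max_order]; A^k[i][j] counts walks of length k."""
    powers = [A]
    for _ in range(max_order - 1):
        powers.append(matmul(powers[-1], A))
    return powers


index = build_index(triplets)
orders = multi_order(adjacency(triplets, index), max_order=3)
d, c = index["drug_A"], index["CVD"]
for k, M in enumerate(orders, start=1):
    print(f"order {k}: walks from drug_A to CVD = {M[d][c]}")
```

In this toy graph the drug reaches CVD directly (order 1) and through a three-step chain via oxidative stress (order 3), so the higher-order matrices surface indirect associations that the first-order matrix misses. A real pipeline would likely use sparse matrices and might weight or normalize each order; those choices are left open here.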

Leaders/Instructors: Dr. Dibakar Sigdel & Dr. David Liem (Mr. Vincent Kyi for technical support)

Participants

Note: Participants marked with a * are also involved in other projects in Project 1 (A, B, and C).

Project walk-through

  1. Get familiar with NLM SemRep for biomedical documents (https://semrep.nlm.nih.gov/).
  2. Learn to extract drug information from the DrugBank API (https://www.drugbank.ca/), and study a curated list of oxidative stress categories and associated molecules.
  3. Extract knowledge graph triplets (SPO) for drug and CVD association with the SemRep tool and create a graph data structure (association matrix).
  4. Build an RNN (LSTM) model to predict, classify, or partition each drug or molecule by oxidative stress category, with associated information and analysis.
  5. Organize the code and prepare project documentation sites under the PingLab Intern GitHub account.
  6. Give a final presentation at the lab meeting.
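One way to connect the triplet graph in the walk-through above to a sequence model is to enumerate the paths that link a drug to CVD and encode each path as a list of integer token IDs. The sketch below illustrates that idea only; it is not the project's confirmed method. The triplets are toy placeholders, and `build_graph`, `find_paths`, and `encode` are hypothetical helper names. The paths double as human-readable qualitative evidence, while the encoded form is the kind of input an RNN/LSTM embedding layer typically expects.

```python
from collections import defaultdict

# Toy SPO triplets. Entities and predicates are illustrative placeholders,
# not actual SemRep output.
triplets = [
    ("drug_A", "INHIBITS", "molecule_X"),
    ("molecule_X", "CAUSES", "oxidative_stress"),
    ("oxidative_stress", "ASSOCIATED_WITH", "CVD"),
    ("drug_A", "TREATS", "CVD"),
]


def build_graph(triplets):
    """Adjacency list: subject -> list of (predicate, object) pairs."""
    graph = defaultdict(list)
    for subj, pred, obj in triplets:
        graph[subj].append((pred, obj))
    return graph


def find_paths(graph, source, target, max_hops):
    """Depth-first search for simple paths with at most max_hops edges.

    Each path is a flat list: [node, predicate, node, ..., node].
    """
    paths = []

    def walk(node, path, visited):
        if node == target:
            paths.append(path)
            return
        if (len(path) - 1) // 2 >= max_hops:  # edges already used
            return
        for pred, nxt in graph.get(node, []):
            if nxt not in visited:
                walk(nxt, path + [pred, nxt], visited | {nxt})

    walk(source, [source], {source})
    return paths


def encode(paths):
    """Assign each distinct token an integer ID; 0 is reserved for padding."""
    vocab = {"<pad>": 0}
    encoded = [[vocab.setdefault(tok, len(vocab)) for tok in path]
               for path in paths]
    return vocab, encoded


graph = build_graph(triplets)
paths = find_paths(graph, "drug_A", "CVD", max_hops=3)
vocab, encoded = encode(paths)
for path, ids in zip(paths, encoded):
    print(" -> ".join(path), ids)
```

Capping `max_hops` keeps the number of paths manageable on a dense graph. Before training, the variable-length sequences would still need padding to a common length using the reserved ID 0.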

Education goals: Students will learn to work with innovative text-mining tools (e.g., SemRep, CaseOLAP, Neo4j) for biomedical documents and with machine learning approaches (RNN, LSTM) for model development and implementation to answer important biomedical questions.

Scientific goals: Students will explore knowledge graphs for drug and CVD associations, with a focus on oxidative stress categories (e.g., Initiation, Regulation, and Outcome) and the underlying molecular mechanisms.

Preparing Foundation

Project Detail

References

  1. GitHub for SemRep (https://github.com/CaseOLAP/SemRep)
  2. SemRep Paper (https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8574025)
  3. SemRep NLM (https://semrep.nlm.nih.gov/)
  4. Oxidative Stress (https://en.wikipedia.org/wiki/Oxidative_stress)