Skip to content

aabbccgithub/IGT-Intermolecular-Graph-Transformer

 
 

Repository files navigation

IGT: Improved Drug-target Interaction Prediction with Intermolecular Graph Transformer

Authors: Siyuan Liu*, Yusong Wang*, Yifan Deng, Liang He, Bin Shao, Jian Yin, Nanning Zheng, Tie-Yan Liu, Tong Wang# (*equal contribution, #corresponding author)

Article: https://academic.oup.com/bib/advance-article-abstract/doi/10.1093/bib/bbac162/6581433

Introduction

The identification of active binding drugs for target proteins (referred to as drug-target interaction prediction) is the key challenge in virtual screening, which plays an essential role in drug discovery. Although recent deep learningbased approaches achieve better performance than molecular docking, existing models often neglect topological or spatial of intermolecular information, hindering prediction performance. We recognize this problem and propose a novel approach called the Intermolecular Graph Transformer (IGT) that employs a dedicated attention mechanism to model intermolecular information with a three-way Transformer-based architecture. IGT outperforms state-of-the-art (SoTA) approaches by 9.1% and 20.5% over the second best option for binding activity and binding pose prediction respectively, and exhibits superior generalization ability to unseen receptor proteins than SoTA approaches. Furthermore, IGT exhibits promising drug screening ability against SARS-CoV-2 by identifying 83.1% active drugs that have been validated by wetlab experiments with near-native predicted binding poses.

image

Prerequisites

Please first download the data and model checkpoint using the prerequisites.sh.

bash prerequisites.sh

The following Python packages are needed for running the code:

numpy
scipy
pandas
scikit-learn
torch
rdkit
oddt
hydra-core

Run evaluation

Below is an example command line for evaluting the model on the test set of LIT-PCBA (assuming two GPUs):

cd code
python -m torch.distributed.launch \
    --nnode=1 --node_rank=0 --nproc_per_node=2 --use_env evaluate.py \
    hydra.run.dir=$PWD \
    local_world_size=2 \
    data_fpath=../data/lit-pcba dataset=lit-pcba batch_size=1 \
    +model=igt \
    load_from=../ckpt/igt.pt output_path=lit-pcba.eval.csv data_keys=keys/lit-pcba.pkl

To run evaluation on MUV, change all occurences of "lit-pcba" in the command line to "muv".

About

No description, website, or topics provided.

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.5%
  • Shell 0.5%