mirror of
https://github.com/kexinhuang12345/DeepPurpose.git
synced 2026-04-03 00:19:34 -06:00
Query Regarding MPNN Drug Encoder #18
Labels
No labels
bug
bug
enhancement
enhancement
enhancement
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: github/DeepPurpose#18
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @SudhirGhandikota on 5/30/2023
Hello,
Before I ask my query, I would like to congratulate everybody involved in building this great framework.
I was trying to train an MPNN-CNN model on the latest BindingDB data and while exploring the existing MPNN Drug encoder implementation, I ran into a couple of doubts/queries. I was hoping to get some clarification from you guys.
From what I understand, the
agraphproperty associated with a drug is basically an adjacency list where therow_indexrepresents an atom and each column value is thein_bondnumber/index. I deduced this from the following code snippet in thesmiles2mpnnfeaturemethod.Then, during the MPNN model training, I can see that the
agraphsassociated with all drugs in a given batch are combined into one combinedagraph_1stvariable. In this consolidation step, you used two variablesN_aandN_bto maintain the running sum of atoms and bonds from each drug. Also, to re-index this combined adjacency list, I can see that you are adding theN_avalue at each step (code snippet below).However, since the values in these adjacency lists are bond numbers or indices, shouldn't we be adding the
N_bvariable value instead? Could you please clarify this doubt of mine?Thanks in advance!