benedekrozemberczki / capsgnn Goto Github PK

A PyTorch implementation of "Capsule Graph Neural Network" (ICLR 2019).

License: GNU General Public License v3.0

Python 100.00%

capsgnn capsule-network capsule-neural-networks convolution pytorch research tensorflow machine-learning deep-learning gnn

capsgnn's Introduction

CapsGNN

A PyTorch implementation of Capsule Graph Neural Network (ICLR 2019).

Abstract

The high-quality node embeddings learned from the Graph Neural Networks (GNNs) have been applied to a wide range of node-based applications and some of them have achieved state-of-the-art (SOTA) performance. However, when applying node embeddings learned from GNNs to generate graph embeddings, the scalar node representation may not suffice to preserve the node/graph properties efficiently, resulting in sub-optimal graph embeddings. Inspired by the Capsule Neural Network (CapsNet), we propose the Capsule Graph Neural Network (CapsGNN), which adopts the concept of capsules to address the weakness in existing GNN-based graph embeddings algorithms. By extracting node features in the form of capsules, routing mechanism can be utilized to capture important information at the graph level. As a result, our model generates multiple embeddings for each graph to capture graph properties from different aspects. The attention module incorporated in CapsGNN is used to tackle graphs with various sizes which also enables the model to focus on critical parts of the graphs. Our extensive evaluations with 10 graph-structured datasets demonstrate that CapsGNN has a powerful mechanism that operates to capture macroscopic properties of the whole graph by data-driven. It outperforms other SOTA techniques on several graph classification tasks, by virtue of the new instrument.

This repository provides a PyTorch implementation of CapsGNN as described in the paper:

Capsule Graph Neural Network. Zhang Xinyi, Lihui Chen. ICLR, 2019. [Paper]

The core Capsule Neural Network implementation adapted is available [here].

Requirements

The codebase is implemented in Python 3.5.2. package versions used for development are just below.

networkx          2.4
tqdm              4.28.1
numpy             1.15.4
pandas            0.23.4
texttable         1.5.0
scipy             1.1.0
argparse          1.1.0
torch             1.1.0
torch-scatter     1.4.0
torch-sparse      0.4.3
torch-cluster     1.4.5
torch-geometric   1.3.2
torchvision       0.3.0

Datasets

The code takes graphs for training from an input folder where each graph is stored as a JSON. Graphs used for testing are also stored as JSON files. Every node id and node label has to be indexed from 0. Keys of dictionaries are stored strings in order to make JSON serialization possible.

Every JSON file has the following key-value structure:

{"edges": [[0, 1],[1, 2],[2, 3],[3, 4]],
 "labels": {"0": "A", "1": "B", "2": "C", "3": "A", "4": "B"},
 "target": 1}

The **edges** key has an edge list value which descibes the connectivity structure. The **labels** key has labels for each node which are stored as a dictionary -- within this nested dictionary labels are values, node identifiers are keys. The **target** key has an integer value which is the class membership.

Outputs

The predictions are saved in the `output/` directory. Each embedding has a header and a column with the graph identifiers. Finally, the predictions are sorted by the identifier column.

Options

Training a CapsGNN model is handled by the `src/main.py` script which provides the following command line arguments.

Input and output options

  --training-graphs   STR    Training graphs folder.      Default is `input/train/`.
  --testing-graphs    STR    Testing graphs folder.       Default is `input/test/`.
  --prediction-path   STR    Output predictions file.     Default is `output/watts_predictions.csv`.

Model options

  --epochs                      INT     Number of epochs.                  Default is 100.
  --batch-size                  INT     Number fo graphs per batch.        Default is 32.
  --gcn-filters                 INT     Number of filters in GCNs.         Default is 20.
  --gcn-layers                  INT     Number of GCNs chained together.   Default is 2.
  --inner-attention-dimension   INT     Number of neurons in attention.    Default is 20.  
  --capsule-dimensions          INT     Number of capsule neurons.         Default is 8.
  --number-of-capsules          INT     Number of capsules in layer.       Default is 8.
  --weight-decay                FLOAT   Weight decay of Adam.              Defatuls is 10^-6.
  --lambd                       FLOAT   Regularization parameter.          Default is 0.5.
  --theta                       FLOAT   Reconstruction loss weight.        Default is 0.1.
  --learning-rate               FLOAT   Adam learning rate.                Default is 0.01.

Examples

The following commands learn a model and save the predictions. Training a model on the default dataset:

$ python src/main.py

Training a CapsGNNN model for a 100 epochs.

$ python src/main.py --epochs 100

Changing the batch size.

$ python src/main.py --batch-size 128

License

GNU License

capsgnn's People

Contributors

Stargazers

Watchers

Forkers

francescosaveriozuppichini endruk ml-lab awesome-archive xuehaouwa matic0209 liyaangy fisspoint shadowkun rouniuyizu dlwbm123 hack121 leo-xxx buaaplayer lingguomeng jdc08161063 widemeadows chaoyue729 jonathanchiang caprdzv gnn2qsu aymenwah charlottesean xuewengeophysics juingzhou jacke121 mingli1988 zhujunnan shubhampachori12110095 rjt1990 ximinwu wangdxf desire142 nnu-gisa tj1116 ice-river mingli-ai hookk wuxiujin songfgh alucardmini glshi semchan zhuruijie christinaliang ammieqi changewow messier-16 ye-man bingai geant44 hyzcn trantorrepository avinash-chouhan pandinosaurus tgnco1218 phdshliang emily2he jecktion 1360885769 zlannnn jessicadufirst innat shengyupei qss2012 big-yu shenlant314 liangzhenm hardikgw qinghuazhao zwxxj doublepoints excuses123 frankey419 ariacheng debbiely77 themushrr00m giser18 janaldochen jlqzzz aoe-khkhan batermj sienna13 hujilin1229 codeaudit shashankhalo7 skyewang tcwltcwl jiaxu0017 fengnote wengdunfang courage-chen yukewang96 li-study xmaha plenari nanyuu bowenzzzzzz999 yichengdwu zhh0998

capsgnn's Issues

Not able to install torch-scatter with torch 0.4.1

Hello,

Thanks for sharing the implementation.

While I'm try to run your code I get some error for installing the environment.
I have torch 0.4.1, but not able to install torch-scatter.Got the following error:
fatal error: torch/extension.h: No such file or directory

But I can successfully install them for torch 1.0.

Is your code working for torch 1.0? Or how to install torch-scatter for torch 0.4.1?

Details:

$ pip list
Package Version

backcall 0.1.0
certifi 2018.8.24
....
torch 0.4.1.post2
torch-geometric 1.1.1
torchfile 0.1.0
torchvision 0.2.1
tornado 5.1
tqdm 4.31.1
traitlets 4.3.2
urllib3 1.23
visdom 0.1.8.5
vispy 0.5.3
.... ....

$pip install torch-scatter

Coordinate Addition module & Routing

Hi, thanks for your codes of GapsGNN. And I have some questions about Coordinate Addition module and Routing.

Do you use Coordinate Addition module in this codes?
In /src/layers.py, line 137 : c_ij = torch.nn.functional.softmax(b_ij, dim=0) .
At this time, b_ij.size(0) == 1, why use dim =0 ?

Thanks again.

Why is the accuracy always 0.3333?

The model's accuracy is always 0.3333. It never increases or decreases. and the predictions are all zero. Why?

Other datasets

I notice some datasets in your paper such as RE-M5K and RE-M12K. May I know how to obtain these datasets? The processed datasets would be appreciated. Thank you.

Experimental result

Thank you for your work. Then the experimental results I ran out were very different from those in the paper.

Reproduce Issues

Hi, thanks for your PyTorch codes of GapsGNN. I try to run the codes on NCI, DD, and other graph classification datasets, but it doesn't work (For example, training loss converges to 2.0, and test acc is about 50% on NCI1 after several iterations.)
How should I do if I want to run these codes on NCI, DD and etc?
Thanks again.

script to convert other graph datasets into json formate to experiment with code

I need a script that can convert the graph data into the format this code is using?

I was trying to run this code but got this error. See pic below

std = gain / math.sqrt(fan) ZeroDivisionError: float division by zero

std = gain / math.sqrt(fan) ZeroDivisionError: float division by zero
Process finished with exit code 1
what's wrong?who meet the problem.

Dataset size

Why there are only 30 graphs for training?

D&D dataset

I notice some datasets in your paper such as D&D dataset. May I know how to obtain these datasets? The processed datasets would be appreciated. Thank you.

how to repeat your expriments?

Enumerating feature and target values.

100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 60/60 [00:00<00:00, 14754.82it/s]

Training started.

Epochs: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 10/10 [00:05<00:00, 1.90it/s]
CapsGNN (Loss=0.7279): 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 1.92it/s]

Scoring.

Accuracy: 0.3333

Accuracy is too small

Which dataset is used in this code?

Which dataset is used in this code? In your readme file, I couldn't find which dataset is used in this code? Would you please tell me?

Train on Image Dataset

Hi.. thanks for sharing this repo.
What in need to modify so I can train this on an image dataset?
Thanks

CapsGNN (Loss=nan)?

I was trying to run this code but got this error. See pic below

How to generate such dataset

When I transfer the MUTAG to the specific json file, the accuracy is only round 62%. Can you give me some suggestion or the generate file

About dataset

Hi, could you tell me what dataset use in this project?

Something about reshape

Hi @benedekrozemberczki ! Thank you for your work!

I have a question at line 61 and 62 of CapsGNN/src/capsgnn.py

hidden_representations = torch.cat(tuple(hidden_representations))
hidden_representations = hidden_representations.view(1, self.args.gcn_layers, self.args.gcn_filters,-1)

Why you directly reshape L*N,D to 1,L,D,N instead of using permutation after reshape, e.g

hidden_representations = hidden_representations.view(1, self.args.gcn_layers, -1,self.args.gcn_filters).permute(0,1,3,2)

Thank you for your help!

运行代码就报错？都没修改

Traceback (most recent call last):
File "src/main.py", line 19, in
main()
File "src/main.py", line 14, in main
model.fit()
File "/lmq/CapsGNN/src/capsgnn.py", line 293, in fit
prediction, reconstruction_loss = self.model(data)
File "/usr/local/miniconda3/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
File "/lmq/CapsGNN/src/capsgnn.py", line 147, in forward
first_capsule_output = self.first_capsule(hidden_representations)
File "/usr/local/miniconda3/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
File "/lmq/CapsGNN/src/layers.py", line 86, in forward
u = [self.unitsi for i in range(self.num_units)]
File "/lmq/CapsGNN/src/layers.py", line 86, in
u = [self.unitsi for i in range(self.num_units)]
File "/usr/local/miniconda3/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
File "/usr/local/miniconda3/lib/python3.8/site-packages/torch/nn/modules/conv.py", line 313, in forward
return self._conv_forward(input, self.weight, self.bias)
File "/usr/local/miniconda3/lib/python3.8/site-packages/torch/nn/modules/conv.py", line 309, in _conv_forward
return F.conv1d(input, weight, bias, self.stride,
RuntimeError: Expected 2D (unbatched) or 3D (batched) input to conv1d, but got input of size: [1, 2, 20, 15]

Not available with GPU training?

Hi, thank you for your code! May I ask if your code can support GPU computation?