[Model & Dataset] facebook & sp2gcl #201

777sssa · 2024-04-23T06:18:34Z

Description

Checklist

Please feel free to remove inapplicable items for your PR.

The PR title starts with [$CATEGORY] (such as [NN], [Model], [Doc], [Feature]])
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented
To the best of my knowledge, examples are either not affected by this change,
or have been fixed to be compatible with this change
Related issue is referred in this PR

Changes

dddg617 · 2024-05-20T06:28:16Z

gammagl/datasets/facebook.py

+        x = tlx.convert_to_tensor(data['features'], dtype=tlx.float32)
+        y = tlx.convert_to_tensor(data['target'], dtype=tlx.int64)
+        edge_index = tlx.convert_to_tensor(data['edges'], dtype=tlx.int64)
+        edge_index = edge_index.T


Have you tried if this can work in the other backend like 'mindspore'?

dddg617 · 2024-05-20T06:31:25Z

gammagl/datasets/facebook.py

+
+    def __init__(
+        self,
+        root: str,


Currently, this argument can be optional, as we have a cached mechanism compared to PyG.

dddg617 · 2024-05-27T09:11:19Z

examples/sp2_gcl/node_main.py

+        loss = 0.5 * tlx.losses.softmax_cross_entropy_with_logits(logits, labels) + 0.5 * tlx.losses.softmax_cross_entropy_with_logits(logits.transpose(-2, -1), labels)
+        return loss
+def main(args):
+    global edge, e, u, test_idx


What is this line doing?

dddg617 · 2024-05-27T09:13:29Z

examples/sp2_gcl/node_main.py

+                val_idx = tlx.where(data.val_mask)[0]
+                test_idx = tlx.where(data.test_mask)[0]
+        else:
+            train_idx, val_idx, test_idx = split(y)


I do not think this is useful. Usually, the train, valid, test split should be done in the dataset. You may directly use data.train_mask .etc to get the idx instead of add a new function in the util.

dddg617 · 2024-05-27T09:17:52Z

gammagl/models/sp2gcl.py

+import tensorlayerx as tlx
+import tensorlayerx.nn as nn
+from gammagl.layers.conv import GCNConv
+class Encoder(nn.Module):


I remember there are many Encoder in the gammagl. I do not recommend you to place this function here.

dddg617 · 2024-05-27T09:18:00Z

gammagl/models/sp2gcl.py

+        return x
+
+
+class MLP(nn.Module):


dddg617 · 2024-05-27T09:18:06Z

gammagl/models/sp2gcl.py

+
+
+
+class EigenMLP(nn.Module):


dddg617 · 2024-05-27T09:18:31Z

gammagl/utils/split.py

+import tensorlayerx as tlx
+import numpy as np
+
+def split(y):


This util is useless, remove it.

from sklearn.model_selection import train_test_split

gyzhou2000 · 2024-06-03T03:33:43Z

examples/sp2_gcl/readme.md

+@ -0,0 +1,40 @@
+# Graph Contrastive Learning with Stable and Scalable
+
+- Paper link: [https://proceedings.neurips.cc/paper_files/paper/2023/file/8e9a6582caa59fda0302349702965171-Paper-Conference.pdf](https://arxiv.org/abs/2201.11349)


链接不对

gyzhou2000 · 2024-06-03T03:38:40Z

gammagl/models/sp2gcl.py

+        period_e = e.unsqueeze(1) * tlx.pow(2, period_term)
+        # period_e = period_e.to(u.device)
+        fourier_e = tlx.concat([tlx.sin(period_e), tlx.cos(period_e)], axis=-1)
+        h = u @ fourier_e


tlx.unsqueeze，矩阵乘用tlx.matmul

gyzhou2000 · 2024-06-03T03:49:51Z

examples/sp2_gcl/readme.md

+| PubMed     | 82.3±0.3 | OOM        |
+| Wiki-CS | 79.42±0.19 | 76.79 ± 0.61 |
+| Facebook   | 90.43±0.13 | 85.35±0.26 |


gyzhou2000 · 2024-06-03T03:52:15Z

gammagl/models/sp2gcl.py

+        self.phi = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 16))
+        self.psi = nn.Sequential(nn.Linear(16, 16), nn.ReLU(), nn.Linear(16, 1))


gyzhou2000 · 2024-06-03T03:54:05Z

tests/datasets/test_facebook.py

+	dataset = FacebookPagePage(root='data/facebook')
+	g = dataset[0]
+	pass


节点数量，边数量，节点特征维度，类别数量都判断一下

gyzhou2000 · 2024-06-03T03:56:00Z

examples/sp2_gcl/sp2gcl_trainer.py

+    parser.add_argument('--seed', type=int, default=0)
+    parser.add_argument('--cuda', type=int, default=3)


seed去点，设置device参考其他trainer写法

gyzhou2000 · 2024-06-03T03:58:28Z

examples/sp2_gcl/sp2gcl_trainer.py

+def compute_laplacian(data):
+
+    edge_index = data.edge_index
+    num_nodes = data.num_nodes
+    row, col = edge_index
+    data_adj = csr_matrix((np.ones(len(row)), (row, col)), shape=(num_nodes, num_nodes))
+    degree = np.array(data_adj.sum(axis=1)).flatten()
+    deg_inv_sqrt = 1.0 / np.sqrt(degree)
+    deg_inv_sqrt[np.isinf(deg_inv_sqrt)] = 0
+    I = csr_matrix(np.eye(num_nodes))
+    D_inv_sqrt = csr_matrix((deg_inv_sqrt, (np.arange(num_nodes), np.arange(num_nodes))))
+    L = I - D_inv_sqrt.dot(data_adj).dot(D_inv_sqrt)
+    e, u = scipy.sparse.linalg.eigsh(L, k=100, which='SM', tol=1e-3)
+    data.e = tlx.convert_to_tensor(e, dtype=tlx.float32)
+    data.u = tlx.convert_to_tensor(u, dtype=tlx.float32)
+
+    return data


试试用 get_laplacian 接口替换

777sssa added 4 commits April 23, 2024 14:11

Create facebook.py

08b2faa

sp2_gcl

7810d58

sp2gcl

85dd253

sp2-gcl

68bc722

gyzhou2000 changed the title ~~Create facebook.py~~ [Model & Dataset] facebook & sp2gcl May 17, 2024

dddg617 reviewed May 20, 2024

View reviewed changes

sp2_gcl_1

04e1400

dddg617 reviewed May 27, 2024

View reviewed changes

777sssa added 2 commits May 28, 2024 21:16

sp2_gcl

4a477d6

sp2_gcl_new

d321bca

gyzhou2000 reviewed Jun 3, 2024

View reviewed changes

777sssa and others added 12 commits June 4, 2024 16:15

sp2_gcl_new

7a69f51

Merge branch 'main' into sp2gcl

e50e2e8

sp2gcl_new

d9b6d60

Merge branch 'sp2gcl' of https://github.com/777sssa/GammaGL into sp2gcl

d41db9c

Merge remote-tracking branch 'upstream/main' into sp2gcl

94ab090

change the code of sp2gcl

d4fb492

update

ba9a242

update

d731cdd

update test file

58e55a0

update

b8146d6

update

4a14c22

update

26d1f1b

gyzhou2000 merged commit e276485 into BUPT-GAMMA:main Jul 5, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model & Dataset] facebook & sp2gcl #201

[Model & Dataset] facebook & sp2gcl #201

777sssa commented Apr 23, 2024

dddg617 May 20, 2024

dddg617 May 20, 2024

dddg617 May 27, 2024

dddg617 May 27, 2024

dddg617 May 27, 2024

dddg617 May 27, 2024

dddg617 May 27, 2024

dddg617 May 27, 2024

gyzhou2000 May 31, 2024

gyzhou2000 Jun 3, 2024

gyzhou2000 Jun 3, 2024

gyzhou2000 Jun 3, 2024

gyzhou2000 Jun 3, 2024

gyzhou2000 Jun 3, 2024

gyzhou2000 Jun 3, 2024

gyzhou2000 Jun 3, 2024

		self.phi = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 16))
		self.psi = nn.Sequential(nn.Linear(16, 16), nn.ReLU(), nn.Linear(16, 1))

		parser.add_argument('--seed', type=int, default=0)
		parser.add_argument('--cuda', type=int, default=3)




		class EigenMLP(nn.Module):

[Model & Dataset] facebook & sp2gcl #201

[Model & Dataset] facebook & sp2gcl #201

Conversation

777sssa commented Apr 23, 2024

Description

Checklist

Changes

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment