
High-level API for torch.sparse.mm with optimized spmm_reduce kernel using CSR format #6699

Merged: 24 commits into master from spmm_reduce_api on Feb 19, 2023

Conversation

@JakubPietrakIntel (Contributor) commented Feb 14, 2023

Related to an important optimization in PyTorch:
port sparse_mm.reduce to pytorch and optimize it on CPU #83727

This PR updates the simplified high-level API for the spmm_reduce() kernel and its tests.

The current kernel implementation is limited to processing src of type torch.Tensor in torch.sparse_csr layout. I've therefore added an option to auto-convert src to CSR format via src.to_sparse_csr(). The option is False by default, in which case a ValueError is raised if the input is not provided in the correct format.

The conversion from SparseTensor to torch.Tensor is enabled by default for PyTorch > 1.13.

Added a transform that removes duplicated edges from the ogbn-products dataset, because the new kernel can't handle duplicate entries (useful for benchmarks).

Re-opened this PR because the draft (#6689) needed to be scrapped after a rebase.
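To make the semantics concrete, here is a minimal pure-Python sketch of what an spmm_reduce-style kernel computes over CSR data. This is an illustration only, not the PyTorch/PyG implementation; the function name, signature, and list-based representation are all hypothetical.

```python
def spmm_reduce(crow, col, val, other, reduce="sum"):
    """Multiply a CSR sparse matrix (crow, col, val) by a dense matrix
    `other` (given as a list of rows), combining the products that land
    in each output row with the given `reduce` op.

    Illustrative sketch only; not the real PyG/PyTorch API.
    """
    n_rows = len(crow) - 1
    n_cols = len(other[0])
    out = []
    for i in range(n_rows):
        start, end = crow[i], crow[i + 1]
        # One partial product row per nonzero in sparse row i.
        entries = [
            [val[k] * other[col[k]][j] for j in range(n_cols)]
            for k in range(start, end)
        ]
        if not entries:
            out.append([0.0] * n_cols)
            continue
        if reduce in ("sum", "mean"):
            row = [sum(e[j] for e in entries) for j in range(n_cols)]
            if reduce == "mean":
                row = [x / len(entries) for x in row]
        elif reduce == "min":
            row = [min(e[j] for e in entries) for j in range(n_cols)]
        elif reduce == "max":
            row = [max(e[j] for e in entries) for j in range(n_cols)]
        else:
            raise ValueError(f"`reduce` argument '{reduce}' not supported")
        out.append(row)
    return out
```

For example, the CSR matrix [[1, 0], [2, 3]] (crow=[0, 1, 3], col=[0, 0, 1], val=[1, 2, 3]) times the dense column [[1], [1]] yields [[1], [5]] with reduce="sum" and [[1], [3]] with reduce="max".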

@JakubPietrakIntel JakubPietrakIntel marked this pull request as ready for review February 14, 2023 13:12
Review threads (outdated, resolved): CHANGELOG.md, torch_geometric/transforms/__init__.py, torch_geometric/utils/spmm.py
rusty1s added a commit that referenced this pull request Feb 15, 2023
Removes duplicated edges from the given homogeneous or heterogeneous graph. This may change the original edge order of the dataset, since one copy of each duplicated edge is concatenated at the end. It can be used to clean up the known issue of repeated self-connecting edges in ogbn-products. Reference on the ogbn-products leaderboard:
[here](https://ogb.stanford.edu/docs/nodeprop/#:~:text=Note%3A%20A%20very%20small%20number%20of%20self%2Dconnecting%20edges%20are%20repeated%20(see%20here)%3B%20you%20may%20remove%20them%20if%20necessary)

Moved this to a separate PR, split out from #6699.

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Matthias Fey <matthias.fey@tu-dortmund.de>
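The deduplication the commit describes can be sketched in a few lines of plain Python. Note this simplification keeps the first occurrence of each edge and preserves relative order, unlike the actual transform described above (which concatenates one copy of each duplicate at the end); the function name and edge-list representation are illustrative, not the real PyG API.

```python
def remove_duplicated_edges(edge_index):
    """edge_index: list of (src, dst) pairs. Returns the list with
    duplicated edges dropped, keeping the first occurrence of each.

    Simplified illustration of the dedup idea, not the PyG transform.
    """
    seen = set()
    out = []
    for edge in edge_index:
        if edge not in seen:
            seen.add(edge)
            out.append(edge)
    return out
```

For ogbn-products, the duplicates in question are repeated self-connecting edges, i.e. pairs of the form (v, v) appearing more than once.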
@github-actions github-actions bot added the utils label Feb 19, 2023
@rusty1s rusty1s merged commit 6ee08da into master Feb 19, 2023
@rusty1s rusty1s deleted the spmm_reduce_api branch February 19, 2023 18:48
reduce = 'sum' if reduce == 'add' else reduce

if reduce not in ['sum', 'mean', 'min', 'max']:
    raise ValueError(f"`reduce` argument '{reduce}' not supported")

if isinstance(src, SparseTensor):
    return torch_sparse.matmul(src, other, reduce)
Contributor:
Hi @JakubPietrakIntel @rusty1s, may I ask about the logic here? If src is a SparseTensor, which is the default data type, it will never reach the optimized spmm implementation in PyTorch.

Member:

Yes, I changed this back for now since torch.sparse.mm is missing CUDA support. I think we can either patch this in SparseTensor or patch this here such that we only call torch.sparse.mm in case PyTorch >= 2.0 and CPU. Otherwise, I think this solution is fine, as we support torch.sparse.Tensor now anyway (and are in the process of removing torch-sparse dependency).
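The dispatch rule discussed here ("only call torch.sparse.mm in case PyTorch >= 2.0 and CPU") could be sketched as a simple guard. This is a hypothetical helper for illustration, not code from the PR; the function name and string-based version check are assumptions.

```python
def use_torch_sparse_mm(torch_version: str, device_type: str) -> bool:
    """Hypothetical guard: route to torch.sparse.mm only for
    PyTorch >= 2.0 on CPU; callers would fall back to
    torch_sparse.matmul otherwise."""
    major, minor = (int(x) for x in torch_version.split('.')[:2])
    return (major, minor) >= (2, 0) and device_type == 'cpu'
```

Under this rule, PyTorch 1.13 on CPU and any version on CUDA would keep using the torch_sparse.matmul fallback until CUDA support lands upstream.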

Contributor:

Understood. So for now, we need to add a patch to call torch.sparse.mm when PyTorch >= 2.0 and the device is CPU. Do you know when CUDA support will be added?

Member:

I don't have any insights into that. Let me add the optimized routine in a separate PR.

Member:

Fixed this in #6759

Contributor:

Thanks. The fix LGTM.
