Remove torch_scatter and torch_cluster from CI pipeline's dependencies #233

dhpitt · 2023-10-09T18:18:27Z

We currently use the torch_scatter module on main for a single function, segment_csr, which aggregates features across each neighborhood for the GINO model in neuralop/layers/integral_transform.py. This PR implements a simple Python version of the same function to remove our dependence on torch_scatter and fix that annoying OSError in our CI.
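For readers unfamiliar with segment_csr, here is a minimal sketch of what such a reduction computes. It is not the code from this PR: the function name `segment_csr_sketch` is hypothetical, and it assumes a 1D `indptr` of row boundaries on the same device as `src`, with only "sum" and "mean" reductions.

```python
import torch


def segment_csr_sketch(src: torch.Tensor, indptr: torch.Tensor, reduce: str = "sum") -> torch.Tensor:
    """Reduce rows of `src` over contiguous segments delimited by `indptr`.

    Segment i covers rows indptr[i] .. indptr[i + 1] - 1 (CSR-style row splits).
    Hypothetical helper for illustration only; assumes a 1D int64 `indptr`.
    """
    if reduce not in ("sum", "mean"):
        raise ValueError(f"unsupported reduce: {reduce!r}")

    n_segments = indptr.shape[0] - 1
    lengths = indptr[1:] - indptr[:-1]

    # Segment id for every row of src, e.g. lengths [2, 3] -> [0, 0, 1, 1, 1].
    seg_ids = torch.repeat_interleave(
        torch.arange(n_segments, device=src.device), lengths
    )

    out = torch.zeros((n_segments, *src.shape[1:]), dtype=src.dtype, device=src.device)
    # Out-of-place index_add, so no tensor tracked by autograd is modified in place.
    out = out.index_add(0, seg_ids, src)

    if reduce == "mean":
        # Clamp to 1 so empty segments yield zeros instead of dividing by zero.
        counts = lengths.clamp(min=1).to(src.dtype)
        out = out / counts.view(-1, *([1] * (src.dim() - 1)))
    return out


src = torch.tensor([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0], [9.0, 10.0]])
indptr = torch.tensor([0, 2, 5])
summed = segment_csr_sketch(src, indptr, reduce="sum")    # rows 0-1 and rows 2-4
averaged = segment_csr_sketch(src, indptr, reduce="mean")
```

In the neighborhood-aggregation setting described above, `indptr` would hold the row splits of each query point's neighbor list, so each output row is the reduction over one neighborhood.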

Second note: the same error occurs with the CPU version of torch_cluster, so I wrote a neighborhood search for CPU runners only.
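A brute-force radius search of this kind can be sketched as follows. This is an illustration under assumptions, not the PR's implementation: the function name and the dictionary keys are hypothetical, and the all-pairs distance matrix costs O(n_queries × n_data) memory, which is only reasonable for small point clouds.

```python
import torch


def radius_neighbors_sketch(data: torch.Tensor, queries: torch.Tensor, radius: float) -> dict:
    """Return, for each query point, the indices of data points within `radius`.

    Hypothetical helper: the result uses a CSR-style layout where the neighbors
    of query i are index[row_splits[i]:row_splits[i + 1]].
    """
    # Squared pairwise distances, shape (n_queries, n_data).
    diff = queries[:, None, :] - data[None, :, :]
    sq_dists = (diff ** 2).sum(dim=-1)

    # Less than or equal, so points exactly on the sphere count as neighbors.
    in_radius = sq_dists <= radius ** 2

    # nonzero() returns indices in row-major order, so hits are grouped by query.
    neighbors_index = in_radius.nonzero()[:, 1]

    counts = in_radius.sum(dim=1)
    row_splits = torch.cat(
        [torch.zeros(1, dtype=counts.dtype, device=counts.device), counts.cumsum(dim=0)]
    )
    return {"neighbors_index": neighbors_index, "neighbors_row_splits": row_splits}


# Unit grid with queries at two opposite corners.
grid = torch.tensor([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
corners = torch.tensor([[0.0, 0.0], [1.0, 1.0]])
result = radius_neighbors_sketch(grid, corners, radius=1.0)
```

The row splits produced here have the same layout as the `indptr` argument of a segment reduction, so the two pieces compose directly.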

dhpitt · 2023-10-10T18:47:13Z

Small empirical tests show that my neighbor search function runs in about 1/3 the time of torch_cluster.radius.

JeanKossaifi

Thanks for the changes @dhpitt -- should we have a small unit-test for the fallbacks?

JeanKossaifi · 2023-10-11T17:45:18Z

neuralop/layers/neighbor_search.py

-                                                          device=neighbors_count.device)), 
-                                                          dim=0)
+            if not self.use_torch_cluster:
+                return_dict = self.search_fn(data, queries, radius)


@kovachki Should we keep only the manual implementation and choose between open3d (fast) and the fallback (slow)?

dhpitt · 2023-10-11

Implemented.

kovachki · 2023-10-12

Yes, I think this is best. Let's just remove torch_cluster altogether.

JeanKossaifi · 2023-10-11T17:46:39Z

neuralop/layers/segment_csr.py

+    if torch.backends.cuda.is_built():
+        """only import torch_scatter when cuda is available"""
+        import torch_scatter.segment_csr as scatter_segment_csr
+        return scatter_segment_csr(src, indptr, reduce)


Hmm, CUDA being built doesn't necessarily mean torch_scatter will be available. Shouldn't we check for both?

dhpitt · 2023-10-11

For now I thought we would keep torch_scatter as a dependency, but only for GPU users. In that case, is it reasonable to assume it's available whenever CUDA is?

dhpitt · 2023-10-11

Logic is changed.
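One way to check both conditions, as raised in this thread, is to test for the package explicitly instead of inferring it from CUDA. The sketch below is an assumption about how such a check could look, not the logic that was committed; `module_available` and `should_use_torch_scatter` are hypothetical names, and the CUDA flag is passed in (for example from `torch.cuda.is_available()`) so the helper itself stays dependency-free.

```python
import importlib.util


def module_available(name: str) -> bool:
    """True if a top-level module can be found, without importing it."""
    return importlib.util.find_spec(name) is not None


def should_use_torch_scatter(cuda_available: bool) -> bool:
    """Use the compiled kernel only when CUDA is usable AND the package is installed.

    Hypothetical policy for illustration; otherwise callers fall back to pure Python.
    """
    return cuda_available and module_available("torch_scatter")


# On a CPU-only CI runner this is False regardless of what is installed.
use_scatter_on_cpu = should_use_torch_scatter(cuda_available=False)
```

Checking with `find_spec` avoids triggering the import itself, which matters here because the import is what raised the OSError on CI.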

kovachki · 2023-10-12

@dhpitt could you please check that your segment_csr works with backward? You're doing some in-place operations, so I'm not sure.

JeanKossaifi · 2023-10-12T18:00:01Z

@kovachki suggested also checking whether this works with backward. Can you add a simple sum loss and check the backward pass in the unit test?
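A backward check along these lines might look like the sketch below. Both function names are hypothetical and this is not the test added in the PR; the segment sum here builds its output with `torch.stack` rather than writing into a preallocated tensor, which is one way to sidestep the in-place concern.

```python
import torch


def segment_sum_loop(src: torch.Tensor, indptr: torch.Tensor) -> torch.Tensor:
    """Per-segment row sums, built with torch.stack (no in-place writes)."""
    bounds = indptr.tolist()
    return torch.stack(
        [src[start:end].sum(dim=0) for start, end in zip(bounds[:-1], bounds[1:])]
    )


def check_backward(fn, src: torch.Tensor, indptr: torch.Tensor) -> torch.Tensor:
    """Run `fn` under a simple sum loss and return the gradient w.r.t. `src`."""
    src = src.detach().clone().requires_grad_(True)
    loss = fn(src, indptr).sum()
    loss.backward()
    assert src.grad is not None, "no gradient reached the input"
    return src.grad


features = torch.arange(10.0).reshape(5, 2)
row_splits = torch.tensor([0, 2, 5])
grad = check_backward(segment_sum_loop, features, row_splits)
```

With a sum loss, each input row that belongs to a segment contributes exactly once, so the expected gradient is a tensor of ones with the same shape as the input.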

JeanKossaifi · 2023-10-12T22:19:08Z

Awesome, thanks @dhpitt !

dhpitt added 5 commits October 9, 2023 18:15

integral transform uses python segment_csr method

50f614b

remove torch_scatter from dependencies

367ce02

pure python replacements for segment_csr and neighbor search

5a479f6

remove torch cluster cpu install

5dead77

bug fix for cuda availability check

c756b46

dhpitt changed the title ~~Remove torch_scatter from dependencies~~ Remove torch_scatter and torch_cluster from CI pipeline's dependencies Oct 9, 2023

bug fix for edge case

82d9807

dhpitt added 5 commits October 10, 2023 18:50

remove print statement in integral transform

84f1700

fix indexing bug

9bb6d87

fix indexing bug

e81df69

install torch_cluster but force pure python segment_csr on cpu

a330d14

no installation of torch_scatter on ci

fdc1af0

JeanKossaifi reviewed Oct 11, 2023

dhpitt added 10 commits October 11, 2023 17:56

Merge branch 'main' into segment_csr

aa03af4

LEQ, not less than radius

f74bd9d

test neighbor search w/simple grid

b85dd4d

removed torch_cluster flag from FNOGNO

4eaaffe

removed torch_cluster flag from test_fnogno.py

d4a16ec

removed torch_cluster flag

21bb90b

only import scatter when available

cd25169

more elegant torch scatter check

021896b

smaller manual test

c7a064c

Merge branch 'main' into segment_csr

7744cea

dhpitt added 5 commits October 12, 2023 11:15

Merge branch 'main' into segment_csr

5fa6711

test segment_csr for backward

c867ac3

assert segment_csr produces grad

6a22749

inplace grad op

3cd7e5d

skip grad segment_csr for now

b04e5be

JeanKossaifi mentioned this pull request Oct 12, 2023

Fixing gradient backprop in #233 #236

Closed

turn off requires_grad to avoid unused params

53b7f2d

JeanKossaifi merged commit f543775 into neuraloperator:main Oct 12, 2023
1 check passed

dhpitt deleted the segment_csr branch October 30, 2023 21:53
