[OpenGL] dynamic range for-loop support #785

archibate · 2020-04-14T15:43:21Z

Related issue = #... (if any)

[Click here for the format server]

k-ye

LGTM with a few nits..

taichi/backends/opengl/opengl_api.cpp

taichi/codegen/codegen_opengl.cpp

archibate · 2020-04-15T16:50:52Z

taichi/backends/opengl/opengl_api.cpp

+      if (ker->rse.has_value()) ker->num_groups = (*ker->rse)((const char *)gtmp_base);
+      TI_DEBUG("kernel [{}] num_groups = {}", ker->kernel_name, ker->num_groups);


I specify dynamic work group size for dyn-range-kernels (thanks to the GL feature enable me).
Is this better than grid-stride-loop?
@k-ye @yuanming-hu Does other backends enables us to dynamically change work group size? If so we may make use of that?

For metal, I don't think so .

For CUDA, I guess the answer is no. A seemingly related feature is https://devblogs.nvidia.com/cuda-dynamic-parallelism-api-principles/

My feeling is grid-strided loops are usually good enough.

taichi/codegen/codegen_opengl.cpp

taichi/backends/opengl/opengl_api.cpp

archibate · 2020-04-16T16:37:03Z

TODO: remove unneeded check_opengl_error's.

k-ye

Cool! Most of my comments are nits or TODOs, and i think we're very close

examples/mpm128.py

taichi/backends/opengl/opengl_api.cpp

taichi/backends/opengl/opengl_api.h

taichi/codegen/codegen_opengl.cpp

taichi/backends/opengl/opengl_api.cpp

k-ye

Thanks LGTM! (with two more nits..)

examples/mpm128.py

taichi/backends/opengl/opengl_api.cpp

yuanming-hu · 2020-04-17T16:43:03Z

Thanks, @archibate, and @k-ye. Sorry about not participating in this PR activity - I got quite occupied by my research project this week. I'm merging this in now.

archibate added 7 commits April 14, 2020 23:39

dynamic range support at num_threads = 1

0ea865a

add test

9df3e7e

better TI_DEBUG

098556e

no print accessor glsl

df11e62

fix test_loop_args_as_range

3fa88a7

fix long-lasting state-leakage (related to the multi-offloading bug)

15bde3b

add ScopeIndent

7d9d33a

archibate marked this pull request as ready for review April 15, 2020 01:40

archibate requested review from yuanming-hu and k-ye April 15, 2020 01:40

k-ye reviewed Apr 15, 2020

View reviewed changes

taichi/backends/opengl/opengl_api.cpp Outdated Show resolved Hide resolved

taichi/codegen/codegen_opengl.cpp Outdated Show resolved Hide resolved

taichi/codegen/codegen_opengl.cpp Outdated Show resolved Hide resolved

archibate and others added 4 commits April 15, 2020 22:02

Merge branch 'master' into dyn

c085c96

[skip ci] no ad-hoc

22a5324

[skip ci] pre

c5c7059

dyn range = dyn work group size

68bea5e

archibate requested a review from k-ye April 15, 2020 16:46

[skip ci] clean

e384922

archibate commented Apr 15, 2020

View reviewed changes

k-ye reviewed Apr 16, 2020

View reviewed changes

taichi/codegen/codegen_opengl.cpp Outdated Show resolved Hide resolved

taichi/codegen/codegen_opengl.cpp Show resolved Hide resolved

taichi/backends/opengl/opengl_api.cpp Outdated Show resolved Hide resolved

archibate added 3 commits April 16, 2020 21:18

[skip ci] nit

12d8869

fix long-lasting NV-GL-not-preserving-data issue

3b85e4a

map/unmap gtmp for dyn range

1bd1887

[skip travis] fix win-build

ca2c318

k-ye reviewed Apr 16, 2020

View reviewed changes

[skip travis] fix non-GL again

5f136ca

archibate mentioned this pull request Apr 17, 2020

[OpenGL] NVIDIA GLSL stucks when compile or link some shaders #804

Closed

archibate added 3 commits April 17, 2020 10:00

[skip travis] fix

3e8aed7

[skip travis] apply reviews

ea17448

use class RangeSizeEvaluator_

da33271

archibate requested a review from k-ye April 17, 2020 02:54

k-ye approved these changes Apr 17, 2020

View reviewed changes

examples/mpm128.py Outdated Show resolved Hide resolved

taichi/backends/opengl/opengl_api.cpp Outdated Show resolved Hide resolved

taichi/backends/opengl/opengl_api.cpp Show resolved Hide resolved

archibate mentioned this pull request Apr 17, 2020

[OpenGL] fix NVIDIA's GLSL stuck at compile by sorting snodes #808

Merged

archibate added 2 commits April 18, 2020 00:39

[skip ci] revert mpm128

5cc25d1

[skip ci] nit: move rse

eb629bb

[skip ci] enforce code format

a84208d

yuanming-hu merged commit 76e4026 into taichi-dev:master Apr 17, 2020

yuanming-hu pushed a commit that referenced this pull request Apr 17, 2020

[OpenGL] Support range for-loops with non-constant bounds (#785)

b79d93d

yuanming-hu mentioned this pull request Apr 18, 2020

[lang] Support matrix initialization with a list of vectors #811

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OpenGL] dynamic range for-loop support #785

[OpenGL] dynamic range for-loop support #785

archibate commented Apr 14, 2020

k-ye left a comment

archibate Apr 15, 2020

k-ye Apr 16, 2020

yuanming-hu Apr 17, 2020

archibate commented Apr 16, 2020

k-ye left a comment

k-ye left a comment

yuanming-hu commented Apr 17, 2020

		if (ker->rse.has_value()) ker->num_groups = (ker->rse)((const char )gtmp_base);
		TI_DEBUG("kernel [{}] num_groups = {}", ker->kernel_name, ker->num_groups);

[OpenGL] dynamic range for-loop support #785

[OpenGL] dynamic range for-loop support #785

Conversation

archibate commented Apr 14, 2020

k-ye left a comment

Choose a reason for hiding this comment

archibate Apr 15, 2020

Choose a reason for hiding this comment

k-ye Apr 16, 2020

Choose a reason for hiding this comment

yuanming-hu Apr 17, 2020

Choose a reason for hiding this comment

archibate commented Apr 16, 2020

k-ye left a comment

Choose a reason for hiding this comment

k-ye left a comment

Choose a reason for hiding this comment

yuanming-hu commented Apr 17, 2020