Runtime call gpu by Chapaman · Pull Request #1677 · elixir-nx/nx

Chapaman · 2026-03-02T22:20:34Z

polvalente

This is basically in the right direction! I gotta check if there's more code we can share between the cuda and host paths.

polvalente · 2026-03-02T22:55:24Z

exla/Makefile


 SOURCES = $(EXLA_DIR)/exla.cc $(EXLA_DIR)/exla_client.cc $(EXLA_DIR)/exla_mlir.cc $(EXLA_DIR)/ipc.cc
-SOURCES += $(wildcard $(EXLA_DIR)/custom_calls/*.cc)
+SOURCES += $(filter-out $(EXLA_DIR)/custom_calls/runtime_callback_cuda.cc,$(wildcard $(EXLA_DIR)/custom_calls/*.cc))


There's gotta be a better way than this. Let's leave this for last, though.

polvalente · 2026-03-02T22:55:51Z

exla/Makefile

+OBJECTS += $(EXLA_CACHE_OBJ_DIR)/custom_calls/runtime_callback_cuda.o
+$(EXLA_CACHE_OBJ_DIR)/custom_calls/runtime_callback_cuda.o: $(EXLA_DIR)/custom_calls/runtime_callback_cuda.cc $(HEADERS)
+	@ mkdir -p $(EXLA_CACHE_OBJ_DIR)/custom_calls
+	$(NVCC) $(NVCCFLAGS) -c $< -o $@


I thought we wouldn't have to use gpu stuff for our custom calls

exla/test/exla/defn/runtime_call_test.exs

Co-authored-by: Paulo Valente <16843419+polvalente@users.noreply.github.com>

… enabled

The CUDA runtime_call handler (added in elixir-nx#1677) still used the old CallbackServer approach. Update it to register callbacks in the Outfeed struct and use the 4-arg InvokeRuntimeCallback (no PID). Since the host and CUDA handlers are now identical, merge them into a single clause with a `when platform in [:host, :cuda]` guard. Also update runtime_callback_cuda.cc to match the new bridge signature. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Chapaman added 3 commits February 27, 2026 19:40

added support for cuda and DtoH, HtoD approach to runtime_call

a543e17

first draft of the implementation

b0f8aec

fix formatting

80e74d1

polvalente reviewed Mar 2, 2026

View reviewed changes

Chapaman and others added 5 commits March 2, 2026 20:19

Update exla/test/exla/defn/runtime_call_test.exs

3e411b5

Co-authored-by: Paulo Valente <16843419+polvalente@users.noreply.github.com>

updated based on suggestions by polvalente

5901f63

fallback implementation for cuda disabled

31aa186

added a test to assert failure on runtime_callback_cuda when cuda not…

452bb9f

… enabled

docs: improve error message

751f48c

polvalente approved these changes Mar 6, 2026

View reviewed changes

polvalente marked this pull request as ready for review March 6, 2026 22:28

polvalente merged commit 1b112fd into elixir-nx:main Mar 6, 2026
17 of 18 checks passed

blasphemetheus mentioned this pull request Mar 11, 2026

CUDA runtime_call test fails: stub handler invoked despite CUDA_ENABLED build #1687

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Runtime call gpu#1677

Runtime call gpu#1677
polvalente merged 8 commits intoelixir-nx:mainfrom
Chapaman:runtime_call-gpu

Chapaman commented Mar 2, 2026 •

edited

Loading

Uh oh!

polvalente left a comment

Uh oh!

polvalente Mar 2, 2026

Uh oh!

polvalente Mar 2, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Chapaman commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

polvalente left a comment

Choose a reason for hiding this comment

Uh oh!

polvalente Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

polvalente Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Chapaman commented Mar 2, 2026 •

edited

Loading