[lld][Webassembly] Avoid a signed overflow on large sections by FatihBAKIR · Pull Request #178287 · llvm/llvm-project

FatihBAKIR · 2026-01-27T20:07:52Z

wasm sections sizes are specified as u32s, and thus can be as large as 4GB. wasm-ld currently stores the offset into a section as an int32_t which overflows on large sections and results in a crash. This change makes it a int64_t to accommodate any valid wasm section and allow catching even larger sections instead of wrapping around.

This PR fixes the issue by storing the offset as a int64_t, as well as adding extra checks to handle un-encodeable sections to fail instead of producing garbage wasm binaries, and also adds lit tests to make sure it works. I confirmed the test fails on main but passes with this fix.

Fixes: #178286

github-actions · 2026-01-27T20:08:16Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write permissions for the repository. In which case you can instead tag reviewers by name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

llvmbot · 2026-01-27T20:08:46Z

@llvm/pr-subscribers-lld-wasm

@llvm/pr-subscribers-lld

Author: Fatih BAKIR (FatihBAKIR)

Changes

wasm sections sizes are specified as u32s, and thus can be as large as 4GB. wasm-ld currently stores the offset into a section as an int32_t which overflows on large sections and results in a crash. This change makes it a uint32_t to accommodate any valid wasm section.

This PR fixes the issue by storing the offset as a uint32_t, and also adds a lit test to make sure it works. I confirmed the test fails on main but passes with this fix.

Fixes #178286

Full diff: https://github.com/llvm/llvm-project/pull/178287.diff

2 Files Affected:

(added) lld/test/wasm/large-section.test (+34)
(modified) lld/wasm/InputChunks.h (+1-1)

diff --git a/lld/test/wasm/large-section.test b/lld/test/wasm/large-section.test
new file mode 100644
index 0000000000000..6f399000d9c89
--- /dev/null
+++ b/lld/test/wasm/large-section.test
@@ -0,0 +1,34 @@
+# RUN: split-file %s %t
+# RUN: llvm-mc -filetype=obj -triple=wasm32-unknown-unknown %t/chunk1.s -o %t/chunk1.o
+# RUN: llvm-mc -filetype=obj -triple=wasm32-unknown-unknown %t/chunk2.s -o %t/chunk2.o
+# --no-gc-sections to prevent the linker from optimizing the chunk away, otherwise it produces a tiny output
+# RUN: wasm-ld --no-entry --no-gc-sections %t/chunk1.o %t/chunk2.o -o %t/combined.wasm
+# RUN: llvm-readobj --sections %t/combined.wasm | FileCheck %s
+
+# Just making sure the linker doesn't crash for now and it has the combined, gigantic section, may need a better check
+# CHECK: Size: 2348810260
+
+# A 2GB + some extra bytes of data to make sure we go over 2G 
+#--- chunk1.s
+.section .data.chunk1,"",@
+.globl chunk1_start
+.type chunk1_start,@object
+chunk1_start:
+  .int32 0xAAAAAAAA
+  .int32 0xBBBBBBBB
+  .zero 2214592504
+  .int32 0xCCCCCCCC
+  .int32 0xDDDDDDDD
+.size chunk1_start, 2214592512
+
+#--- chunk2.s
+.section .data.chunk2,"",@
+.globl chunk2_start
+.type chunk2_start,@object
+chunk2_start:
+  .int32 0x11111111
+  .int32 0x22222222
+  .zero 134217712
+  .int32 0x44444444
+  .int32 0x55555555
+.size chunk2_start, 134217728
diff --git a/lld/wasm/InputChunks.h b/lld/wasm/InputChunks.h
index 1fe78d76631f1..462fa766081e6 100644
--- a/lld/wasm/InputChunks.h
+++ b/lld/wasm/InputChunks.h
@@ -97,7 +97,7 @@ class InputChunk {
 
   // After assignAddresses is called, this represents the offset from
   // the beginning of the output section this chunk was assigned to.
-  int32_t outSecOff = 0;
+  uint32_t outSecOff = 0;
 
   uint8_t sectionKind : 3;

sbc100 · 2026-01-27T21:22:25Z

Maybe "large section" -> "large data sections" in the PR title?

sbc100

Can you prefix the PR title with [lld][Webassembly]?

lgtm!

Although i makes me wonder if we have a test for when the 4gb limit is reached? Do we have a reasonable error in that case? I would guess not :-/

lgtm to this change as it though since it fixes the immediate issue.

FatihBAKIR · 2026-01-27T21:41:09Z

Updated the title with the prefix, but since the member is used in all section types (including debug info, function etc), I believe it applies to all types, and we have experienced it with debug info sections too.

dschuff · 2026-01-27T21:41:19Z

Yeah, we should probably have better coverage of this kind of thing for custom sections too, since huge custom sections are even more common than huge data sections. Like, test with 2-4G custom sections, maybe multiple of them, etc.

Actually, thinking about this test a little more... why does this test result in a big data sections? We shouldn't be generating any data segments at all for huge runs of zeros, should we? Or do we not have that optimization in lld?

FatihBAKIR · 2026-01-27T21:43:47Z

For catching even larger sections I think we should change the member to an [u]int64_t and check we aren't going over the 4G mark.

For the full zeros thing, I defensively added the non-zero int32_ts to the beginning and the end of the sections to avoid the linker from optimizing it away, not sure if it recognizes it though.

sbc100 · 2026-01-27T21:44:26Z

Oh sorry, I was thinking the InputChunk was only used to data segments... but if its used to custom sections too then this change LGTM

dschuff · 2026-01-27T21:44:44Z

Interesting... actually maybe we could also just write a test like this one, but with .section .debug_foo instead of a data section.

lld/test/wasm/large-section.test

wasm sections sizes are specified as u32s, and thus can be as large as 4GB. wasm-ld currently stores the offset into a section as an int32_t which overflows on large sections and results in a crash. This change makes it a int64_t to accommodate any valid wasm section. This change also catches section size overflows early and fail instead of wrapping and producing a corrupt wasm binary.

FatihBAKIR · 2026-02-10T19:12:45Z

I added 2 more tests, one for checking debug sections specifically, and one that goes even above 4GB and expects the linker to fail instead of silently producing a broken wasm binary.

sbc100

Thanks for the extra tests and error checks!

sbc100 · 2026-02-10T19:25:23Z

Can you update the PR description which still mentions uint32_t?

FatihBAKIR · 2026-02-10T19:31:31Z

Oops, forgot the PR description, updated

github-actions · 2026-02-10T20:04:40Z

@FatihBAKIR Congratulations on having your first Pull Request (PR) merged into the LLVM Project!

Your changes will be combined with recent changes from other authors, then tested by our build bots. If there is a problem with a build, you may receive a report in an email or a comment on this PR.

Please check whether problems have been caused by your change specifically, as the builds can include changes from many authors. It is not uncommon for your change to be included in a build that fails due to someone else's changes, or infrastructure issues.

How to do this, and the rest of the post-merge process, is covered in detail here.

If your change does cause a problem, it may be reverted, or you can revert it yourself. This is a normal part of LLVM development. You can fix your changes and open a new PR to merge them again.

If you don't get any reports, no action is required from you. Your changes are working as expected, well done!

llvm-ci · 2026-02-10T20:08:58Z

LLVM Buildbot has detected a new failure on builder lldb-x86_64-debian running on lldb-x86_64-debian while building lld at step 6 "test".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/162/builds/40738

Here is the relevant piece of the build log for the reference

Step 6 (test) failure: build (failure)
...
PASS: lldb-api :: functionalities/data-formatter/root-reference-children/TestRootReferenceChildren.py (616 of 3388)
PASS: lldb-api :: commands/expression/ir-interpreter-phi-nodes/TestIRInterpreterPHINodes.py (617 of 3388)
PASS: lldb-api :: functionalities/breakpoint/breakpoint_command/TestBreakpointCommandsFromPython.py (618 of 3388)
PASS: lldb-api :: macosx/duplicate-archive-members/TestDuplicateMembers.py (619 of 3388)
PASS: lldb-shell :: Settings/TestChildDepthTruncation.test (620 of 3388)
PASS: lldb-api :: commands/expression/dollar-in-variable/TestDollarInVariable.py (621 of 3388)
PASS: lldb-api :: functionalities/memory/cache/TestMemoryCache.py (622 of 3388)
PASS: lldb-api :: tools/lldb-server/TestGdbRemoteFork.py (623 of 3388)
PASS: lldb-api :: lang/c/struct_types/TestStructTypes.py (624 of 3388)
UNRESOLVED: lldb-api :: python_api/run_locker/TestRunLocker.py (625 of 3388)
******************** TEST 'lldb-api :: python_api/run_locker/TestRunLocker.py' FAILED ********************
Script:
--
/usr/bin/python3 /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/lldb/test/API/dotest.py -u CXXFLAGS -u CFLAGS --env LLVM_LIBS_DIR=/home/worker/2.0.1/lldb-x86_64-debian/build/./lib --env LLVM_INCLUDE_DIR=/home/worker/2.0.1/lldb-x86_64-debian/build/include --env LLVM_TOOLS_DIR=/home/worker/2.0.1/lldb-x86_64-debian/build/./bin --arch x86_64 --build-dir /home/worker/2.0.1/lldb-x86_64-debian/build/lldb-test-build.noindex --lldb-module-cache-dir /home/worker/2.0.1/lldb-x86_64-debian/build/lldb-test-build.noindex/module-cache-lldb/lldb-api --clang-module-cache-dir /home/worker/2.0.1/lldb-x86_64-debian/build/lldb-test-build.noindex/module-cache-clang/lldb-api --executable /home/worker/2.0.1/lldb-x86_64-debian/build/./bin/lldb --compiler /home/worker/2.0.1/lldb-x86_64-debian/build/./bin/clang --dsymutil /home/worker/2.0.1/lldb-x86_64-debian/build/./bin/dsymutil --make /usr/bin/gmake --llvm-tools-dir /home/worker/2.0.1/lldb-x86_64-debian/build/./bin --lldb-obj-root /home/worker/2.0.1/lldb-x86_64-debian/build/tools/lldb --lldb-libs-dir /home/worker/2.0.1/lldb-x86_64-debian/build/./lib --cmake-build-type Release -t /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/lldb/test/API/python_api/run_locker -p TestRunLocker.py
--
Exit Code: -11

Command Output (stdout):
--
lldb version 23.0.0git (https://github.com/llvm/llvm-project.git revision c703f5a1632973dd6eade473614dfbed1b088d9e)
  clang revision c703f5a1632973dd6eade473614dfbed1b088d9e
  llvm revision c703f5a1632973dd6eade473614dfbed1b088d9e
"can't evaluate expressions when the process is running."

--
Command Output (stderr):
--
Change dir to: /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/lldb/test/API/python_api/run_locker
runCmd: settings clear --all

output: 

runCmd: settings set symbols.enable-external-lookup false

output: 

runCmd: settings set target.inherit-tcc true

output: 

runCmd: settings set target.disable-aslr false

output: 

runCmd: settings set target.detach-on-error false

output: 

runCmd: settings set target.auto-apply-fixits false

omjavaid · 2026-02-16T15:54:09Z

This is failing on 32 bit Arm Linux:

https://lab.llvm.org/staging/#/builders/160/builds/1189

lld/test/wasm/large-debug-section.test
lld/test/wasm/large-section.test
lld/test/wasm/section-too-large.test

The failure got masked because bot was already failing.

omjavaid · 2026-02-16T16:43:18Z

Looking at the logs tests are failing on ARM 32-bit with OOM errors possibly hitting the 32-bit process address space limit around 3GB. Not sure if we should just mark these UNSUPPORTED on 32-bit or if theres something off with llvm-mc allocating the full 2.3GB for .zero 2214592504 directives - seems like it shouldnt need to materialize all that in memory but maybe im missing something about wasm object format requirements

@sbc100 since you merged this - any thoughts on whether this is expected behavior or if theres a better approach here? Should I just add UNSUPPORTED for 32-bit targets or is the memory allocation worth looking into

…178287)" This reverts commit c703f5a.

omjavaid · 2026-02-17T11:54:49Z

This is failing on 32 bit Arm Linux:

https://lab.llvm.org/staging/#/builders/160/builds/1189

lld/test/wasm/large-debug-section.test lld/test/wasm/large-section.test lld/test/wasm/section-too-large.test

The failure got masked because bot was already failing.

@FatihBAKIR I am going revert this change temporarily to unblock the buildbot.

…178287)" This reverts commit c703f5a. I have reverted this change as it was failing lld arm 32bit buildbot. https://lab.llvm.org/staging/#/builders/160/builds/1189

FatihBAKIR · 2026-02-17T17:06:00Z

I think we should mark the tests as unavailable on arm32 until we have a proper fix for out of memory issues. I wouldn't mind the revert if it was reverting a regression, but it would just remove tests that exercise an existing bug.

sbc100 · 2026-02-17T20:11:08Z

I guess these tests are fundamentally using more memory than is available on 32-bit systems?

Do we have any way to express that certain tests require this much memory?

I guess the memory requirements are unavoidable since we are creating a file which needs to be >2gb in size and we use mmap for the output file.

…8287) wasm sections sizes are specified as u32s, and thus can be as large as 4GB. wasm-ld currently stores the offset into a section as an int32_t which overflows on large sections and results in a crash. This change makes it a int64_t to accommodate any valid wasm section and allow catching even larger sections instead of wrapping around. This PR fixes the issue by storing the offset as a int64_t, as well as adding extra checks to handle un-encodeable sections to fail instead of producing garbage wasm binaries, and also adds lit tests to make sure it works. I confirmed the test fails on `main` but passes with this fix. Fixes: llvm#178286

…lvm#178287)" This reverts commit c703f5a. I have reverted this change as it was failing lld arm 32bit buildbot. https://lab.llvm.org/staging/#/builders/160/builds/1189

teresajohnson · 2026-02-19T15:07:30Z

I see that this was reverted, but I noticed in the meantime that the tests are leaving some very large files on my local disk. I'm not sure what the policy is for this, but ideally a few tests wouldn't leave multiple multi-gig files. Here are currently the largest files on my disk, all from a single client's test directory, and I have multiple clients in use. Can the tests clean up after themselves?

2.2G ./llvm/llvm_m5_build/tools/lld/test/wasm/Output/large-section.test.tmp/combined.wasm
2.2G ./llvm/llvm_m5_build/tools/lld/test/wasm/Output/large-debug-section.test.tmp/combined.wasm
2.1G ./llvm/llvm_m5_build/tools/lld/test/wasm/Output/section-too-large.test.tmp/chunk2.o
2.1G ./llvm/llvm_m5_build/tools/lld/test/wasm/Output/section-too-large.test.tmp/chunk1.o
2.1G ./llvm/llvm_m5_build/tools/lld/test/wasm/Output/large-section.test.tmp/chunk1.o
2.1G ./llvm/llvm_m5_build/tools/lld/test/wasm/Output/large-debug-section.test.tmp/debug1.o

wasm sections sizes are specified as u32s, and thus can be as large as 4GB. wasm-ld currently stores the offset into a section as an int32_t which overflows on large sections and results in a crash. This change makes it a int64_t to accommodate any valid wasm section and allow catching even larger sections instead of wrapping around. This PR fixes the issue by storing the offset as a int64_t, as well as adding extra checks to handle un-encodeable sections to fail instead of producing garbage wasm binaries, and also adds lit tests to make sure it works. I confirmed the test fails on main but passes with this fix. This is the same as #178287 but deletes the temporary files the tests create and requires the tests run on a 64-bit platform to avoid OOM issues due to the large binaries it creates.

…ns (#183225) wasm sections sizes are specified as u32s, and thus can be as large as 4GB. wasm-ld currently stores the offset into a section as an int32_t which overflows on large sections and results in a crash. This change makes it a int64_t to accommodate any valid wasm section and allow catching even larger sections instead of wrapping around. This PR fixes the issue by storing the offset as a int64_t, as well as adding extra checks to handle un-encodeable sections to fail instead of producing garbage wasm binaries, and also adds lit tests to make sure it works. I confirmed the test fails on main but passes with this fix. This is the same as llvm/llvm-project#178287 but deletes the temporary files the tests create and requires the tests run on a 64-bit platform to avoid OOM issues due to the large binaries it creates.

llvmbot added lld lld:wasm labels Jan 27, 2026

sbc100 reviewed Jan 27, 2026

View reviewed changes

FatihBAKIR changed the title ~~[wasm-ld] Avoid a signed overflow on large sections~~ [lld][Webassembly] Avoid a signed overflow on large sections Jan 27, 2026

sbc100 approved these changes Jan 27, 2026

View reviewed changes

dschuff reviewed Jan 27, 2026

View reviewed changes

lld/test/wasm/large-section.test Outdated Show resolved Hide resolved

sbc100 reviewed Jan 27, 2026

View reviewed changes

lld/test/wasm/large-section.test Outdated Show resolved Hide resolved

FatihBAKIR force-pushed the wasm-ld-crash-fix branch from 0a97d96 to a765aa9 Compare February 10, 2026 19:11

sbc100 approved these changes Feb 10, 2026

View reviewed changes

sbc100 merged commit c703f5a into llvm:main Feb 10, 2026
10 checks passed

omjavaid added a commit that referenced this pull request Feb 17, 2026

Revert "[lld][Webassembly] Avoid a signed overflow on large sections (#…

aa95a8f

…178287)" This reverts commit c703f5a.

omjavaid mentioned this pull request Feb 17, 2026

Revert "[lld][Webassembly] Avoid a signed overflow on large sections … #181807

Open

FatihBAKIR mentioned this pull request Feb 25, 2026

[lld][Webassembly] Avoid a signed overflow on large sections #183225

Merged

Conversation

FatihBAKIR commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 27, 2026

Uh oh!

llvmbot commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sbc100 commented Jan 27, 2026

Uh oh!

sbc100 left a comment

Choose a reason for hiding this comment

Uh oh!

FatihBAKIR commented Jan 27, 2026

Uh oh!

dschuff commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

FatihBAKIR commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sbc100 commented Jan 27, 2026

Uh oh!

dschuff commented Jan 27, 2026

Uh oh!

Uh oh!

Uh oh!

FatihBAKIR commented Feb 10, 2026

Uh oh!

sbc100 left a comment

Choose a reason for hiding this comment

Uh oh!

sbc100 commented Feb 10, 2026

Uh oh!

FatihBAKIR commented Feb 10, 2026

Uh oh!

Uh oh!

github-actions bot commented Feb 10, 2026

Uh oh!

llvm-ci commented Feb 10, 2026

Uh oh!

omjavaid commented Feb 16, 2026

Uh oh!

omjavaid commented Feb 16, 2026

Uh oh!

omjavaid commented Feb 17, 2026

Uh oh!

FatihBAKIR commented Feb 17, 2026

Uh oh!

sbc100 commented Feb 17, 2026

Uh oh!

teresajohnson commented Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

FatihBAKIR commented Jan 27, 2026 •

edited

Loading

llvmbot commented Jan 27, 2026 •

edited

Loading

dschuff commented Jan 27, 2026 •

edited

Loading

FatihBAKIR commented Jan 27, 2026 •

edited

Loading