[BREAKING][fix] Use jagged tensor as default tensor type by 0oshowero0 · Pull Request #92 · Ascend/TransferQueue

0oshowero0 · 2026-05-07T11:21:45Z

Background

Previously, TransferQueue would try torch.stack() first when merging per-sample tensors into a batched tensordict for user retrieval. As a result, tensors with uniform shapes were returned as regular dense tensors, while jagged data fell back to nested tensors. This inconsistency forced downstream code to handle two distinct data types (torch.Tensor vs. nested tensor), adding unnecessary branching logic.

Changes

This PR changes the default aggregation strategy so that all tensor fields are returned as nested tensors by default, eliminating the torch.stack() fast-path.

Specifically:

KVStorageManager._merge_tensors_to_tensordict: Removed the torch.stack(chunk) fallback. The new chain is as_nested_tensor(jagged) → nested_tensor(strided) → NonTensorStack.
AsyncSimpleStorageManager._pack_field_values: Removed the torch.stack(values) fast-path for uniform-shape tensors. The new in is as_nested_tensor(jagged) → as_nested_tensor(strided) → NonTensorStack, consistent with the KV backend.
Unified strided fallback: Added the missing strided layout fallback to KVStorageManager, ensuring both backends behave identically when jagged layout fails (e.g., for zero-dim tensors).
Docstring & comment cleanup: Updated all outdated docstrings and comments that referenced the old torch.stack-first behavior.

Test updates

Adapted test_async_simple_storage_manager.py, test_kv_storage_manager.py, and e2e tests to accept nested tensors as the default return type.
Reworked the test_kv_storage_manager.py fixture to use realistic variable-length fields (input_ids, prompt_ids, response_ids, response_mask) aligned the single_controller_demo.py schema, replacing the oversimplified text/label/mask example.
Replaced all torch.equal(dense, nested) assertions with safe per-component comparisons (unbind(0) + torch.equal) to accommodate the new nested-tensor contract

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

ascend-robot · 2026-05-07T11:21:58Z

CLA Signature Pass

0oshowero0, thanks for your pull request. All authors of the commits have signed the CLA. 👍

Copilot

Pull request overview

This PR changes TransferQueue’s tensor aggregation contract so that batched tensor fields are returned as nested tensors by default (instead of using torch.stack() when shapes are uniform), aligning behavior across backends and reducing downstream branching on return types.

Changes:

Removed the dense torch.stack() fast-path during aggregation in both KV and async simple storage manager codepaths, preferring as_nested_tensor(..., layout=jagged) with a strided fallback.
Updated metadata/docs/comments to reflect the new “nested-by-default” behavior.
Updated unit + e2e tests to compare nested tensors safely (per-sample comparisons) and to use more realistic variable-length test data.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
transfer_queue/utils/common.py	Updates docstring to reflect broader “tensor aggregation” use (not just `torch.stack`).
transfer_queue/storage/managers/simple_storage_manager.py	Removes `torch.stack` fast-path; always builds nested tensors (jagged → strided).
transfer_queue/storage/managers/base.py	KV merge path now always attempts nested tensors (jagged → strided) before `NonTensorStack`.
tests/test_kv_storage_manager.py	Updates fixtures/assertions for nested-by-default behavior and variable-length fields.
tests/test_async_simple_storage_manager.py	Updates `_pack_field_values` expectations: uniform tensors now return nested tensors.
tests/e2e/test_kv_interface_e2e.py	Updates equality/close helpers to handle nested vs dense comparisons.
tests/e2e/test_e2e_lifecycle_consistency.py	Updates e2e verification helpers and comparisons for nested-by-default return values.

Comments suppressed due to low confidence (1)

tests/e2e/test_e2e_lifecycle_consistency.py:896

np_array is treated as a dense 2D tensor ([0, 0]), but earlier in this file it’s noted it may be returned as a nested tensor by default. For nested tensors, update the mutation/assertion to use per-sample indexing (e.g., [0][0]) to avoid indexing errors.

        # 8. np_array: verify it's a tensor now (TensorDict auto-converts numeric numpy)
        # If it's a tensor, writability is guaranteed by nested tensor creation
        np_arr_retrieved = retrieved["np_array"]
        if isinstance(np_arr_retrieved, torch.Tensor):
            np_arr_retrieved[0, 0] = 22222.0
            assert np_arr_retrieved[0, 0].item() == 22222.0, "np_array (as tensor) should be writable"

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

ascend-robot · 2026-05-08T03:16:41Z

CLA Signature Pass

0oshowero0, thanks for your pull request. All authors of the commits have signed the CLA. 👍

Copilot

Pull request overview

Copilot reviewed 12 out of 12 changed files in this pull request and generated 1 comment.

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

ascend-robot · 2026-05-08T03:30:19Z

CLA Signature Pass

0oshowero0, thanks for your pull request. All authors of the commits have signed the CLA. 👍

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

ascend-robot · 2026-05-08T03:33:37Z

CLA Signature Pass

0oshowero0, thanks for your pull request. All authors of the commits have signed the CLA. 👍

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

ascend-robot · 2026-05-08T03:34:08Z

CLA Signature Pass

0oshowero0, thanks for your pull request. All authors of the commits have signed the CLA. 👍

Copilot

Pull request overview

Copilot reviewed 11 out of 11 changed files in this pull request and generated 2 comments.

+            if all(isinstance(v, torch.Tensor) for v in chunk) and all(v.dim() == 0 for v in chunk):
+                return field, torch.stack(chunk)


  {
   "cell_type": "code",
-   "execution_count": 23,
   "metadata": {},
+   "source": [],
   "outputs": [],
-   "source": []
+   "execution_count": 23
  }


use jagged tensor by default

6efd2ad

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

Copilot AI review requested due to automatic review settings May 7, 2026 11:21

ascend-robot added the ascend-cla/yes label May 7, 2026

Copilot started reviewing on behalf of 0oshowero0 May 7, 2026 11:22 View session

Copilot AI reviewed May 7, 2026

View reviewed changes

tardis-key mentioned this pull request May 8, 2026

'Tensor' object has no attribute 'offsets' in main_ppo_sync & transfer_queue verl-project/verl#6261

Closed

4 tasks

fix

c002139

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

0oshowero0 requested a review from Copilot May 8, 2026 03:16

Copilot started reviewing on behalf of 0oshowero0 May 8, 2026 03:17 View session

Copilot AI reviewed May 8, 2026

View reviewed changes

Comment thread transfer_queue/storage/managers/base.py Outdated

fix

e1b826c

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

0oshowero0 requested a review from Copilot May 8, 2026 03:30

Copilot started reviewing on behalf of 0oshowero0 May 8, 2026 03:31 View session

fix

63b4891

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

fix

96adb46

Signed-off-by: 0oshowero0 <o0shower0o@outlook.com>

Copilot AI reviewed May 8, 2026

View reviewed changes

ji-huazhong approved these changes May 9, 2026

View reviewed changes

ji-huazhong merged commit d147a33 into Ascend:main May 9, 2026
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BREAKING][fix] Use jagged tensor as default tensor type#92

[BREAKING][fix] Use jagged tensor as default tensor type#92
ji-huazhong merged 5 commits intoAscend:mainfrom
0oshowero0:jagged_fix

0oshowero0 commented May 7, 2026

Uh oh!

ascend-robot commented May 7, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ascend-robot commented May 8, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

ascend-robot commented May 8, 2026

Uh oh!

ascend-robot commented May 8, 2026

Uh oh!

ascend-robot commented May 8, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		if all(isinstance(v, torch.Tensor) for v in chunk) and all(v.dim() == 0 for v in chunk):
		return field, torch.stack(chunk)

Conversation

0oshowero0 commented May 7, 2026

Background

Changes

Test updates

Uh oh!

ascend-robot commented May 7, 2026

CLA Signature Pass

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ascend-robot commented May 8, 2026

CLA Signature Pass

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

ascend-robot commented May 8, 2026

CLA Signature Pass

Uh oh!

ascend-robot commented May 8, 2026

CLA Signature Pass

Uh oh!

ascend-robot commented May 8, 2026

CLA Signature Pass

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants