[QDP] Fix latency benchmark to use batch encoding #794

guan404ming · 2026-01-05T11:04:24Z

Purpose of PR

Added encode_batch() method to Python bindings accepting NumPy 2D array directly (zero-copy)
Updated run_mahout() to process entire batch in single GPU kernel call

Related Issues or PRs

Changes Made

Breaking Changes

Yes
No

Checklist

Added or updated unit tests for all changes
Added or updated documentation for all changes
Successfully built and ran all unit tests or manual tests locally
PR title follows "MAHOUT-XXX: Brief Description" format (if related to an issue)
Code follows ASF guidelines

guan404ming · 2026-01-05T11:06:06Z

(qdp-python) titan% uv run python benchmark/benchmark_latency.py --qubits 12 --batches 20 --batch-size 8 --prefetch 4
Uninstalled 1 package in 1ms
Installed 1 package in 5ms
Generating 160 samples of 12 qubits...
  Batch size   : 8
  Vector length: 4096
  Batches      : 20
  Prefetch     : 4
  Frameworks   : pennylane, qiskit-init, qiskit-statevector, mahout
  Generated 160 samples
  PennyLane/Qiskit format: 5.00 MB
  Mahout format: 5.00 MB

======================================================================
DATA-TO-STATE LATENCY BENCHMARK: 12 Qubits, 160 Samples
======================================================================

[PennyLane] Full Pipeline (DataLoader -> GPU)...
  Total Time: 0.0423 s (0.264 ms/vector)

[Qiskit Initialize] Full Pipeline (DataLoader -> GPU)...
  Total Time: 11.8223 s (73.890 ms/vector)

[Qiskit Statevector] Full Pipeline (DataLoader -> GPU)...
  Total Time: 0.0222 s (0.139 ms/vector)

[Mahout] Full Pipeline (DataLoader -> GPU)...
  Total Time: 0.0157 s (0.098 ms/vector)

======================================================================
LATENCY (Lower is Better)
Samples: 160, Qubits: 12
======================================================================
Mahout                  0.098 ms/vector
Qiskit Statevector      0.139 ms/vector
PennyLane               0.264 ms/vector
Qiskit Initialize      73.890 ms/vector
----------------------------------------------------------------------
Speedup vs PennyLane:            2.69x
Speedup vs Qiskit Init:         751.84x
Speedup vs Qiskit Statevec:       1.41x

400Ping · 2026-01-05T11:08:36Z

Thanks for the fix! I was referencing benchmark_throughput when I wrote the code so I didn't notice this issue. I also opened a issue to fix benchmark_throughput to batch encoding.

guan404ming · 2026-01-05T11:13:56Z

Thanks for the fix! I was referencing benchmark_throughput when I wrote the code so I didn't notice this issue. I also opened a issue to fix benchmark_throughput to batch encoding.

Sure, let's do it after this one.

guan404ming · 2026-01-05T11:15:06Z

cc @ryankert01 @rich7420

ryankert01 · 2026-01-05T11:30:40Z

This is nice

guan404ming · 2026-01-05T11:32:11Z

Thanks!

guan404ming force-pushed the fix/benchmark-batch-encoding branch 2 times, most recently from e076c25 to c5897de Compare January 5, 2026 11:11

Fix benchmark to use batch encoding

82eaf73

guan404ming force-pushed the fix/benchmark-batch-encoding branch from c5897de to 82eaf73 Compare January 5, 2026 11:13

guan404ming marked this pull request as ready for review January 5, 2026 11:14

guan404ming changed the title ~~[QDP] Fix benchmark to use batch encoding~~ [QDP] Fix latency benchmark to use batch encoding Jan 5, 2026

ryankert01 approved these changes Jan 5, 2026

View reviewed changes

guan404ming merged commit 4c99293 into apache:dev-qdp Jan 5, 2026
2 checks passed

guan404ming deleted the fix/benchmark-batch-encoding branch January 5, 2026 11:32

guan404ming added a commit that referenced this pull request Jan 6, 2026

[QDP] Fix benchmark to use batch encoding (#794)

509c28f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[QDP] Fix latency benchmark to use batch encoding #794

[QDP] Fix latency benchmark to use batch encoding #794

guan404ming commented Jan 5, 2026

Uh oh!

guan404ming commented Jan 5, 2026

Uh oh!

400Ping commented Jan 5, 2026

Uh oh!

guan404ming commented Jan 5, 2026

Uh oh!

guan404ming commented Jan 5, 2026

Uh oh!

ryankert01 commented Jan 5, 2026

Uh oh!

Uh oh!

guan404ming commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[QDP] Fix latency benchmark to use batch encoding #794

[QDP] Fix latency benchmark to use batch encoding #794

Conversation

guan404ming commented Jan 5, 2026

Purpose of PR

Related Issues or PRs

Changes Made

Breaking Changes

Checklist

Uh oh!

guan404ming commented Jan 5, 2026

Uh oh!

400Ping commented Jan 5, 2026

Uh oh!

guan404ming commented Jan 5, 2026

Uh oh!

guan404ming commented Jan 5, 2026

Uh oh!

ryankert01 commented Jan 5, 2026

Uh oh!

Uh oh!

guan404ming commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants