Enables comprehensive benchmark configurations #66

LoserCheems · 2025-07-10T12:48:32Z

Uncomments all test configurations to enable comprehensive performance benchmarking across multiple dimensions including sequence length, batch size, head count, and head dimension variations.

Updates sequence lengths in batch size, head count, and head dimension tests from 1024 to 4096 for more realistic testing scenarios.

Activates non-causal attention testing with updated configuration.

Removes duplicate num_runs assignment to eliminate redundancy.

Copilot

Pull Request Overview

This PR enables comprehensive performance benchmarking by uncommenting and updating all test configurations, adjusting sequence lengths for more realistic scenarios, activating non-causal attention tests, and removing a redundant num_runs assignment.

Uncomments and expands the configs list to cover sequence length, inference, batch size, head count, and head dimension variations.
Updates sequence lengths from 1024 to 4096 for batch size, head count, and head dimension tests.
Activates the non-causal attention test with the updated configuration and removes the duplicate num_runs reassignment.

Copilot · 2025-07-10T12:49:24Z

benchmarks/benchmark_forward_performance.py

    configs = [
-        # # Vary sequence length
-        # (1, 2, 1, 256, 256, 128, 2048, True),
-        # (1, 2, 1, 512, 512, 128, 2048, True),
-        # (1, 2, 1, 1024, 1024, 128, 2048, True),
-        # (1, 2, 1, 2048, 2048, 128, 2048, True),
-        # (1, 2, 1, 4096, 4096, 128, 2048, True),
-        # (1, 2, 1, 8192, 8192, 128, 2048, True),
-        # (1, 2, 1, 16384, 16384, 128, 2048, True),
-        # (1, 2, 1, 32768, 32768, 128, 2048, True),
-
-        # # Inference
-        # (1, 2, 1, 2, 256, 128, 2048, True),
-        # (1, 2, 1, 2, 512, 128, 2048, True),
-        # (1, 2, 1, 2, 1024, 128, 2048, True),
-        # (1, 2, 1, 2, 2048, 128, 2048, True),
-        # (1, 2, 1, 2, 4096, 128, 2048, True),
-        # (1, 2, 1, 2, 8192, 128, 2048, True),
-        # (1, 2, 1, 2, 16384, 128, 2048, True),
-        # (1, 2, 1, 2, 32768, 128, 2048, True),
-        # (1, 2, 1, 2, 65536, 128, 2048, True),
-        # (1, 2, 1, 2, 131072, 128, 2048, True),
-        # (1, 2, 1, 2, 262144, 128, 2048, True),
-        # (1, 2, 1, 2, 524288, 128, 2048, True),
-
-        # # Vary batch size
-        # (1, 2, 1, 1024, 1024, 32, 2048, True),
-        # (2, 2, 1, 1024, 1024, 32, 2048, True),
-        # (4, 2, 1, 1024, 1024, 32, 2048, True),
-        # (8, 2, 1, 1024, 1024, 32, 2048, True),
-
-        # # Vary head count
-        # (1, 1, 1, 1024, 1024, 32, 2048, True),
-        # (1, 2, 1, 1024, 1024, 32, 2048, True),
-        # (1, 4, 1, 1024, 1024, 32, 2048, True),
-        # (1, 8, 2, 1024, 1024, 32, 2048, True),
-
-        # # Vary head dimension
-        # (1, 2, 1, 1024, 1024, 32, 2048, True),
-        # (1, 2, 1, 1024, 1024, 64, 2048, True),
-        # (1, 2, 1, 1024, 1024, 96, 2048, True),
-        # (1, 2, 1, 1024, 1024, 128, 2048, True),
-        # (1, 2, 1, 1024, 1024, 192, 2048, True),
-        # (1, 2, 1, 1024, 1024, 256, 2048, True),
+        # Vary sequence length
+        (1, 2, 1, 256, 256, 128, 2048, True),
+        (1, 2, 1, 512, 512, 128, 2048, True),
+        (1, 2, 1, 1024, 1024, 128, 2048, True),
+        (1, 2, 1, 2048, 2048, 128, 2048, True),
+        (1, 2, 1, 4096, 4096, 128, 2048, True),
+        (1, 2, 1, 8192, 8192, 128, 2048, True),
+        (1, 2, 1, 16384, 16384, 128, 2048, True),
+        (1, 2, 1, 32768, 32768, 128, 2048, True),
+
+        # Inference
+        (1, 2, 1, 2, 256, 128, 2048, True),
+        (1, 2, 1, 2, 512, 128, 2048, True),
+        (1, 2, 1, 2, 1024, 128, 2048, True),
+        (1, 2, 1, 2, 2048, 128, 2048, True),
+        (1, 2, 1, 2, 4096, 128, 2048, True),
+        (1, 2, 1, 2, 8192, 128, 2048, True),


[nitpick] The manually unrolled configs list is quite large and repetitive. Consider generating these configurations programmatically using loops or list comprehensions to improve readability and reduce duplication.
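One way to act on this suggestion is sketched below. It is only an illustration, not the benchmark's actual code: the helper name `build_configs`, the constant `WINDOW`, and the field names in the comment are assumptions inferred from the section comments in the diff (the meaning of the seventh field, always 2048, is not stated in this PR). The inference sweep reuses the range from the previously commented-out block, since the added lines shown here are cut off.

```python
# Assumed field order, inferred from the "Vary ..." comments in the diff:
# (batch, q_heads, kv_heads, q_len, kv_len, head_dim, window, is_causal)

WINDOW = 2048  # seventh field; hypothetical name, its meaning is not stated in the PR


def build_configs():
    """Return the benchmark config tuples, one group per swept dimension."""
    configs = []

    # Vary sequence length: 256, 512, ..., 32768
    configs += [(1, 2, 1, n, n, 128, WINDOW, True)
                for n in (2 ** k for k in range(8, 16))]

    # Inference: short query (length 2) against a growing key length, 256 .. 524288
    configs += [(1, 2, 1, 2, n, 128, WINDOW, True)
                for n in (2 ** k for k in range(8, 20))]

    # Vary batch size (sequence length 4096, as updated by this PR)
    configs += [(b, 2, 1, 4096, 4096, 32, WINDOW, True) for b in (1, 2, 4, 8)]

    # Vary head count: (query heads, key/value heads) pairs taken from the diff
    configs += [(1, h, kv, 4096, 4096, 32, WINDOW, True)
                for h, kv in ((1, 1), (2, 1), (4, 1), (8, 2))]

    # Vary head dimension
    configs += [(1, 2, 1, 4096, 4096, d, WINDOW, True)
                for d in (32, 64, 96, 128, 192, 256)]

    # Test non-causal
    configs.append((1, 2, 1, 4096, 4096, 128, WINDOW, False))

    return configs
```

Keeping one comprehension per group preserves the section comments from the hand-written list, so each sweep remains easy to find and to disable individually.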

Copilot · 2025-07-10T12:49:24Z

benchmarks/benchmark_forward_performance.py

+        # Test non-causal
+        (1, 2, 1, 4096, 4096, 128, 2048, False),


[nitpick] Instead of hardcoding a single non-causal test tuple, you could extend your programmatic generation approach (e.g., include is_causal as a parameter) so adding or modifying test modes scales more cleanly.

Suggested change

-        # Test non-causal
-        (1, 2, 1, 4096, 4096, 128, 2048, False),
+        # Generate configurations for both causal and non-causal cases
+        *[(1, 2, 1, 4096, 4096, 128, 2048, is_causal) for is_causal in [True, False]],
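Note that the suggested comprehension also emits the causal tuple `(1, 2, 1, 4096, 4096, 128, 2048, True)`, which already appears in the sequence-length sweep, so that case would be benchmarked twice. A minimal sketch of one way to avoid this follows; the helper name `with_causal_modes` is hypothetical and not part of the benchmark script.

```python
def with_causal_modes(base_configs, modes=(True, False)):
    """Expand each config across causal modes, dropping exact duplicates.

    The last tuple field is treated as is_causal, as in the suggestion above.
    dict.fromkeys keeps the first occurrence of each tuple and preserves order.
    """
    expanded = [cfg[:-1] + (is_causal,)
                for cfg in base_configs
                for is_causal in modes]
    return list(dict.fromkeys(expanded))


# The causal entry from the sequence-length sweep plus the suggested pair.
existing = [(1, 2, 1, 4096, 4096, 128, 2048, True)]
suggested = [(1, 2, 1, 4096, 4096, 128, 2048, c) for c in [True, False]]

# Order-preserving de-duplication of the combined list.
configs = list(dict.fromkeys(existing + suggested))
```

Whether duplicates matter depends on how results are reported; if each tuple is timed independently, a repeated entry only costs extra run time.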

LoserCheems requested review from Evanwu1125, SNHuan, Copilot and wubingheng111 and removed request for Copilot July 10, 2025 12:48

LoserCheems assigned LoserCheems, Copilot, Evanwu1125, SNHuan and wubingheng111 Jul 10, 2025

Copilot AI reviewed Jul 10, 2025

LoserCheems merged commit 09f3b82 into main Jul 10, 2025
