Skip to content

Fix tests#210

Merged
SimJeg merged 8 commits intomainfrom
simon/fix-tests
Apr 9, 2026
Merged

Fix tests#210
SimJeg merged 8 commits intomainfrom
simon/fix-tests

Conversation

@SimJeg
Copy link
Copy Markdown
Collaborator

@SimJeg SimJeg commented Apr 8, 2026

  • KVComposePress compression ratio failed test because of using quantile
  • Breaking changes after 5.3 in transformers
  • Fixed issue with the GitLab runner

@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented Apr 8, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@SimJeg SimJeg force-pushed the simon/fix-tests branch from 9e61a91 to ac07ed2 Compare April 8, 2026 14:18
@alessiodevoto
Copy link
Copy Markdown
Collaborator

/ok to test ac07ed2

SimJeg and others added 2 commits April 8, 2026 14:21
Replace quantile-based threshold with topk ranking in compute_important_per_layer
to match the pattern used by all other presses and avoid overshooting the target
compression ratio due to quantile interpolation.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: SimJeg <sjegou@nvidia.com>
Signed-off-by: SimJeg <sjegou@nvidia.com>
@SimJeg SimJeg force-pushed the simon/fix-tests branch from 038990e to efc294b Compare April 8, 2026 14:21
@SimJeg
Copy link
Copy Markdown
Collaborator Author

SimJeg commented Apr 8, 2026

/ok to test efc294b

@SimJeg
Copy link
Copy Markdown
Collaborator Author

SimJeg commented Apr 8, 2026

/ok to test 31b4a0e

@SimJeg
Copy link
Copy Markdown
Collaborator Author

SimJeg commented Apr 8, 2026

/ok to test 8b5325b

@SimJeg
Copy link
Copy Markdown
Collaborator Author

SimJeg commented Apr 8, 2026

/ok to test 3fd743a

@SimJeg SimJeg force-pushed the simon/fix-tests branch from 3fd743a to 8b5325b Compare April 8, 2026 15:56
CompactorPress, LeverageScorePress, and NonCausalAttnPress OOM when
run with Qwen3-4B on the L4 GPU (23GB), corrupting CUDA state and
cascading into 31 test failures.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@SimJeg
Copy link
Copy Markdown
Collaborator Author

SimJeg commented Apr 8, 2026

/ok to test 2982401

SimJeg added 2 commits April 9, 2026 08:54
Signed-off-by: SimJeg <sjegou@nvidia.com>
@SimJeg
Copy link
Copy Markdown
Collaborator Author

SimJeg commented Apr 9, 2026

/ok to test c27fe1c

Signed-off-by: SimJeg <sjegou@nvidia.com>
@SimJeg
Copy link
Copy Markdown
Collaborator Author

SimJeg commented Apr 9, 2026

/ok to test f04313c

@SimJeg SimJeg linked an issue Apr 9, 2026 that may be closed by this pull request
Copy link
Copy Markdown
Collaborator

@alessiodevoto alessiodevoto left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! This one should fix #209
Follow-ups:

  • compatibility with transformers > 5.2
  • Consider adding tests without FA

@SimJeg SimJeg merged commit fa7a0dc into main Apr 9, 2026
3 checks passed
@SimJeg SimJeg deleted the simon/fix-tests branch April 9, 2026 09:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Tests fail at collection due to torch/NCCL version mismatch

2 participants