attribute speed and memory optimizations #245

Merged
gsarti merged 7 commits into main from speed-mem-opt on Jan 17, 2024

Conversation

gsarti (Member) commented on Jan 10, 2024

Description

Small speed and memory optimizations to the main attribute loop, addressing issues #243 and #240:

  • Move tensors to CPU right away in the forward pass to avoid OOM when cloning (see the sketch after this list).
  • Fix remap_from_filtered behavior on sequence_scores tensors.
  • Use torch-native padding when converting lists of FeatureAttributionStepOutput to FeatureAttributionSequenceOutput in get_sequences_from_batched_steps.
  • Bump the ruff version and update dependencies.
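
For the first item, a minimal sketch of the idea, assuming a hypothetical helper name (the actual change lives inside inseq's attribution loop and may differ):

```python
import torch


def offload_step_scores(step_scores: torch.Tensor) -> torch.Tensor:
    """Detach step scores and move them to CPU before any further copies.

    Moving to CPU first means any later clone happens in host memory, so the
    GPU never has to hold a second copy of the tensor and is less likely to OOM.
    """
    # .to("cpu") already materializes a new tensor when the source is on GPU,
    # so no extra GPU-side .clone() is needed after detaching from the graph.
    return step_scores.detach().to("cpu")
```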

gsarti (Member, Author) commented on Jan 10, 2024

Description of the problem solved by optimizing the get_sequences_from_batched_steps function:

You have a list of PyTorch tensors representing the results of computations performed at every step of a batched generation process involving one or more sequences. Every tensor in the list has a shape of [batch_size, sequence_length, ...], corresponding to scores for all previously generated steps for all sequences. Importantly, the sequences are not guaranteed to have the same length, so a decrease in batch_size between two consecutive tensors denotes the end of one or more sequences. Write a Python function to convert the list described above into a list of tensors of shape [max_sequence_length, num_sequence_steps, ...] representing the individual sequences, where max_sequence_length is the maximum length of the specific sequence and num_sequence_steps is the number of steps for which scores are produced for that sequence.
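
A minimal sketch of such a conversion using torch-native padding. This illustrates the approach rather than inseq's exact implementation: it assumes that sequences finishing early are dropped from the end of the batch, and it marks positions beyond a sequence's own length with NaN instead of trimming per sequence.

```python
from typing import List

import torch
import torch.nn.functional as F


def get_sequences_from_batched_steps(step_tensors: List[torch.Tensor]) -> List[torch.Tensor]:
    """Convert per-step batched score tensors into one tensor per sequence.

    Each element of step_tensors has shape [batch_size, sequence_length, ...].
    Sketch-only assumptions: finished sequences are dropped from the end of the
    batch, and missing positions are filled with NaN.
    """
    num_sequences = step_tensors[0].shape[0]
    max_len = max(t.shape[1] for t in step_tensors)
    padded_steps = []
    for t in step_tensors:
        # Pad the sequence dim (dim 1) and the batch dim (dim 0) up to their
        # maxima in a single torch-native call. F.pad lists pads starting from
        # the last dimension, so trailing dims get (0, 0) pairs first.
        pad = [0, 0] * (t.dim() - 2) + [0, max_len - t.shape[1], 0, num_sequences - t.shape[0]]
        padded_steps.append(F.pad(t, pad, value=float("nan")))
    # Stack along a new generation-step dimension: [num_sequences, num_steps, max_len, ...]
    stacked = torch.stack(padded_steps, dim=1)
    # One tensor per sequence, shaped [max_len, num_steps, ...] after the transpose.
    return [seq.transpose(0, 1) for seq in stacked]
```

The point of the torch-native approach is that padding and stacking happen in a handful of vectorized calls rather than per-element Python work.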

gsarti marked this pull request as ready for review on January 17, 2024 12:15
gsarti merged commit f434192 into main on Jan 17, 2024
3 checks passed
gsarti deleted the speed-mem-opt branch on January 17, 2024 12:15
Labels: enhancement (New feature or request)