Problem: As highlighted in a discussion in MR #375, many of the tensors we create for influence calculation end up being very large. For example, the influence factors are stored in a matrix of size (N_test_points, N_model_parameters). Often, these cannot fit into memory, whether ordinary RAM or GPU memory.
A simple fix would use generators. Instead of creating all the influence factors up front, we could compute them batch-by-batch within the compute_influences_up and compute_influences_pert methods. The same applies to the gradients within these methods. However, this would mean re-computing influence factors or gradients several times, at massive computational cost.
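A minimal sketch of the generator idea, using NumPy and a hypothetical per-batch routine (compute_factors_for_batch stands in for whatever actually computes the factors; it is not part of the existing API):

```python
import numpy as np


def compute_factors_for_batch(batch, n_params):
    # Placeholder for the real computation (e.g. Hessian-vector products).
    return np.ones((len(batch), n_params))


def influence_factor_batches(test_points, n_params, batch_size=32):
    """Yield influence-factor blocks batch-by-batch instead of
    materializing the full (N_test_points, N_model_parameters) matrix."""
    for start in range(0, len(test_points), batch_size):
        batch = test_points[start:start + batch_size]
        yield compute_factors_for_batch(batch, n_params)
```

Only one block of shape (batch_size, N_model_parameters) is alive at a time, which is what bounds the memory use; the downside noted above is that a consumer needing the factors twice must iterate (and recompute) twice.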
A better fix would cache the tensors to disk. We could create a CachedTensor class that saves tensors to files and loads them on demand, thus freeing the RAM/VRAM in between. Even the output of the compute_influences method could be a CachedTensor: one would only need to remember to flush it once it is no longer needed.
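One possible shape for such a class, sketched with NumPy arrays and a temporary .npy file (the name CachedTensor and its methods follow the proposal above; the details are an assumption, not an existing implementation):

```python
import os
import tempfile

import numpy as np


class CachedTensor:
    """Backs a large array with a file on disk, loading it only on
    access so the in-memory copy can be freed in between."""

    def __init__(self, array):
        # Write the array to a temporary .npy file and drop the reference.
        fd, self._path = tempfile.mkstemp(suffix=".npy")
        os.close(fd)
        np.save(self._path, array)

    def load(self):
        """Read the array back from disk."""
        return np.load(self._path)

    def flush(self):
        """Delete the backing file once the tensor is no longer needed."""
        if self._path is not None and os.path.exists(self._path):
            os.remove(self._path)
        self._path = None
```

For GPU tensors the same pattern would move data to host memory before saving; alternatively, something like numpy.memmap (or torch.save with mmap-backed loading) could avoid even the full in-memory copy on load.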