Skip to content

Fast latent extraction with Nemo Curator#182

Draft
sippycoder wants to merge 8 commits intomainfrom
data/pollux_v1/fast_latent_extraction
Draft

Fast latent extraction with Nemo Curator#182
sippycoder wants to merge 8 commits intomainfrom
data/pollux_v1/fast_latent_extraction

Conversation

@sippycoder
Copy link
Contributor

@sippycoder sippycoder commented Mar 11, 2025

Throughput on H100:
256 resolution: 69M per day
512 resolution: 18M per day

data:
  data_path: /mnt/pollux/nemo/sample
  output_path: /mnt/pollux/nemo/sample_latents_512/
  id_col: key
  batch_size: 256
  num_threads_per_worker: 16
  image_size: 512
  image_latent_column: image_latent_512

For the above configuration the latents are stored in cfg.data.output_path in parquet format.

ToDo:

  • Wandb tracking
  • Fix memory leak

@sippycoder sippycoder requested a review from HaozheLiu-ST March 11, 2025 18:14
@sippycoder sippycoder self-assigned this Mar 11, 2025
@sippycoder sippycoder marked this pull request as draft March 11, 2025 18:19
@HaozheLiu-ST
Copy link
Contributor

Can we extract the latent code of the 256/512 resolutions simultaneously? Since the model on 512 resolution will be trained with the pair data (caption,latentcode_256, latentcode_512). 256 is for the planning model, and 512 is for the generation part.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants