Skip to content

Benchmark the new PyT data loader (with sparse tensors support) scalability with multi-GPU and larger datasets #28

@gabrielspmoreira

Description

@gabrielspmoreira

Benchmark the new PyT data loader with the REES46 ecommerce dataset, using multiple GPUs

Train set: All train.parquet files for 31 days (1 parquet file by week). P.s. Set row group size accordingly
Eval set: All valid.parquet files concatenated

  • Create a recsys_main.py variation for non-incremental training
  • Train with 3 weeks and evaluate on the last week
  • Run experiments varying the number of GPUs: Single GPU, Multi-GPU Data Parallel, Multi-GPU Distributed DataParallel

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions