Skip to content

Conversation

@RenfeiChen-FB
Copy link
Contributor

Summary:
First generate the data:
bash nvt_preproc.sh /data/criteo/ /data/criteo_1_day/ 8192

Then run the command:

torchx run -s local_cwd dist.ddp -j 1x8 --script train_torchrec.py -- --num_embeddings_per_feature 45833188,36746,17245,7413,20243,3,7114,1441,62,29275261,1572176,345138,10,2209,11267,128,4,974,14,48937457,11316796,40094537,452104,12606,104,35 --over_arch_layer_sizes 1024,1024,512,256,1 --binary_path /data/criteo_1_day/criteo_preproc/train/

And use nvidia-smi to check the ghosts process

Differential Revision: D37794009

Summary:
First generate the data:
bash nvt_preproc.sh /data/criteo/ /data/criteo_1_day/ 8192

Then run the command:

torchx run -s local_cwd dist.ddp -j 1x8 --script train_torchrec.py --  --num_embeddings_per_feature 45833188,36746,17245,7413,20243,3,7114,1441,62,29275261,1572176,345138,10,2209,11267,128,4,974,14,48937457,11316796,40094537,452104,12606,104,35 --over_arch_layer_sizes 1024,1024,512,256,1 --binary_path /data/criteo_1_day/criteo_preproc/train/

And use nvidia-smi to check the ghosts process

Differential Revision: D37794009

fbshipit-source-id: 58b34eb546715845c0500a8aa9fc8ca16bf9bd6c
@facebook-github-bot facebook-github-bot added CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported labels Jul 12, 2022
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D37794009

@TroyGarden TroyGarden closed this Jun 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants