
[🐛BUG] Dataset save_path mismatches load_path for sequential dataset #1697

Closed · ShadowTinker opened this issue Mar 16, 2023 · 6 comments
Labels: bug

ShadowTinker (Contributor) commented Mar 16, 2023

Describe the bug
I set save_dataset=True in recbole/properties/overall.yaml, but the model still re-processes the dataset on every run.

To Reproduce
Steps to reproduce the behavior:

  1. Set save_dataset=True in recbole/properties/overall.yaml.
  2. Execute python run_recbole.py --model=DIN --dataset=ml-100k.
  3. Observe the mismatched paths:
     • save_path: path_prefix/ml-100k-dataset.pth
     • load_path: path_prefix/ml-100k-SequentialDataset.pth
  4. Way to fix it: I changed the following line

         file = os.path.join(save_dir, f'{self.config["dataset"]}-dataset.pth')

     to

         file = os.path.join(save_dir, f'{self.config["dataset"]}-{self.__class__.__name__}.pth')

     and it works for DIN, AFM, BPR, and SASRec. A minimal sketch of the resulting convention follows this list.
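For illustration, here is a minimal sketch of the naming convention this fix establishes, with the class name used in both the save path and the load path. This is not RecBole's actual save/load code: DatasetSaveMixin, _checkpoint_path, and try_load are hypothetical names, and self.config is assumed to behave like a dict.

    import os
    import pickle


    class DatasetSaveMixin:
        def _checkpoint_path(self, save_dir):
            # e.g. "path_prefix/ml-100k-SequentialDataset.pth"
            return os.path.join(
                save_dir, f'{self.config["dataset"]}-{self.__class__.__name__}.pth'
            )

        def save(self, save_dir):
            # Write the processed dataset under the class-name-based filename.
            with open(self._checkpoint_path(save_dir), "wb") as f:
                pickle.dump(self, f)

        def try_load(self, save_dir):
            # Look for the same filename; return None to fall back to re-processing.
            path = self._checkpoint_path(save_dir)
            if os.path.isfile(path):
                with open(path, "rb") as f:
                    return pickle.load(f)
            return None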

Desktop (please complete the following information):

  • OS: Linux
  • RecBole Version 1.1.1
  • Python Version 3.9.15
  • PyTorch Version 1.13.1
  • cudatoolkit Version 11.6
ShadowTinker (Contributor, Author) commented Mar 16, 2023

Besides, I also found some bugs when loading saved dataloaders with save_dataloaders=True in the config file. In the following code, eval_neg_sample_args does not exist in the config, while valid_neg_sample_args and test_neg_sample_args do:

    def update_config(self, config):
        self._set_neg_sample_args(
            config, self._dataset, InputType.POINTWISE, config["eval_neg_sample_args"]
        )
        super().update_config(config)

This results in a KeyError at line 139 in the following code:

    def _set_neg_sample_args(self, config, dataset, dl_format, neg_sample_args):
        self.uid_field = dataset.uid_field
        self.iid_field = dataset.iid_field
        self.dl_format = dl_format
        self.neg_sample_args = neg_sample_args
        self.times = 1
        if (
            self.neg_sample_args["distribution"] == "uniform"
            or "popularity"
            and self.neg_sample_args["sample_num"] != "none"
        ):

I opened a pull request #1698 to fix the two problems mentioned above.
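For illustration only, a hypothetical sketch of reading the phase-specific key instead of the missing eval_neg_sample_args (the actual fix is in #1698 and may differ; resolve_neg_sample_args is an invented helper, and config is assumed to behave like a dict):

    def resolve_neg_sample_args(config, phase):
        # phase is "valid" or "test"; the config holds "valid_neg_sample_args"
        # and "test_neg_sample_args" rather than a single "eval_neg_sample_args".
        return config[f"{phase}_neg_sample_args"]

    # e.g. inside update_config, instead of config["eval_neg_sample_args"]:
    # neg_sample_args = resolve_neg_sample_args(config, phase="valid")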

Paitesanshi self-assigned this Mar 18, 2023
Paitesanshi (Collaborator) commented
Hello @ShadowTinker,

Thank you for your attention and contributions to RecBole! You are correct that there are some bugs. To improve the code structure and keep the changes simple, I have made some minor modifications. Once you have reviewed the modifications in #1698, the updates can be merged into the main code.

Thank you again for your support of our team!

ShadowTinker (Contributor, Author) commented
Hello @Paitesanshi,

Thank you for your reply! However, I just found another problem: the saved dataloaders are still loaded after modifying train_batch_size or eval_batch_size, so the batch size is not updated accordingly. This is caused by the following lines:

    for arg in dataset_arguments + ["seed", "repeatable", "eval_args"]:
        if config[arg] != train_data.config[arg]:
            return None

where neither train_batch_size nor eval_batch_size is checked.

But I'm not sure whether this is intentional, because these arguments are usually fixed. So I wonder whether the two arguments should be added to the check above (see the sketch below).
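As an illustration of that suggestion only (not the merged behavior; dataloaders_reusable is an invented helper name, and config and train_data.config are assumed to behave like dicts):

    def dataloaders_reusable(config, train_data, dataset_arguments):
        # Treat a change in either batch size as a reason to rebuild the
        # dataloaders instead of loading the cached ones.
        checked_args = dataset_arguments + [
            "seed",
            "repeatable",
            "eval_args",
            "train_batch_size",  # proposed addition
            "eval_batch_size",   # proposed addition
        ]
        return all(config[arg] == train_data.config[arg] for arg in checked_args)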

Paitesanshi (Collaborator) commented
@ShadowTinker We think that a user who loads an existing dataloader intends to use a fixed batch_size. We will consider whether to modify this setting after collecting more user feedback. Thanks again for your suggestion!

ShadowTinker (Contributor, Author) commented
@Paitesanshi Thank you for the explanation. I've reviewed the modifications in #1698, and it is my honor to contribute to RecBole. Thank you again for the great repository and your contributions to the community.

Paitesanshi added a commit that referenced this issue Mar 18, 2023
Fix: fix saving and loading for datasets and dataloaders (#1697)
christopheralex commented
This fails again when run with multiprocessing on 4 GPUs on one node. One process writes the file to disk while another process sees that the file exists and tries to read it. Even if that race does not occur, four processes writing the same pickle file in "wb" mode will leave it corrupt when it is loaded later for inference.
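A common mitigation for this kind of race, sketched here as a suggestion rather than anything RecBole currently does: let only rank 0 write the cached file, and make the other ranks wait at a barrier before reading it (save_dataset_once is an invented helper).

    import os
    import pickle

    import torch.distributed as dist


    def save_dataset_once(dataset, path):
        is_dist = dist.is_available() and dist.is_initialized()
        if not is_dist or dist.get_rank() == 0:
            tmp = path + ".tmp"
            with open(tmp, "wb") as f:
                pickle.dump(dataset, f)
            os.replace(tmp, path)  # atomic rename so readers never see a partial file
        if is_dist:
            dist.barrier()  # other ranks continue only after the file is complete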
