Question about random sampling. #13

SongRb · 2018-10-20T07:35:51Z

BERT-pytorch/bert_pytorch/dataset/dataset.py

Lines 50 to 64 in 7efd2b5

    
    prob = random.random()
    if prob < 0.15:
        # 80% randomly change token to make token
        if prob < prob * 0.8:
            tokens[i] = self.vocab.mask_index
        # 10% randomly change token to random token
        elif prob * 0.8 <= prob < prob * 0.9:
            tokens[i] = random.randrange(len(self.vocab))
        # 10% randomly change token to current token
        elif prob >= prob * 0.9:
            tokens[i] = self.vocab.stoi.get(token, self.vocab.unk_index)
        output_label.append(self.vocab.stoi.get(token, self.vocab.unk_index))

Well, it seems random.random() always returns a non-negative number (a float in [0.0, 1.0)), so prob >= prob * 0.9 will always be true?


codertimo · 2018-10-20T08:31:25Z

Haha, you're right, it seems else is more efficient. Thank you for your comment 👀

leon-cas · 2018-10-22T02:45:16Z

if prob < prob * 0.8: always False?
elif prob * 0.8 <= prob < prob * 0.9: always False?
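A quick way to check this is to run the quoted if/elif chain on its own. The following is only a minimal sketch: the helper name `branch_taken` and the branch labels are made up for illustration and are not part of the repository. Since `prob` is non-negative, `prob * 0.8` and `prob * 0.9` can never exceed `prob`, so the first two branches should never fire and the last one should always fire.

```python
import random


def branch_taken(prob):
    """Report which branch of the quoted if/elif chain fires for `prob`.

    Hypothetical helper: it copies only the three comparisons from the
    snippet quoted in this issue, not the token replacement itself.
    """
    if prob < prob * 0.8:
        return "mask"
    elif prob * 0.8 <= prob < prob * 0.9:
        return "random"
    elif prob >= prob * 0.9:
        return "keep"
    return "none"


random.seed(0)
counts = {"mask": 0, "random": 0, "keep": 0, "none": 0}
for _ in range(10_000):
    prob = random.random()
    if prob < 0.15:  # same outer condition as the quoted code
        counts[branch_taken(prob)] += 1

# Only the "keep" counter is expected to be non-zero.
print(counts)
```

In other words, with the conditions as quoted, a selected token is never replaced by the mask index or by a random token.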

artemisart · 2018-10-22T22:58:53Z

I also think these conditions are still wrong. I'm sending a PR.
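For reference, one way the conditions could be written so that the 80% / 10% / 10% split actually happens is to rescale `prob` into [0, 1) once the token has been selected. This is a sketch under assumptions, not necessarily what the PR does: the function name `choose_replacement`, its parameters, and the returned action labels are hypothetical and chosen only to make the example self-contained.

```python
import random


def choose_replacement(token_id, mask_index, vocab_size, rng=random):
    """Apply the 15% selection and 80/10/10 replacement rule to one token id.

    Hypothetical helper for illustration. Returns (new_token_id, action),
    where action is one of "none", "mask", "random", "keep".
    """
    prob = rng.random()
    if prob >= 0.15:
        # Token not selected for prediction: leave it untouched.
        return token_id, "none"

    # Rescale so that prob is (approximately) uniform on [0, 1) again.
    prob /= 0.15

    if prob < 0.8:
        # 80% of selected tokens: replace with the mask index.
        return mask_index, "mask"
    elif prob < 0.9:
        # 10% of selected tokens: replace with a random token id.
        return rng.randrange(vocab_size), "random"
    else:
        # Remaining 10%: keep the current token.
        return token_id, "keep"


# Example: token id 7, with assumed mask index 4 and vocabulary size 1000.
new_id, action = choose_replacement(7, mask_index=4, vocab_size=1000)
```

Comparing against fixed thresholds (0.8 and 0.9) instead of against multiples of `prob` itself is what makes each branch reachable.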

fix conditions for #13

codertimo added a commit that referenced this issue Oct 20, 2018

Change elif to else to fix #13 issue

7b53875

codertimo closed this as completed Oct 20, 2018

artemisart added a commit to artemisart/BERT-pytorch that referenced this issue Oct 22, 2018

really fix conditions codertimo#13

ed68f5a

codertimo mentioned this issue Oct 23, 2018

Erroneous code #21

Closed

codertimo added a commit that referenced this issue Oct 23, 2018

Merge pull request #22 from artemisart/alpha0.0.1a4

9e76604

fix conditions for #13
