Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

noisechannel code issue #2738

Closed
wns823 opened this issue Oct 16, 2020 · 3 comments
Closed

noisechannel code issue #2738

wns823 opened this issue Oct 16, 2020 · 3 comments

Comments

@wns823
Copy link

wns823 commented Oct 16, 2020

🐛 Bug

For reproducing noisechannel code, I just executed the Example usage Script. But I saw some errors.
'FileNotFoundError: [Errno 2] No such file or directory:
'/home/compu/GAN/fairseq/examples/noisychannel/rerank_data/wmt17/nbest_10_subset_test_fw_name_model_ex_shard_0_of_1/rescore_data_new_bpe.en' '

In my guess, the Latest code is not uploaded. So, I hope to see the latest code.

Environment

  • fairseq Version (e.g., 1.0 or master): master
  • PyTorch Version (e.g., 1.0) : torch 1.4
  • OS (e.g., Linux): Ubuntu 18.04
  • How you installed fairseq (pip, source): pip install --editable ./
  • Python version: Python 3.8
  • CUDA/cuDNN version: CUDA 11.0
  • GPU models and configuration: NVIDIA TITAN RTX
  • Any other relevant information:

Additional context

@myleott
Copy link
Contributor

myleott commented Oct 17, 2020

What commands did you run? What's the full error message + stack trace?

@wns823
Copy link
Author

wns823 commented Oct 18, 2020

I just use the Example usage script with beam_size = 10.
default

This is error message.
스크린샷, 2020-10-15 22-57-10

@wns823 wns823 closed this as completed Oct 22, 2020
facebook-github-bot pushed a commit that referenced this issue Jan 11, 2022
Summary:
This is the equivalent to PR fairinternal/fairseq-py#2697 but on top of main instead of gshard (cherry-picked and merged the squash):

* reorganize preprocess.py code a bit
* use Binarizers objects in the multiprocess code
* clean up the make_binary
* multiprocess logic
* learn to count
* format and doc string
* add basic test for vocab binarizer
* generalize to one line
* move multiprocess in binarizer

Testing:
```
python -m fairseq_cli.preprocess --only-source --trainpref ~/fixathon/small_vocab_test/train.in --destdir ~/fixathon/small_vocab_test/data-bin.cherry --workers 20
python -m fairseq_cli.preprocess --only-source --trainpref ~/fixathon/small_vocab_test/train.in --destdir ~/fixathon/small_vocab_test/data-bin.main --workers 20
```

```
 md5sum ~/fixathon/small_vocab_test/data-bin.cherry/train.bin == md5sum ~/fixathon/small_vocab_test/data-bin.main/train.bin
```

```
diff ~/fixathon/small_vocab_test/data-bin.main/dict.txt ~/fixathon/small_vocab_test/data-bin.cherry/dict.tx
```

Pull Request resolved: fairinternal/fairseq-py#2738

Reviewed By: sshleifer, dianaml0

Differential Revision: D32830875

Pulled By: Mortimerp9

fbshipit-source-id: e7463d5cdd96a877691bf39666daa319ebb3dcb8
@kurtabela
Copy link

How was this issue fixed exactly? I still get the same error

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants