Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

when I run train script, an errors occurs #74

Closed
BitcoinNLPer opened this issue Oct 31, 2021 · 2 comments
Closed

when I run train script, an errors occurs #74

BitcoinNLPer opened this issue Oct 31, 2021 · 2 comments

Comments

@BitcoinNLPer
Copy link

When I run this train script, I encounter some errors. The error log is as follows:
image

Do you know how to solve it?

Furthermore. there are too many files in code_clippy_data file. Is there a script to download this dataset conveniently?

@reshinthadithyan
Copy link
Collaborator

Hello. As we discussed, this is a python specific error. You'll have to reset your recursion limit to some higher number.
Refer here. Feel free to re-open the issue if something pops up. Thanks.

@BitcoinNLPer
Copy link
Author

Hi, I increased the number of layers for the file with the error, and found that the error is still reported. I want to ask two questions.
image

  • May I just clone this project and install the corresponding library, and then I can run run_clm_streaming_flax.py? Do I need to download more than 200g of training corpus into my local server? Or does it download automatically? Can you provide a more detailed pre-training guide?
  • The server I use has a GPU, so why do I run run_clm_streaming_flax.py directly and use the CPU by default?
    image

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants