Worksheets.codalab.org down - prohibits HELM from completing #1930
Comments
Hi @yifanmai, this is critical to fix for the LLM competition, since we'd otherwise need to remove all MMLU datasets perturbations and CNN/DM from our configuration, which doesn't feel great right before the Wednesday deadline.
As a backup I can remove MMLU, but then we'd be defining the datasets a day before the competition deadline, which is not great.
Azure unfortunately disabled the CodaLab server due to a technical glitch; we are trying to get support to bring it back. In the meantime, perhaps we can send you the relevant files?
Would it be possible to temporarily change the download link to your own mirror? It'd be much more convenient for the leaderboard and competitors to just reinstall HELM from source rather than have to manually download a dataset and put it in the right place. Although, to be honest, either would be preferable to delaying the competition, since submissions are due in 3 days, on Oct 25.
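The mirror idea above can be sketched as a download helper that tries a primary URL and falls back to a mirror. This is only an illustration under stated assumptions: `download_first_available` is a hypothetical name and not part of HELM, and the URLs you pass are whatever primary and mirror locations you have.

```python
import urllib.request
from pathlib import Path
from typing import Callable, Sequence


def download_first_available(
    urls: Sequence[str],
    dest: Path,
    fetch: Callable[[str, Path], object] = urllib.request.urlretrieve,
) -> str:
    """Try each URL in order; return the first one that downloads to dest."""
    errors = []
    for url in urls:
        try:
            fetch(url, dest)
            return url
        except OSError as err:
            # urllib's URLError and HTTPError are subclasses of OSError.
            errors.append(f"{url}: {err}")
    raise RuntimeError("All download URLs failed:\n" + "\n".join(errors))
```

Note that a server which answers with an HTML error page and a 200 status does not raise here, so the downloaded file may still need a content check.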
@percyliang if you could share the files with me, that would be appreciated. Thanks!
Deploying a hotfix shortly: #1931
I have mirrored the files and updated main to use the new URLs. Please try pulling main and re-running. As an aside, because of #1932, you'll need to run the respective command if you see one of these error messages:
Thanks! I can try it out tonight.
@yifanmai The fix still does not work for me, even after removing cached data; I get the error below. EDIT: Manually downloading the file from Cloud and placing it correctly works. Error when running summarization_cnndm:temperature=0.3,device=cpu,model=neurips_local:
Traceback (most recent call last):
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/benchmark/runner.py", line 173, in run_all
    self.run_one(run_spec)
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/benchmark/runner.py", line 221, in run_one
    instances = scenario.get_instances(scenario_output_path)
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/benchmark/scenarios/summarization_scenario.py", line 137, in get_instances
    dataset, article_key, summary_key = self._load_dataset(self.dataset_name, output_path)
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/benchmark/scenarios/summarization_scenario.py", line 128, in _load_dataset
    dataset = self._download_dataset(url, "cnndm", output_path)
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/benchmark/scenarios/summarization_scenario.py", line 102, in _download_dataset
    dataset = pickle.load(fin)
_pickle.UnpicklingError: invalid load key, '<'.
100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 2.02it/s]
} [0.541s]
Traceback (most recent call last):
  File "/home/anmol/anaconda3/envs/wizard_coder/bin/helm-run", line 8, in <module>
    sys.exit(main())
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/common/hierarchical_logger.py", line 104, in wrapper
    return fn(*args, **kwargs)
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/benchmark/run.py", line 309, in main
    run_benchmarking(
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/benchmark/run.py", line 111, in run_benchmarking
    runner.run_all(run_specs)
  File "/home/anmol/nips_challenge/efficiency_challenge_repo/external_repos/helm_tracking_remote/helm/src/helm/benchmark/runner.py", line 182, in run_all
    raise RunnerError(f"Failed runs: [{failed_runs_str}]")
helm.benchmark.runner.RunnerError: Failed runs: ["summarization_cnndm:temperature=0.3,device=cpu,model=neurips_local"]
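The `invalid load key, '<'` message is what `pickle.load` reports when the first byte of the file is `<`, which usually means an HTML page (for example, an error page from an unavailable server) was saved in place of the pickle. A minimal sketch for checking a cached file before retrying follows. `looks_like_html` and `remove_if_html` are hypothetical helper names, not part of HELM, and the path you pass in is whichever cached dataset file your run created.

```python
from pathlib import Path


def looks_like_html(path: Path, probe_bytes: int = 64) -> bool:
    """Return True if the file begins with '<', as an HTML page would."""
    with open(path, "rb") as fin:
        head = fin.read(probe_bytes)
    return head.lstrip().startswith(b"<")


def remove_if_html(path: Path) -> bool:
    """Delete a cached file that looks like HTML so it can be downloaded again.

    Returns True if the file was removed, False if it was left in place.
    """
    if path.exists() and looks_like_html(path):
        path.unlink()
        return True
    return False
```

Deleting the bad file only helps once the download URL serves the real dataset; otherwise the next run will save the same HTML page again.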
@yifanmai I have the same error as above.
@agoncharenko1992 Manually downloading the file from Cloud and placing it correctly in the correct folder ( |
Thanks for the bug report; will investigate shortly.
This should be fixed by #1935. You may have to delete the file to redownload:
CodaLab is back online now. Please do let us know if you are still facing issues.
Hi!
I'm trying to run HELM with MMLU scenarios. It appears that https://worksheets.codalab.org/ is down, which causes HELM to fail when using this scenario. I'm not sure whether this data is yours or belongs to the scenario's authors, so I thought I'd post here in case it is HELM-related.
Best,
J.