You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, because of it's multi-linguality and undertrainedness, I'd like to slice Falcon-11B.
There are 60 of these layers. Supposedly, it would just be an easy layer name change? Or bumping the requirements?
yaml_config = """
slices:
- sources:
- model: tiiuae/falcon-11B
layer_range: [0, 25]
- sources:
- model: tiiuae/falcon-11B
layer_range: [56,59]
merge_method: passthrough
dtype: bfloat16"""
with open('config.yaml', 'w', encoding="utf-8") as f:
f.write(yaml_config)
!mergekit-yaml config.yaml merge --copy-tokenizer --allow-crimes --out-shard-size 1B --lazy-unpickle
/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
warnings.warn(
Warmup loader cache: 0%| | 0/1 [00:00<?, ?it/s]
Fetching 11 files: 100%|█████████████████████| 11/11 [00:00<00:00, 94737.87it/s]
Warmup loader cache: 100%|████████████████████████| 1/1 [00:00<00:00, 2.22it/s]
Executing graph: 0%| | 1/946 [00:00<00:00, 1742.54it/s]
Traceback (most recent call last):
File "/opt/conda/bin/mergekit-yaml", line 8, in <module>
sys.exit(main())
File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1157, in __call__
return self.main(*args, **kwargs)
File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1078, in main
rv = self.invoke(ctx)
File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1434, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 783, in invoke
return __callback(*args, **kwargs)
File "/kaggle/working/PruneMe/mergekit/mergekit/options.py", line 82, in wrapper
f(*args, **kwargs)
File "/kaggle/working/PruneMe/mergekit/mergekit/scripts/run_yaml.py", line 47, in main
run_merge(
File "/kaggle/working/PruneMe/mergekit/mergekit/merge.py", line 92, in run_merge
for _task, value in exec.run(quiet=options.quiet):
File "/kaggle/working/PruneMe/mergekit/mergekit/graph.py", line 197, in run
res = task.execute(**arguments)
File "/kaggle/working/PruneMe/mergekit/mergekit/io/tasks.py", line 86, in execute
raise RuntimeError(
RuntimeError: Tensor transformer.h.59.ln_mlp.weight required but not present in model tiiuae/falcon-11B
The text was updated successfully, but these errors were encountered:
Hi, because of it's multi-linguality and undertrainedness, I'd like to slice Falcon-11B.
There are 60 of these layers. Supposedly, it would just be an easy layer name change? Or bumping the requirements?
![Screenshot 2024-05-17 at 23 32 36](https://private-user-images.githubusercontent.com/167638923/331735129-c246a61b-6605-4d96-bac2-0fe1330c66d3.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTk5Nzg4ODAsIm5iZiI6MTcxOTk3ODU4MCwicGF0aCI6Ii8xNjc2Mzg5MjMvMzMxNzM1MTI5LWMyNDZhNjFiLTY2MDUtNGQ5Ni1iYWMyLTBmZTEzMzBjNjZkMy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzAzJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcwM1QwMzQ5NDBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1jN2Y0ODIzY2YzMDdjYTc5OWQ0NDExNzk1OGJlMjhhZDdhNTdmMDVhNWYyNDg3MzhlYjJhZmFkMmQ5ZGMxOTViJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.7Nq0_JMLOK-LtzaenNnJ5_gfI26PzRwelwqVxXVExIg)
The text was updated successfully, but these errors were encountered: