deps: Add protobuf to support ALLaM models #328
Conversation
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
Note: the error occurs when loading the tokenizer for the ALLaM model without protobuf:

```
ERROR:sft_trainer.py:Traceback (most recent call last):
  File "/home/tuning/.local/lib/python3.11/site-packages/tuning/sft_trainer.py", line 577, in main
    trainer = train(
              ^^^^^^
  File "/home/tuning/.local/lib/python3.11/site-packages/tuning/sft_trainer.py", line 195, in train
    tokenizer = AutoTokenizer.from_pretrained(
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/tuning/.local/lib/python3.11/site-packages/transformers/models/auto/tokenization_auto.py", line 916, in from_pretrained
    return tokenizer_class_fast.from_pretrained(pretrained_model_name_or_path, *inputs, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/tuning/.local/lib/python3.11/site-packages/transformers/tokenization_utils_base.py", line 2271, in from_pretrained
    return cls._from_pretrained(
           ^^^^^^^^^^^^^^^^^^^^^
  File "/home/tuning/.local/lib/python3.11/site-packages/transformers/tokenization_utils_base.py", line 2505, in _from_pretrained
    tokenizer = cls(*init_inputs, **init_kwargs)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/tuning/.local/lib/python3.11/site-packages/transformers/models/llama/tokenization_llama_fast.py", line 157, in __init__
    super().__init__(
  File "/home/tuning/.local/lib/python3.11/site-packages/transformers/tokenization_utils_fast.py", line 118, in __init__
    fast_tokenizer = convert_slow_tokenizer(slow_tokenizer)
                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/tuning/.local/lib/python3.11/site-packages/transformers/convert_slow_tokenizer.py", line 1597, in convert_slow_tokenizer
    return converter_class(transformer_tokenizer).converted()
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/tuning/.local/lib/python3.11/site-packages/transformers/convert_slow_tokenizer.py", line 538, in __init__
    requires_backends(self, "protobuf")
  File "/home/tuning/.local/lib/python3.11/site-packages/transformers/utils/import_utils.py", line 1531, in requires_backends
    raise ImportError("".join(failed))
ImportError:
LlamaConverter requires the protobuf library but it was not found in your environment. Checkout the instructions on the
installation page of its repo: https://github.com/protocolbuffers/protobuf/tree/master/python#installation and follow the ones
that match your environment. Please note that you may need to restart your runtime after installation.
```
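A quick way to confirm the failing precondition is to check whether the protobuf package is importable before calling `AutoTokenizer.from_pretrained`. A minimal sketch using only the standard library (`has_protobuf` is a hypothetical helper, not part of fms-hf-tuning or transformers):

```python
import importlib.util


def has_protobuf() -> bool:
    """Return True if the protobuf Python package is importable."""
    # find_spec returns None when the package is not installed,
    # which is exactly the condition that triggers the ImportError
    # inside transformers' LlamaConverter.
    return importlib.util.find_spec("google.protobuf") is not None


print("protobuf available:", has_protobuf())
```

If this prints `False`, converting a slow sentencepiece-based tokenizer (as the ALLaM/Llama tokenizer path does) will fail with the traceback above.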
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
The change looks good to me. I am testing the image build and tuning a llama3 model, then it is good to merge. Verified that the image build and the llama3 tuning both ran successfully.
Squashing the commits here. To add this change to the main branch, we would cherry-pick the single squashed commit.
Description of the change
Add protobuf v5.28.0 to fms-hf-tuning for compatibility with models, such as ALLaM, whose tokenizer conversion requires protobuf
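For reference, in a pip-based setup the pin would be a single requirements-style line (the actual dependency file and extras layout in fms-hf-tuning may differ):

```
protobuf==5.28.0
```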
Related issue number
How to verify the PR
Was the PR tested