Require later version of transformers #326

Closed
tleyden opened this issue May 22, 2024 · 1 comment

Comments

@tleyden
Contributor

tleyden commented May 22, 2024

Currently mergekit doesn't require any particular version of transformers. However, running evo merge with transformers v4.35.0 gives this error:

ray.exceptions.RayTaskError(TypeError): ray::InMemoryMergeEvaluator.evaluate_genotype() (pid=1424, ip=172.17.0.4, actor_id=7fc68474338a238d957914ee01000000, repr=<mergekit.evo.actors.InMemoryMergeEvaluator object at 0x7ff0846eb610>)
  File "/mergekit/mergekit/evo/actors.py", line 303, in evaluate_genotype
    return self.evaluate(genotype)
  File "/mergekit/mergekit/evo/actors.py", line 236, in evaluate
    self._maybe_init_model(config)
  File "/mergekit/mergekit/evo/actors.py", line 182, in _maybe_init_model
    transformers.AutoModelForCausalLM.from_config(
  File "/opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 441, in from_config
    return model_class._from_config(config, **kwargs)
  File "/opt/conda/lib/python3.10/site-packages/transformers/modeling_utils.py", line 1192, in _from_config
    model = cls(config, **kwargs)
TypeError: MistralForCausalLM.__init__() got an unexpected keyword argument 'attn_implementation'

WDYT of requiring transformers>=4.39.3? It is the latest version used by the HF Alignment Handbook and should probably work (untested).
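
For illustration, here is a minimal sketch of a runtime guard that enforces a minimum version like the one proposed here. The helper names (`parse_release`, `meets_minimum`, `require_minimum`) are hypothetical and not part of mergekit, and 4.39.3 is the untested floor from this comment, not a verified minimum.

```python
from importlib import metadata


def parse_release(version: str) -> tuple[int, ...]:
    """Turn '4.39.3' into (4, 39, 3), stopping at the first non-numeric part."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # e.g. 'dev0' has no leading digits
        parts.append(int(digits))
        if digits != piece:
            break  # e.g. '0rc1': keep the 0, ignore the suffix
    return tuple(parts)


def meets_minimum(installed: str, minimum: str) -> bool:
    """Compare only the numeric release segments (a simplification of PEP 440)."""
    return parse_release(installed) >= parse_release(minimum)


def require_minimum(package: str, minimum: str) -> None:
    """Raise RuntimeError if `package` is missing or older than `minimum`."""
    try:
        installed = metadata.version(package)
    except metadata.PackageNotFoundError:
        raise RuntimeError(f"{package} is not installed") from None
    if not meets_minimum(installed, minimum):
        raise RuntimeError(
            f"{package}>={minimum} is required, but {installed} is installed"
        )


if __name__ == "__main__":
    # Hypothetical floor taken from the proposal in this issue (untested).
    try:
        require_minimum("transformers", "4.39.3")
        print("transformers version is new enough")
    except RuntimeError as err:
        print(f"version check failed: {err}")
```

This comparison ignores pre-release and local version suffixes. A real check would more likely rely on `packaging.version.Version`, or simply declare the floor in the package's install requirements so that pip resolves it at install time.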

@cg123
Collaborator

cg123 commented May 26, 2024

Done in #331.

cg123 closed this as completed May 26, 2024