```
/home/jon/miniconda3/envs/h2ogpt/bin/python3.10 /home/jon/h2ogpt/testawq1.py
Console output is saving to: /home/jon/h2ogpt/pycharm.log
Fetching 11 files: 100%|██████████| 11/11 [00:00<00:00, 37267.64it/s]
Replacing layers...: 100%|██████████| 32/32 [00:01<00:00, 16.80it/s]
Fusing layers...: 100%|██████████| 32/32 [00:00<00:00, 138.34it/s]
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Token indices sequence length is longer than the specified maximum sequence length for this model (22689 > 16384). Running this sequence through the model will result in indexing errors
Traceback (most recent call last):
  File "/home/jon/h2ogpt/testawq1.py", line 840, in <module>
    generation_output = model.generate(
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/awq/models/base.py", line 113, in generate
    return self.model.generate(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/transformers/generation/utils.py", line 1576, in generate
    result = self._greedy_search(
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/transformers/generation/utils.py", line 2494, in _greedy_search
    outputs = self(
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/accelerate/hooks.py", line 166, in new_forward
    output = module._old_forward(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/transformers/models/mistral/modeling_mistral.py", line 1158, in forward
    outputs = self.model(
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/awq/modules/fused/model.py", line 127, in forward
    h, _, _ = layer(
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/awq/modules/fused/block.py", line 123, in forward
    attn_output, _, past_key_value = self.attn.forward(
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/awq/modules/fused/attn.py", line 235, in forward
    xq, xk = self.rope.forward(xq, xk, self.start_pos, seqlen)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/awq/modules/fused/attn.py", line 60, in forward
    freqs_cis = self.reshape_for_broadcast(freqs_cis, xq_).to(xq_.device)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/awq/modules/fused/attn.py", line 48, in reshape_for_broadcast
    assert freqs_cis.shape == (x.shape[1], x.shape[-1])
AssertionError
```
Running the attached script (testawq1.py.zip) produces the output above.

Related issue: h2oai/h2ogpt#160
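For what it's worth, the failure looks like the fused AWQ attention pre-allocating its RoPE/KV buffers for a fixed maximum length, which the ~22k-token prompt exceeds (see the 22689 > 16384 warning above), so the shape assert in reshape_for_broadcast fires. Below is a minimal sketch of a possible workaround; the model id, prompt, and the exact kwarg name are assumptions on my part, not taken from testawq1.py.

```python
# Hypothetical sketch, not the contents of testawq1.py: load the AWQ checkpoint with a
# larger fused-cache length (or without fused layers) so long prompts fit.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "TheBloke/Mistral-7B-Instruct-v0.2-AWQ"  # assumed checkpoint, not the one used above

# Assumption: recent AutoAWQ releases expose max_seq_len on from_quantized (older ones
# used max_new_tokens). With fuse_layers=True this sizes the pre-built RoPE/KV buffers,
# so it needs to cover the longest expected prompt (> 22689 tokens in this case).
model = AutoAWQForCausalLM.from_quantized(
    model_path,
    fuse_layers=True,   # setting this to False should also avoid the fused-attn assert
    max_seq_len=32768,
)
tokenizer = AutoTokenizer.from_pretrained(model_path)

long_prompt = "..."  # the actual ~22k-token prompt is elided
input_ids = tokenizer(long_prompt, return_tensors="pt").input_ids.cuda()
generation_output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(generation_output[0], skip_special_tokens=True))
```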