Token indices sequence length is longer than the specified maximum sequence length for this model (3000 > 512). Running this sequence through the model will result in indexing errors #18

BinchaoPeng · 2021-04-08T03:02:45Z

Token indices sequence length is longer than the specified maximum sequence length for this model (3000 > 512). Running this sequence through the model will result in indexing errors

Traceback (most recent call last):
  File "", line 1, in
  File "F:\PyCharm 2020.2.1\plugins\python\helpers\pydev_pydev_bundle\pydev_umd.py", line 197, in runfile
    pydev_imports.execfile(filename, global_vars, local_vars) # execute the script
  File "F:\PyCharm 2020.2.1\plugins\python\helpers\pydev_pydev_imps_pydev_execfile.py", line 18, in execfile
    exec(compile(contents+"\n", file, 'exec'), glob, loc)
  File "E:/Documents/PycharmProjects/bert/getBertWordvec.py", line 7, in
    outputs = model(input_ids)
  File "F:\Anaconda3\envs\dnabert\lib\site-packages\torch\nn\modules\module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "F:\Anaconda3\envs\dnabert\lib\site-packages\pytorch_transformers\modeling_bert.py", line 707, in forward
    embedding_output = self.embeddings(input_ids, position_ids=position_ids, token_type_ids=token_type_ids)
  File "F:\Anaconda3\envs\dnabert\lib\site-packages\torch\nn\modules\module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "F:\Anaconda3\envs\dnabert\lib\site-packages\pytorch_transformers\modeling_bert.py", line 252, in forward

Hi, my input data length is 3000, which is what triggers this error. Could I fix it by changing your code, for example by changing the maximum token indices sequence length?

Zhihan1996 · 2021-04-21T01:23:11Z

Hi,

To process long sequences, please use --model_type dnalong, and set the max sequence length as a multiple of 512 (e.g., 3072). Then the model should work well.
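For the arithmetic behind "a multiple of 512", here is a minimal Python sketch. It only shows how 3000 rounds up to 3072 and how a padded input divides evenly into 512-token segments. The helper names and `pad_id=0` are hypothetical, and this is not a description of how `dnalong` handles long inputs internally.

```python
SEGMENT_LEN = 512  # per-model limit reported in the warning (3000 > 512)

def round_up_to_multiple(length, base=SEGMENT_LEN):
    """Return the smallest multiple of `base` that is >= `length`."""
    return -(-length // base) * base  # integer ceiling division

def split_into_segments(token_ids, segment_len=SEGMENT_LEN, pad_id=0):
    """Pad `token_ids` to a multiple of `segment_len`, then cut into equal segments.

    `pad_id=0` is a placeholder value chosen for illustration only.
    """
    ids = list(token_ids)
    padded_len = round_up_to_multiple(len(ids), segment_len)
    ids += [pad_id] * (padded_len - len(ids))
    return [ids[i:i + segment_len] for i in range(0, padded_len, segment_len)]

print(round_up_to_multiple(3000))        # 3072
segments = split_into_segments(range(3000))
print(len(segments), len(segments[0]))   # 6 512
```

With a 3000-token input, the rounded length is 3072, i.e. six segments of 512 with 72 padding positions at the end of the last one.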

BinchaoPeng · 2021-04-21T02:09:39Z

OK, I will try. Thanks!

jerryji1993 · 2021-04-27T03:06:08Z

Closed #18.

jerryji1993 closed this as completed Apr 27, 2021

BinchaoPeng mentioned this issue Apr 30, 2021

some new questions about how to process seq which is more than 512 #27

Closed
