Llama3-8b 🦙 #653

khatwanimohit · 2024-05-16T21:09:52Z

No description provided.

qihqi · 2024-05-17T16:38:49Z

MaxText/tokenizer.py

-    sp_model = model_fp.read()
-  sp_tokenizer = tftxt.SentencepieceTokenizer(model=sp_model, add_bos=add_bos, add_eos=add_eos, reverse=reverse)
-  return sp_tokenizer
+class TikToken():


qq; why not import and use this: https://github.com/google/JetStream/blob/main/jetstream/engine/token_utils.py#L353

The reason I created a class for it and not used the jetstream is because of the error below. Our input pipeline is tfds based so we are tokenizing Symbolic Tensors instead of np array's

File "/__w/maxtext/maxtext/MaxText/tokenizer.py", line 79, in __call__ * features[k], _ = self.sp_tokenizer.encode(str(features[k]), is_bos = self.add_bos, is_eos = self.add_eos) File "/usr/local/lib/python3.10/dist-packages/jetstream/engine/token_utils.py", line 271, in encode * tokens = np.array(self.vocab.encode_tf(s)) NotImplementedError: Cannot convert a symbolic tf.Tensor (SentenceTokenizer/SentenceTokenizer/SentencepieceTokenizeOp:0) to a numpy array. This error may indicate that you're trying to pass a Tensor to a NumPy call, which is not supported

Do you suggest another way to get around this problem ?

I don't think the Tiktoken class you have created can handle tf.Tensor as well.

Looks like the SentencePieceTokenizer you used from tensorflow_text made it into a tf op. And I don't think tensorflow_text has tiktoken.

Modifying maxtext to pass in numpy arrays (for both tiktoken and sentencepiece) should be the way to go.

Tiktoken class here works with tf.Tensors. See the tests passing here: http://shortn/_YvmkdOTxIQ.

MaxText/configs/models/llama3-8b.yml

MaxText/input_pipeline/input_pipeline_interface.py

MaxText/tokenizer.py

end_to_end/tpu/llama3/8b/2_test_llama3_8b.sh

MaxText/tokenizer.py

MaxText/input_pipeline/input_pipeline_interface.py

MaxText/maxengine.py

MaxText/scratch_code/golden_llama3-8b_export.ipynb

MaxText/tests/tokenizer_test.py

MaxText/tokenizer.py

end_to_end/tpu/llama3/8b/2_test_llama3_8b.sh

khatwanimohit requested a review from A9isha May 16, 2024 21:09

khatwanimohit requested review from rwitten and gobbleturk as code owners May 16, 2024 21:09

khatwanimohit force-pushed the mohit/llama3 branch 4 times, most recently from c6485e2 to 70a8fa6 Compare May 16, 2024 23:20

khatwanimohit assigned rwitten May 16, 2024

khatwanimohit force-pushed the mohit/llama3 branch from 70a8fa6 to 4053b58 Compare May 16, 2024 23:29

qihqi reviewed May 17, 2024

View reviewed changes

gobbleturk requested changes May 17, 2024

View reviewed changes

rwitten assigned gobbleturk and unassigned rwitten May 17, 2024

khatwanimohit force-pushed the mohit/llama3 branch 3 times, most recently from 73ee9c1 to 1761056 Compare May 20, 2024 22:53

mrinal-essential reviewed May 22, 2024

View reviewed changes

MaxText/tokenizer.py Outdated Show resolved Hide resolved

khatwanimohit force-pushed the mohit/llama3 branch 6 times, most recently from b5f18ea to 2f3d8a2 Compare May 24, 2024 17:33

A9isha reviewed May 24, 2024

View reviewed changes

khatwanimohit force-pushed the mohit/llama3 branch from 2f3d8a2 to 6dc3318 Compare May 28, 2024 16:24

gobbleturk approved these changes May 28, 2024

View reviewed changes

github-actions bot added the pull ready label May 28, 2024

gobbleturk removed their assignment May 28, 2024

Llama3-8b model config

6a9e519

khatwanimohit force-pushed the mohit/llama3 branch from 6dc3318 to 6a9e519 Compare May 28, 2024 21:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Llama3-8b 🦙 #653

Llama3-8b 🦙 #653

khatwanimohit commented May 16, 2024

qihqi May 17, 2024

khatwanimohit May 17, 2024

qihqi May 21, 2024

khatwanimohit May 21, 2024

Llama3-8b 🦙 #653

Are you sure you want to change the base?

Llama3-8b 🦙 #653

Conversation

khatwanimohit commented May 16, 2024

qihqi May 17, 2024

Choose a reason for hiding this comment

khatwanimohit May 17, 2024

Choose a reason for hiding this comment

qihqi May 21, 2024

Choose a reason for hiding this comment

khatwanimohit May 21, 2024

Choose a reason for hiding this comment