
Examples for Llama model architecture #2

Closed
okpatil4u opened this issue Sep 14, 2023 · 5 comments

Comments

@okpatil4u
okpatil4u commented Sep 14, 2023

Hello Eric, this looks like great work ! Thank you !!

Can you please add examples for both training and inference for Llama model using candle-lora ? Is it supported through this work ?

@EricLBuehler
Owner

Yes! With my candle-lora-macro library, all you need to do is derive AutoLoraConvert and add the replace_layer_fields attribute to all of the model structs of a Llama model. These macros replace the concrete layer types and automate the conversion process. Then, call the conversion method on each model struct.

I plan on adding an example shortly. If you have any questions, let me know!
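The layer-swapping pattern described above can be pictured with a small, self-contained sketch. This is plain Rust with no candle dependency, and every name in it (`Layer`, `get_lora_model`, the scalar "weights") is an illustrative stand-in for what candle-lora-macro generates, not the actual API:

```rust
// A plain linear layer: y = w * x (scalars stand in for weight matrices).
struct Linear { w: f64 }

// A LoRA-adapted layer: y = (w + scale * b * a) * x,
// where a and b are the trainable low-rank factors.
struct LoraLinear { w: f64, a: f64, b: f64, scale: f64 }

// The "replaced" field type: either the concrete layer or its LoRA wrapper.
enum Layer {
    Plain(Linear),
    Lora(LoraLinear),
}

impl Layer {
    fn forward(&self, x: f64) -> f64 {
        match self {
            Layer::Plain(l) => l.w * x,
            Layer::Lora(l) => (l.w + l.scale * l.b * l.a) * x,
        }
    }
}

// Stand-in for what the derive macro automates: walk each layer field
// and swap the concrete type for its LoRA counterpart.
struct Model { proj: Layer }

impl Model {
    fn get_lora_model(&mut self, scale: f64) {
        if let Layer::Plain(l) = &self.proj {
            // New LoRA factors start at zero, so the conversion is a no-op
            // on outputs until fine-tuning updates a and b.
            self.proj = Layer::Lora(LoraLinear { w: l.w, a: 0.0, b: 0.0, scale });
        }
    }
}

fn main() {
    let mut m = Model { proj: Layer::Plain(Linear { w: 2.0 }) };
    let before = m.proj.forward(3.0);
    m.get_lora_model(1.0);
    let after = m.proj.forward(3.0);
    assert_eq!(before, after);
    println!("before={before}, after={after}");
}
```

The enum plays the role of the generic/trait-object field the macro substitutes for the concrete type, which is why existing forward code keeps working after conversion.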

@okpatil4u
Author

okpatil4u commented Sep 14, 2023 via email

@EricLBuehler
Owner

Yes, once you convert to a LoRA model you can fine-tune it. After fine-tuning, you can merge the weights to speed up inference.
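The merging step mentioned here can be shown numerically with a minimal sketch (scalars stand in for the weight matrices; the function names are illustrative, not candle-lora's API). Folding the low-rank update into the base weight once, offline, turns the two-path LoRA forward pass back into a single multiply:

```rust
// Unmerged LoRA inference: base path plus the low-rank branch.
fn forward_unmerged(w: f64, a: f64, b: f64, scale: f64, x: f64) -> f64 {
    w * x + scale * (b * (a * x))
}

// Merge once after fine-tuning: W' = W + scale * B * A.
fn merge(w: f64, a: f64, b: f64, scale: f64) -> f64 {
    w + scale * b * a
}

fn main() {
    let (w, a, b, scale, x) = (2.0, 0.5, 4.0, 0.1, 3.0);
    let unmerged = forward_unmerged(w, a, b, scale, x);
    let merged = merge(w, a, b, scale) * x;
    // Identical output, but the merged form does one multiply per layer,
    // which is where the inference speedup comes from.
    assert!((unmerged - merged).abs() < 1e-12);
    println!("unmerged={unmerged}, merged={merged}");
}
```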

@EricLBuehler
Owner

I am closing this so that it does not become a stale issue, but feel free to reopen. I will be adding a LoRA example for Llama soon!

@okpatil4u
Author

okpatil4u commented Sep 15, 2023 via email
