
Merge LoRA weights to LLM at initialization time on-device (Gemma) #5255

Open
sitatec opened this issue Mar 24, 2024 · 2 comments
Assignees
Labels
platform:android Issues with Android as Platform platform:ios MediaPipe IOS issues platform:javascript MediaPipe Javascript issues stat:awaiting googler Waiting for Google Engineer's Response task:LLM inference Issues related to MediaPipe LLM Inference Gen AI setup type:feature Enhancement in the New Functionality or Request for a New Solution

Comments

@sitatec

sitatec commented Mar 24, 2024

Have I written custom code (as opposed to using a stock example script provided in MediaPipe)

No

OS Platform and Distribution

Web, Android, iOS

MediaPipe Tasks SDK version

No response

Task name (e.g. Image classification, Gesture recognition etc.)

GenAI

Programming Language and version (e.g. C++, Python, Java)

TypeScript, Java, Swift

Describe the actual behavior

Couldn't find a way to merge LoRA weights into Gemma

Describe the expected behaviour

I want to be able to add a LoRA adapter to Gemma locally (on-device)

Standalone code/steps you may have used to try to get what you need

I'm experimenting with something on the Web with MediaPipe that requires having multiple LoRA files, each trained for a different task. I want to select a LoRA file and merge it into Gemma at initialization time, locally on the web. I went through the code and saw some .proto files with lora_path and lora_rank fields, but I haven't seen any exposed parameter on the LlmInference class or its options that would let me specify a LoRA file.

One option could be (maybe) to use LlmGPUCalculatorOptions.lora_path. However, the current API doesn't expose anything that makes this possible, and I don't even know if it could work, i.e. whether that field is meant for this purpose. Will this option work? If so, I can open a PR for it. If not, how can I achieve this?
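To make the request concrete, here is a rough sketch of the kind of Web API I have in mind. The loraPath option is hypothetical (it does not exist in the current LlmInference options); I'm assuming it would be plumbed through to LlmGPUCalculatorOptions.lora_path so the adapter gets merged into the base weights when the task is created:

```ts
import { FilesetResolver, LlmInference } from '@mediapipe/tasks-genai';

async function createLlmWithLora(loraPath: string): Promise<LlmInference> {
  const genaiFileset = await FilesetResolver.forGenAiTasks(
    'https://cdn.jsdelivr.net/npm/@mediapipe/tasks-genai/wasm'
  );
  return LlmInference.createFromOptions(genaiFileset, {
    baseOptions: {
      // Base Gemma model, loaded as usual.
      modelAssetPath: '/assets/gemma-2b-it-gpu-int4.bin',
    },
    // HYPOTHETICAL option: path to a task-specific LoRA adapter that would be
    // merged into the base model at initialization time, on-device.
    loraPath,
  } as any); // cast needed because this option is not part of the current API
}

// Usage: pick the adapter that matches the task the user selected.
const llm = await createLlmWithLora('/assets/lora/summarization_adapter.bin');
const answer = await llm.generateResponse('Summarize: ...');
```

Something along these lines would cover my use case: one base Gemma model shipped once, plus several small task-specific adapters selected at initialization.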

Other info / Complete Logs

No response

@kuaashish kuaashish assigned kuaashish and unassigned ayushgdev Mar 26, 2024
@kuaashish kuaashish added task:LLM inference Issues related to MediaPipe LLM Inference Gen AI setup type:support General questions platform:javascript MediaPipe Javascript issues platform:android Issues with Android as Platform platform:ios MediaPipe IOS issues labels Mar 26, 2024
@kevin36524

This will be a very useful feature for reducing the app size. Loading the LoRA at run time is also something Google is trying to do with Gemini Nano on high-end devices.

@schmidt-sebastian schmidt-sebastian added type:feature Enhancement in the New Functionality or Request for a New Solution and removed type:support General questions labels Apr 19, 2024
@schmidt-sebastian
Collaborator

Thank you. We are logging this as a feature request.

@kuaashish kuaashish added the stat:awaiting googler Waiting for Google Engineer's Response label Apr 22, 2024