Add Echo parameter to llama runner, jni+java layer, and demo app #5011
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5011
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit d081b02 with merge base c83fd2e.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Looks good!
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Summary: The echo flag was added on the runner side via pytorch#5011 by Chirag. Now we update the app side to leverage the new echo flag, so that we don't display the user prompt in the response. Differential Revision: D62250116
```cpp
// Forward the decoded token to the token callback only when echo is enabled.
if (echo) {
  wrapped_callback(ET_UNWRAP(tokenizer_->decode(cur_token, cur_token)));
}
```
I think we should always call `wrapped_callback`; otherwise we are missing the first generated token.
From my testing, this callback would just return a newline. We also tested this and the output works well.
Hmm, I'm saying it doesn't matter what that token is. Semantically it is the first token the model generates by prefilling the prompt, so we should trigger the callback on this one.
Got it! @larryliu0820, I've removed the echo gating around the first token generation.
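To make the agreed behavior concrete, here is a minimal, self-contained Java sketch; the real runner is C++, and `PrefillCallbackSketch`, its method names, and the newline first token are illustrative assumptions rather than ExecuTorch code. The point it models: echo gates only the prompt text, while the token produced by prefill always reaches the callback.

```java
import java.util.List;
import java.util.function.Consumer;

public class PrefillCallbackSketch {
  // echo gates only the decoded prompt text; the token produced while
  // prefilling the prompt is always forwarded to the callback.
  static String prefill(
      List<String> promptTokens, boolean echo, Consumer<String> callback) {
    if (echo) {
      promptTokens.forEach(callback); // prompt text is opt-in only
    }
    String firstGenerated = "\n"; // assumed value, per the report above
    callback.accept(firstGenerated); // always fired, echo or not
    return firstGenerated;
  }

  public static void main(String[] args) {
    // With echo off, only the first generated token reaches the callback.
    prefill(List.of("Hello", ", world"), false, System.out::print);
  }
}
```

Run as-is, main prints only the newline, matching the report above that the first prefill token is typically just a newline.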
```java
public void onResult(String result) {
  mResultMessage.appendText(result);
  run();
  if (result.equals("\n\n")) {
    // ...
  }
}
```
@Riandy FYI re: this check on the app side.
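One plausible reading of that check, sketched below as self-contained Java. This is an assumption, since the body of the `if` isn't shown in this excerpt, and `ResultFilterSketch` with its `StringBuilder` stand-in for `mResultMessage` is illustrative rather than the demo app's actual code: skip a leading blank token so it never reaches the message view.

```java
public class ResultFilterSketch {
  private final StringBuilder resultMessage = new StringBuilder();

  public void onResult(String result) {
    // Drop a leading blank token so it is never displayed.
    if (result.equals("\n\n") && resultMessage.length() == 0) {
      return;
    }
    resultMessage.append(result);
  }

  public static void main(String[] args) {
    ResultFilterSketch sketch = new ResultFilterSketch();
    sketch.onResult("\n\n");  // filtered: nothing appended yet
    sketch.onResult("Paris"); // appended normally
    System.out.println(sketch.resultMessage); // prints "Paris"
  }
}
```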
Force-pushed from 2b7730d to a4b63e0.
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
This allows the developer to set whether the response should echo (i.e., include) the input prompt. We default to true, since that maintains the existing flow.
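A hedged sketch of the Java-layer surface this implies; `LlamaModuleSketch` and its `generate` signatures are illustrative assumptions, not the actual ExecuTorch `LlamaModule` API. An overload without the parameter delegates with echo=true, so existing callers keep the prompt-echoing behavior.

```java
import java.util.function.Consumer;

public class LlamaModuleSketch {
  // Existing entry point: keeps the old behavior by defaulting echo to true.
  public int generate(String prompt, int seqLen, Consumer<String> onResult) {
    return generate(prompt, seqLen, onResult, /* echo= */ true);
  }

  // New overload: when echo is false, the prompt is not sent back
  // through the callback, so the app never displays it.
  public int generate(
      String prompt, int seqLen, Consumer<String> onResult, boolean echo) {
    if (echo) {
      onResult.accept(prompt); // include the input prompt in the response
    }
    // ... run inference here, forwarding each generated token to onResult ...
    return 0;
  }
}
```

Defaulting the new parameter to true keeps the change backward compatible: only callers that opt in see the prompt stripped from the streamed response.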