feat(llama2): add template for chat messages #782
Conversation
pkg/model/loader.go (outdated)

```go
if err := ml.loadTemplateIfExists(modelName, modelFile); err != nil {
	return "", err
}

func evaluateTemplate[T any](templateName string, in T, modelPath string, templateMap *(map[string]*template.Template), mutex *sync.Mutex) (string, error) {
```
any specific need for the generics here? I'd keep them only when strictly needed, otherwise it's just confusing (what's the gain over using an interface?)
Hey @mudler! I'm still catching on to Go style, but I'll admit I'm a big fan of generics in most cases... probably more than I should be!

I suppose we could rewrite this to use interface{} instead of T - personally, I think the T approach is clearer to read, as we could point at line 119, for example, to see that TemplateForChatMessage expects and only works with ChatMessageTemplateData, and not interface{}. Is there another way to express this type of constraint in Go that I've overlooked?

Theoretically, we could just drop the generic parameter and pass it in as interface{} if that convention is already clear to most Go programmers :)
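To make the tradeoff being discussed concrete, here is a minimal sketch (with hypothetical names - `evalGeneric`, `evalAny`, and `msgData` are illustrative stand-ins, not the PR's actual code) contrasting the generic signature with the `interface{}`/`any` one. The generic version makes the expected data type visible at the call site; the `any` version is simpler but loses that information:

```go
package main

import (
	"bytes"
	"fmt"
	"text/template"
)

// msgData is a hypothetical stand-in for ChatMessageTemplateData.
type msgData struct {
	Role    string
	Content string
}

// Generic version: the call site documents exactly which data type
// the template is expected to receive.
func evalGeneric[T any](tmpl string, in T) (string, error) {
	t, err := template.New("t").Parse(tmpl)
	if err != nil {
		return "", err
	}
	var buf bytes.Buffer
	if err := t.Execute(&buf, in); err != nil {
		return "", err
	}
	return buf.String(), nil
}

// any version: a simpler signature, but the expected type is no
// longer visible where the function is called.
func evalAny(tmpl string, in any) (string, error) {
	return evalGeneric[any](tmpl, in)
}

func main() {
	out, _ := evalGeneric[msgData]("{{.Role}}: {{.Content}}", msgData{Role: "user", Content: "hi"})
	fmt.Println(out) // user: hi
}
```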
I'm fine either way. I'm just not super convinced in this case because it raises code complexity (see e.g. the pointer to a map) for no real gain
pkg/model/loader.go (outdated)

```go
}

func (ml *ModelLoader) TemplateForChatMessage(templateName string, messageData ChatMessageTemplateData) (string, error) {
	return evaluateTemplate[ChatMessageTemplateData](templateName, messageData, ml.ModelPath, &(ml.chatMessageTemplates), &(ml.mu))
```
just wondering, why passing the pointer to the map around?
Passing the pointer to the map is a consequence of evaluateTemplate[T] being generic - see the other comment for the style choice on why this line is generic in the first place.

Since that function is generic, it can't be a method on *ModelLoader, so I just have it accept a pointer to the relevant lock and map... admittedly a hack, but at least it's contained to the private helper in this file.

If we de-generic that method, I'd presumably remove it.
Looks good overall! Great job @dave-gray101 and @tmm1! Just a few nits from my side, but they shouldn't block this.
api/openai/chat.go (outdated)

```go
content = fmt.Sprint(*i.Content)

if templatedChatMessage == "" {
	log.Warn().Msgf("template \"%s\" produced blank output for %+v. Skipping!", config.TemplateConfig.ChatMessage, chatMessageData)
	continue
```
maybe instead of skipping here it would be better to skip templating and use the message as-is? making it disappear would probably be confusing
Well - I think we'd theoretically want to allow a template to deliberately ignore a message and return "" - so the continue here really only avoids appending "" to a slice of strings; not skipping it would just result in one additional \n.

I admit it's probably not super clear that that's the only effect of the statement, so I'll see if I can revise it!

The Warn is there because I assume a user will rarely intend for a template to return "", so we shouldn't call it an outright error, but it's worth surfacing when debug=true since most of the time it's probably a template logic error.
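To illustrate the "only effect" claim above: a minimal sketch (with a hypothetical `buildPrompt` helper, not the PR's actual code) of why skipping empty rendered messages before a join avoids a stray blank line in the final prompt:

```go
package main

import (
	"fmt"
	"strings"
)

// buildPrompt joins rendered chat messages into one prompt, dropping
// any message a template rendered to "".
func buildPrompt(rendered []string) string {
	var kept []string
	for _, m := range rendered {
		if m == "" {
			// Without this skip, the Join below would emit an
			// extra "\n" for the blank entry.
			continue
		}
		kept = append(kept, m)
	}
	return strings.Join(kept, "\n")
}

func main() {
	fmt.Printf("%q\n", buildPrompt([]string{"user: hi", "", "assistant: hello"}))
	// "user: hi\nassistant: hello"
}
```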
api/openai/chat.go (outdated)

```go
templatedChatMessage, err := o.Loader.TemplateForChatMessage(config.TemplateConfig.ChatMessage, chatMessageData)
if err != nil {
	log.Error().Msgf("error processing message %+v using template \"%s\": %v. Skipping!", chatMessageData, config.TemplateConfig.ChatMessage, err)
	continue
```
ditto on skipping
In this case, however, I think there's a much stronger argument for falling back on the sprintf formatting, as long as we keep the error message for the template error :)
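A minimal sketch of the fallback shape being agreed on here - on a template error, log it but format the message plainly instead of dropping it. The helper names (`renderTemplate`, `formatMessage`, `chatMessage`) are illustrative stand-ins, not the PR's actual code:

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

type chatMessage struct {
	Role    string
	Content string
}

// renderTemplate is a hypothetical stand-in for TemplateForChatMessage.
func renderTemplate(tmpl string, m chatMessage) (string, error) {
	t, err := template.New("m").Parse(tmpl)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	if err := t.Execute(&sb, m); err != nil {
		return "", err
	}
	return sb.String(), nil
}

// formatMessage keeps the error visible but falls back to plain
// role: content formatting rather than losing the message.
func formatMessage(tmpl string, m chatMessage) string {
	out, err := renderTemplate(tmpl, m)
	if err != nil {
		fmt.Printf("template error: %v; falling back\n", err)
		return fmt.Sprintf("%s: %s", m.Role, m.Content)
	}
	return out
}

func main() {
	m := chatMessage{Role: "user", Content: "hi"}
	fmt.Println(formatMessage("{{.Role}} says {{.Content}}", m))
	fmt.Println(formatMessage("{{.Broken", m)) // parse error -> fallback
}
```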
Thanks for your contribution. I suggest rebasing the 13 commits into a few reasonable commits; that will make it easier for us to split out the features.
…plexity still, but cleans up the weird map pointers.
I was planning on squash merging the PR to a single commit, but I can rewrite history into a new branch and merge as several if that's the style we prefer.

@dave-gray101 that's fine, just squash them during merge
Nice!
Description
Lays some of the groundwork for LLAMA2 compatibility as well as other future models with complex prompting schemes.
sprintf method.

Big thanks to @tmm1, who helped work out the proper LLAMA2 prompt text as well as helping with testing!
Notes for Reviewers
This obviously isn't 100% compatible with LLAMA2 yet. We'll need some upstream changes for the larger models, as well as proper BOS/EOS support... but we might as well get started now.
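For reviewers unfamiliar with the format being targeted: the LLAMA2 chat scheme wraps each turn in `[INST] ... [/INST]`, with an optional `<<SYS>>` block in the first turn. A rough sketch of such a turn as a Go text/template - the field names (`System`, `Prompt`) and the helper are illustrative, not this PR's actual config keys, and BOS/EOS tokens are omitted since, as noted above, proper BOS/EOS support still needs upstream changes:

```go
package main

import (
	"bytes"
	"fmt"
	"text/template"
)

// Roughly the LLAMA2 single-turn prompt shape (BOS/EOS omitted).
const llama2Turn = "[INST] <<SYS>>\n{{.System}}\n<</SYS>>\n\n{{.Prompt}} [/INST]"

func renderTurn(system, prompt string) (string, error) {
	t, err := template.New("llama2").Parse(llama2Turn)
	if err != nil {
		return "", err
	}
	var buf bytes.Buffer
	err = t.Execute(&buf, struct{ System, Prompt string }{system, prompt})
	return buf.String(), err
}

func main() {
	out, _ := renderTurn("You are a helpful assistant.", "Hello!")
	fmt.Println(out)
}
```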
Signed commits