Open
Description
Describe the bug
I see cases where the model would return a result that includes the string ..."<|end|><|user|>"...
Looks like its breaking out of the <|assistant|> token instructions.
It would be good if next version would allow
- Better support of specifying stop token
- Support for canceling a generation
#2 would allow to implement our own stop sequence in a streaming response.
Steps to reproduce the bug
This took running the model at scale to detect
Expected behavior
No response
Screenshots
No response
NuGet package version
None
Packaging type
No response
Windows version
No response
IDE
No response
Additional context
No response