Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Temperature application order non standard? #4091

Closed
electronjoe opened this issue Nov 15, 2023 · 2 comments
Closed

Temperature application order non standard? #4091

electronjoe opened this issue Nov 15, 2023 · 2 comments

Comments

@electronjoe
Copy link

I was reading a really interesting piece on Reddit regarding samplers, and a particularly interesting exchange came up which appears to have highlighted a discrepancy between the order llama.cpp applies temperature (to probabilities) while research literature / other implementations apply temperature earlier in the chain (to logits).

I thought it would be unfortunate for this discussion to die without visibility and discussion, so I've tossed up a GH issue.

I would normally look for historical / closed issues that are related, but I'm on my phone and that's rather complex.

The interesting Reddit discussion:

https://www.reddit.com/r/LocalLLaMA/s/WonSDiMCoD

@MaggotHATE
Copy link
Contributor

There was a discussion of this topic in min_p PR, but after it was already merged. I remember playing with temp being first after that, but a customizable samplers order seems like a better idea anyway.

@electronjoe
Copy link
Author

There was a discussion of this topic in min_p PR, but after it was already merged. I remember playing with temp being first after that, but a customizable samplers order seems like a better idea anyway.

Thanks for the pointer! Closing this issue as it's on the radar of those that matter:)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants