Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix : cuda order of synchronization when setting a buffer #679

Merged
merged 2 commits into from
Jan 5, 2024

Conversation

Green-Sky
Copy link
Contributor

@Green-Sky Green-Sky commented Jan 4, 2024

we need to wait after the set to wait for all data to arrive.

see here leejet/stable-diffusion.cpp#129 (comment)
and here leejet/stable-diffusion.cpp#129 (comment)

@Green-Sky
Copy link
Contributor Author

sd.cpp without this:
output_3

@slaren
Copy link
Collaborator

slaren commented Jan 4, 2024

I have added a fix for this in ggerganov/llama.cpp#4766. I think there are still some missing synchronizations in this implementation. Since sd.cpp already has a workaround, I think we can wait until the next sync with llama.cpp.

@slaren slaren merged commit 3eace58 into ggerganov:master Jan 5, 2024
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants