Remove sync calls and reorder cache ops #96

Merged (3 commits, Apr 9, 2024)

EricLBuehler
Owner

No description provided.

github-actions bot commented Apr 9, 2024

Code Metrics Report
───────────────────────────────────────────────────────────────────────────────
Language                 Files     Lines   Blanks  Comments     Code Complexity
───────────────────────────────────────────────────────────────────────────────
Rust                        46     15519     1090       638    13791        782
───────────────────────────────────────────────────────────────────────────────
Total                       46     15519     1090       638    13791        782
───────────────────────────────────────────────────────────────────────────────
Estimated Cost to Develop 24,786
Estimated Schedule Effort 9.933882 months
Estimated People Required 3.798987
───────────────────────────────────────────────────────────────────────────────
Processed 524586 bytes, 0.525 megabytes (SI)
───────────────────────────────────────────────────────────────────────────────
  

Owner Author

EricLBuehler commented Apr 9, 2024

Performance improvement: 13% (63 -> 72 T/s) on an A10 with Mistral GGUF 4_K_M. We are now at 93% of llama.cpp speed!
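
For readers who want to check the arithmetic behind these figures, here is a minimal sketch. It assumes the rounded throughputs quoted above (63 and 72 T/s) and the stated 93% ratio; the function names are hypothetical and not part of this repository. With the rounded inputs the gain relative to the old throughput comes out near 14%, so the reported 13% presumably reflects unrounded measurements. The llama.cpp throughput computed here is only implied by the 93% figure, not measured.

```rust
/// Relative throughput change from `before` to `after`, as a percentage of `before`.
/// (Hypothetical helper for illustration only.)
fn percent_gain(before: f64, after: f64) -> f64 {
    (after - before) / before * 100.0
}

/// Throughput of a reference implementation, implied by running at
/// `fraction` (0.0..=1.0) of its speed. (Hypothetical helper.)
fn implied_reference(ours: f64, fraction: f64) -> f64 {
    ours / fraction
}

fn main() {
    // Rounded figures from the comment above; the raw measurements are not given.
    let (before, after) = (63.0, 72.0);

    println!("gain: {:.1}%", percent_gain(before, after));
    println!(
        "implied llama.cpp throughput: {:.1} T/s",
        implied_reference(after, 0.93)
    );
}
```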

EricLBuehler merged commit 3a6fc62 into master on Apr 9, 2024 (11 checks passed)
EricLBuehler deleted the remove_sync_calls branch on April 9, 2024 00:46