Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sample on GPU #67

Closed
wants to merge 29 commits into from
Closed

Sample on GPU #67

wants to merge 29 commits into from

Conversation

EricLBuehler
Copy link
Owner

Refs #66.

@EricLBuehler EricLBuehler added optimization processing Processing related to the model labels Apr 3, 2024
@EricLBuehler
Copy link
Owner Author

It appears that the sorting takes the longest amount of time.

Copy link

github-actions bot commented Apr 6, 2024

Code Metrics Report
  ───────────────────────────────────────────────────────────────────────────────
Language                 Files     Lines   Blanks  Comments     Code Complexity
───────────────────────────────────────────────────────────────────────────────
Rust                        46     15509     1090       638    13781        781
───────────────────────────────────────────────────────────────────────────────
Total                       46     15509     1090       638    13781        781
───────────────────────────────────────────────────────────────────────────────
Estimated Cost to Develop 24,463
Estimated Schedule Effort 9.931007 months
Estimated People Required 3.797193
───────────────────────────────────────────────────────────────────────────────
Processed 524223 bytes, 0.524 megabytes (SI)
───────────────────────────────────────────────────────────────────────────────
  

@EricLBuehler
Copy link
Owner Author

Developments on master have made this obsolete, and there is a <2 T/s improvement over master on an A10 (mistral GGUF Q4_K_M).

@EricLBuehler EricLBuehler deleted the gpu_sampling branch April 9, 2024 18:12
@EricLBuehler EricLBuehler restored the gpu_sampling branch May 13, 2024 18:32
@EricLBuehler EricLBuehler deleted the gpu_sampling branch May 13, 2024 18:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
optimization processing Processing related to the model
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant