We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Currently, on-flash gemm only support ARM CPU.
GPU support is needed for usage in various environments and performance comparison between processors.
The text was updated successfully, but these errors were encountered:
The current direction of research is mainly NAS that finds a deep learning model suitable for a given memory size rather than using less memory.
Therefore, it is more valuable to train a model for the GPU and generate an optimized GPU kernel than to apply the technique to the GPU. These works are in progress in the following repositories. https://github.com/SKKU-ESLAB/ANT-Model-DB https://github.com/SKKU-ESLAB/Auto-Compression
So, I close this issue.
Sorry, something went wrong.
No branches or pull requests
Currently, on-flash gemm only support ARM CPU.
GPU support is needed for usage in various environments and performance comparison between processors.
The text was updated successfully, but these errors were encountered: