-
AMD
- Sunnyvale, CA
-
13:12
(UTC -07:00) - https://www.linkedin.com/in/junliume/
- @junliume
Block or Report
Block or report junliume
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned Loading
-
-
ROCm/composable_kernel
ROCm/composable_kernel PublicComposable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
-
ROCm/rocComposer
ROCm/rocComposer PublicAMD composer for High Performance Deep Learning Kernels and Libraries
-
ROCm/AITemplate
ROCm/AITemplate PublicForked from facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
-
TorchBench
TorchBench PublicForked from pytorch/benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
Python
-
If the problem persists, check the GitHub status page or contact support.