Skip to content
View alexngng's full-sized avatar
  • alibaba cloud
  • Beijing
  • 05:43 (UTC +08:00)

Block or report alexngng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. CUDA-Learn-Note CUDA-Learn-Note Public

    Forked from DefTruth/CUDA-Learn-Notes

    🎉CUDA 笔记 / 高频面试题汇总 / C++笔记,个人笔记,更新随缘: sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

    Cuda 6

  2. FasterTransformer FasterTransformer Public

    Forked from NVIDIA/FasterTransformer

    Transformer related optimization, including BERT, GPT

    C++

  3. ChatGLM2-6B ChatGLM2-6B Public

    Forked from THUDM/ChatGLM2-6B

    ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

    Python

  4. ChatGLM-6B ChatGLM-6B Public

    Forked from THUDM/ChatGLM-6B

    ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

    Python

  5. llama llama Public

    Forked from meta-llama/llama

    Inference code for LLaMA models

    Python

  6. tensor_parallel tensor_parallel Public

    Forked from BlackSamorez/tensor_parallel

    Automatically split your PyTorch models on multiple GPUs for training & inference

    Python