Skip to content
View Peter-Cao89's full-sized avatar

Block or report Peter-Cao89

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. incubator-mxnet incubator-mxnet Public

    Forked from apache/mxnet

    Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

    C++

  2. cuda-samples-comments cuda-samples-comments Public

    Forked from NVIDIA/cuda-samples

    Add comments in "Samples for CUDA Developers which demonstrates features in CUDA Toolkit".Fork from NVIDIA

    C

  3. ray-comment ray-comment Public

    Forked from ray-project/ray

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

    Python

  4. flash-attention flash-attention Public

    Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python

  5. FasterTransformer FasterTransformer Public

    Forked from NVIDIA/FasterTransformer

    Transformer related optimization, including BERT, GPT

    C++

  6. llm-awq llm-awq Public

    Forked from mit-han-lab/llm-awq

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python