Skip to content
@usyd-fsalab

FSA

Popular repositories Loading

  1. fp6_llm fp6_llm Public

    An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

    Cuda 164 14

  2. NeuralNetworkRandomness NeuralNetworkRandomness Public

    Python 12

  3. ReadingList ReadingList Public

    11

  4. FSA FSA Public

    Webpage for FSA

    HTML 1

  5. flash-llm flash-llm Public

    Forked from AlibabaResearch/flash-llm

    Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

    Cuda 1

  6. ConferenceTalk ConferenceTalk Public

    Conference talks given by FSA Lab, University of Sydney

Repositories

Showing 7 of 7 repositories
  • fp6_llm Public

    An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

    usyd-fsalab/fp6_llm’s past year of commit activity
    Cuda 164 Apache-2.0 14 4 0 Updated May 28, 2024
  • blog Public Forked from huggingface/blog

    Public repo for HF blog posts

    usyd-fsalab/blog’s past year of commit activity
    Jupyter Notebook 0 690 0 0 Updated Oct 25, 2023
  • FSA Public

    Webpage for FSA

    usyd-fsalab/FSA’s past year of commit activity
    HTML 1 0 0 0 Updated Oct 3, 2023
  • flash-llm Public Forked from AlibabaResearch/flash-llm

    Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

    usyd-fsalab/flash-llm’s past year of commit activity
    Cuda 1 Apache-2.0 13 0 0 Updated Sep 24, 2023
  • usyd-fsalab/ReadingList’s past year of commit activity
    11 0 0 0 Updated Apr 27, 2022
  • usyd-fsalab/NeuralNetworkRandomness’s past year of commit activity
    Python 12 MIT 0 0 0 Updated Mar 18, 2022
  • ConferenceTalk Public

    Conference talks given by FSA Lab, University of Sydney

    usyd-fsalab/ConferenceTalk’s past year of commit activity
    0 0 0 0 Updated Jul 28, 2021

Top languages

Loading…

Most used topics

Loading…