Skip to content
View lfopensource's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Menlo Park

Block or report lfopensource

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
lfopensource/README.md
  • 👋 Hi, I’m Lingling Fan.
  • 🎓 Stanford PhD in EE, previous: Google DeepMind, Meta AI Research.
  • 👀 I’m interested in LLM inference/serving with quantization and parallelism
  • 🌱 I’m currently focused on long-context vide and image undersanding/reasoning.
  • 📫 How to reach me: linglingfan@stanford.edu
  • 😄 Pronouns: She/Her

Pinned Loading

  1. SageAttention Public

    Forked from thu-ml/SageAttention

    Quantized Attention achieves speedup of 2-3x and 3-5x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

    Cuda

  2. gemini-cli Public

    Forked from google-gemini/gemini-cli

    An open-source AI agent that brings the power of Gemini directly into your terminal.

    TypeScript