Skip to content
View lkevinzc's full-sized avatar
🎯
Learning
🎯
Learning

Organizations

@mosecorg

Block or report lkevinzc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. mosecorg/mosec Public

    A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

    Python 834 63

  2. sail-sg/oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

    Python 299 16

  3. sail-sg/understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python 733 31

  4. sail-sg/oat-zero Public

    A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

    Python 217 10

  5. sail-sg/dice Public

    Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

    Python 43 3

  6. sail-sg/rosmo Public

    Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

    Python 28