Pinned Loading
-
CyberAgentAILab/annotation-efficient-po
CyberAgentAILab/annotation-efficient-po PublicCode of "Annotation-Efficient Preference Optimization for Language Model Alignment"
-
CyberAgentAILab/regularized-bon
CyberAgentAILab/regularized-bon PublicCode of "Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment" (2024).
Python 7
-
CyberAgentAILab/model-based-mbr
CyberAgentAILab/model-based-mbr PublicCode of "Model-Based Minimum Bayes Risk Decoding for Text Generation" 2024
Jupyter Notebook 3
-
CyberAgentAILab/diverse-mbr
CyberAgentAILab/diverse-mbr PublicCode of "Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding" 2024
Python 2
-
CyberAgentAILab/adaptive-mbr
CyberAgentAILab/adaptive-mbr PublicCode of "Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding" 2024
Python 1
If the problem persists, check the GitHub status page or contact support.