LaCoMSA: Language-Consistency Multilingual Self-Alignment with Latent Representation Rewarding

Khanh-Tung Tran, Barry O'Sullivan, Hoang D. Nguyen

Accepted to EACL 2026 (Main Conference)

Overview

Large Language Models (LLMs) have achieved impressive performance yet remain inconsistent across languages, often defaulting to high-resource outputs such as English. Existing multilingual alignment methods mitigate these issues through preference optimization but rely on external supervision, such as translation systems or English-biased signal. We propose Multilingual Self-Alignment (MSA), a preference optimization framework that leverages an LLM’s own latent representations as intrinsic supervision signals, rewarding lower-resource language outputs based on their alignment with high-resource (English) counterparts in the ``semantic hub''. We further introduce Language-Consistency MSA (LaCoMSA), which augments MSA with a final-layer language-consistency factor to prevent off-target generation. Integrated with Direct Preference Optimization, LaCoMSA improves a Llama 3 8B-based model multilingual win rates by up to 6.8% absolute (55.0% relatively) on X-AlpacaEval and achieves consistent gains across benchmarks and models. Our findings demonstrate that LaCoMSA can serve as an effective and scalable mechanism, opening a new venue toward multilingual self-alignment.

This repository contains the cleaned implementation for the LaCoMSA paper.

Repository Structure

Alignment/        # DPO training scripts
Preprocess/       # Data & reward generation scripts
Data/
requirement.txt

Quick Start

Environment:

python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt

Prepare preference data:

cd Preprocess
bash preprocess.sh # more details available in the bash script

Train with. DPO:

cd Alignment
bash dpo.sh example.json

Citation

TBU

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Alignment		Alignment
Data/ultrafeedback_binarized/subset		Data/ultrafeedback_binarized/subset
Preprocess		Preprocess
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LaCoMSA: Language-Consistency Multilingual Self-Alignment with Latent Representation Rewarding

Overview

Repository Structure

Quick Start

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LaCoMSA: Language-Consistency Multilingual Self-Alignment with Latent Representation Rewarding

Overview

Repository Structure

Quick Start

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages