EmbFilter

Official Implementation for paper "Your Embedding Matrix is secretly a Feature Lens for Text Embeddings". This repository introduces a simple, lightweight linear filter designed to refine zero-shot text embeddings.

tips

recommend: python 3.10, torch==2.6.0, mteb==1.4.0, transformers==4.52.3
if fail to load SickrSTS, change path MMathematica/sickr-sts in the mteb package (*/mteb/tasks/STS/en/SickrSTS.py) to mteb/sickr-sts
if fail to load MindSmallReranking, try datasets==2.18 by pip install datasets==2.18

run

run the EmbFilter with python run4qwen_prompteol.py --filter_ratio 2
filter_ratio is the ratio of dims to be saved, e.g., filter_ratio=1 means saving 1/1=100% dims, filter_ratio=2 means saving 1/2=50% dims, and so on.

Reference

This paper has informed us a new design for LLM text embedding training, which stays tuned for the release! If you find this code useful useful for your research, please cite our paper.

@misc{wu2026unembeddingmatrixsecretlyfeature,
      title={Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings}, 
      author={Songhao Wu and Zhongxin Chen and Yuxuan Liu and Heng Cui and Cong Li and Rui Yan},
      year={2026},
      eprint={2606.07502},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2606.07502}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
eval.py		eval.py
requirements.txt		requirements.txt
run4llama_echo.py		run4llama_echo.py
run4llama_prompteol.py		run4llama_prompteol.py
run4mistral_echo.py		run4mistral_echo.py
run4mistral_prompteol.py		run4mistral_prompteol.py
run4qwen_echo.py		run4qwen_echo.py
run4qwen_prompteol.py		run4qwen_prompteol.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EmbFilter

tips

run

Reference

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

EmbFilter

tips

run

Reference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages