Implementation of bi-encoder Text2Topic architecture describe in Text2Topic: Multi-Label Text Classification System for Efficient Topic Detection in User Generated Content with Zero-Shot Capabilities
Read the paper & the original repository for details about the algorithm !
pip install text2topiclossor
git clone
python -m pip install .I'm not the author of the original paper, so if you use this library, please cite the original paper :
@inproceedings{wang-etal-2023-text2topic,
title = "{T}ext2{T}opic: Multi-Label Text Classification System for Efficient Topic Detection in User Generated Content with Zero-Shot Capabilities",
author = "Wang, Fengjun and
Beladev, Moran and
Kleinfeld, Ofri and
Frayerman, Elina and
Shachar, Tal and
Fainman, Eran and
Lastmann Assaraf, Karen and
Mizrachi, Sarai and
Wang, Benjamin",
editor = "Wang, Mingxuan and
Zitouni, Imed",
booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track",
month = dec,
year = "2023",
address = "Singapore",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.emnlp-industry.10",
doi = "10.18653/v1/2023.emnlp-industry.10",
pages = "93--103",
}GNU General Public License v3.0
