Author: Nguyen Viet Bac - 22022511
-
My datatrain is over 100 MB therefore I can't push it to my repo. You can use your data or use the data below:
-
Data: https://drive.google.com/file/d/1l8PVWLyaFHERQlyleRPyMzxXch-7h-mZ
The project aims to build a vector space for Vietnamese. From this, we use the models to calculate the similarity of words or sentences.
- You can run the command below to build new models:
python main.py