https://colab.research.google.com/drive/1u34ME_e1wezCmCRdjTiTHlWg9DZsfheb?usp=sharing
- Nation politics news
- A collection of 4,000 politics and royal news from Nation TV from 2020 May ~ October.
- CheerLung
- Scraped Facebook posts from เชียร์ลุง, which is known to be politically biased.
- Quotes that sound awesome and usually be said by people who want Thailand to remain in stone age.
I created this project (datasets and model) for entertainment and education purposes since the datasets have strong characteristics. Not for promoting conflicts.
Language modeling with transformer in pytorch. The model architecture is taken from this tutorial.
https://pytorch.org/tutorials/beginner/transformer_tutorial.html
Torchtext tutorial
https://github.com/keitakurita/practical-torchtext
Attacut tokenizer