KoAlpaca: ํ๊ตญ์ด ๋ช ๋ น์ด๋ฅผ ์ดํดํ๋ ์คํ์์ค ์ธ์ด๋ชจ๋ธ
-
Updated
May 30, 2024 - Jupyter Notebook
KoAlpaca: ํ๊ตญ์ด ๋ช ๋ น์ด๋ฅผ ์ดํดํ๋ ์คํ์์ค ์ธ์ด๋ชจ๋ธ
Python package for Korean natural language processing.
Korean BERT pre-trained cased (KoBERT)
ํ๊ตญ์ด ์์ฐ์ด์ฒ๋ฆฌ๋ฅผ ์ํ ํ์ด์ฌ ๋ผ์ด๋ธ๋ฌ๋ฆฌ์ ๋๋ค. ๋จ์ด ์ถ์ถ/ ํ ํฌ๋์ด์ / ํ์ฌํ๋ณ/ ์ ์ฒ๋ฆฌ์ ๊ธฐ๋ฅ์ ์ ๊ณตํฉ๋๋ค.
A curated list of resources for NLP (Natural Language Processing) for Korean
Pretrained ELECTRA Model for Korean
KoBERT์ CRF๋ก ๋ง๋ ํ๊ตญ์ด ๊ฐ์ฒด๋ช ์ธ์๊ธฐ (BERT+CRF based Named Entity Recognition model for Korean)
๐ค Pretrained BERT model & WordPiece tokenizer trained on Korean Comments ํ๊ตญ์ด ๋๊ธ๋ก ํ๋ฆฌํธ๋ ์ด๋ํ BERT ๋ชจ๋ธ๊ณผ ๋ฐ์ดํฐ์
KSS: Korean String processing Suite
Kiwi(์ง๋ฅํ ํ๊ตญ์ด ํํ์ ๋ถ์๊ธฐ)
Automatic Korean word spacing with Python
Korean HateSpeech Dataset
๋น์ง๋ํ์ต ๋ฐฉ๋ฒ์ผ๋ก ํ๊ตญ์ด ํ ์คํธ์์ ๋จ์ด/ํค์๋๋ฅผ ์๋์ผ๋ก ์ถ์ถํ๋ ๋ผ์ด๋ธ๋ฌ๋ฆฌ์ ๋๋ค
์ธ์ด๋ชจ๋ธ์ ํ์ตํ๊ธฐ ์ํ ๊ณต๊ฐ ํ๊ตญ์ด instruction dataset๋ค์ ๋ชจ์๋์์ต๋๋ค.
Korean Morphological Analyzer by shineware
ํ ์ํ๋ก2์ ๋จธ์ ๋ฌ๋์ผ๋ก ์์ํ๋ ์์ฐ์ด์ฒ๋ฆฌ (๋ก์ง์คํฑํ๊ท๋ถํฐ BERT์ GPT3๊น์ง) ์ค์ต์๋ฃ
Implementing nlp papers relevant to classification with PyTorch, gluonnlp
Add a description, image, and links to the korean-nlp topic page so that developers can more easily learn about it.
To associate your repository with the korean-nlp topic, visit your repo's landing page and select "manage topics."