Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
Chapter 10		Chapter 10
Chapter 11		Chapter 11
Chapter 12		Chapter 12
Chapter 14		Chapter 14
Chapter 15		Chapter 15
Chapter 16		Chapter 16
Chapter 2		Chapter 2
Chapter 3		Chapter 3
Chapter 4		Chapter 4
Chapter 5		Chapter 5
Chapter 6		Chapter 6
Chapter 7		Chapter 7
Chapter 8		Chapter 8
data		data
.gitattributes		.gitattributes
README.md		README.md

Repository files navigation

Text Mining

파이썬 텍스트 마이닝 완벽 가이드 를 기반으로 공부합니다.

기본 베이스 코드 : https://github.com/wikibook/textmining
사용하는 기본 라이브러리
- KoNLPy
- NLTK
- scikit-learn
- Tensorflow
- Keras
- PyTorch
- Transformers
Chapter 설명
- Chapter 2 : 텍스트 전처리 (Text Preprocessing)
- Chapter 3 : 그래프 & 워드 클라우드 (Graph & WordCloud)
- Chapter 4 : 카운트 기반의 문서 표현 (Count based Document Representation)
- Chapter 5 : BoW 기반의 문서 분류 (Bag of Word based Document Classification)
- Chapter 6 : 차원 축소 (Dimensionality Reduction)
- Chapter 7 : 토픽 모델링 (Topic Modeling)
- Chapter 8 : 감성 분석 (Sentiment Analysis)
- Chapter 10 : 딥러닝을 이용한 문서 분류 (Document Classification using Deep Learning)
- Chapter 11 : Word2Vec, ELMo, Doc2Vec의 이해 (Understanding Word2Vec, ELMo, Doc2Vec)
- Chapter 12 : CNN을 응용한 문서 분류 (Document Classification using CNN)
- Chapter 13 : 어텐션과 트랜스포머 (Attention & Transfomers)
  - 다른 rapository에 설명(현 Chapter부터 설명은 여기에 추가)
- Chapter 14 : BERT (Bidirectional Encoder Representations from Transformers) 의 이해와 간단한 활용
- Chapter 15 : BERT 사전학습(pre-trained) 모형에 대한 미세조정(Fine-tuning) 학습
- Chapter 16 : 한국어 문서에 대한 BERT 황용

About

Studying Text mining

Report repository

Releases

No releases published

Packages

No packages published

Languages

Jupyter Notebook 100.0%