Thesis

Code snippets used in experiments for thesis on comparing FCA and LDA for short-text classification.

Includes data pre-processing:

Data cleanup, case conversion, removing stop words, punctutation and words in just one doc.
Splitting between train and test sets
Removal of class labels
TDM generation

FCA:

LDA:

Modelling:

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
DataEngineering		DataEngineering
fca		fca
lda		lda
modelling		modelling
README.md		README.md
plot_scores.py		plot_scores.py

Provide feedback