Text Classification

Classifying Vietnamese text content.

Resources

Dataset: Google search results
Word vectors: https://github.com/Kyubyong/wordvectors

Tasks

* Cleaning the text, splitting it into words and handling punctuation and case.
* Categorizing text data.
* Building the models.
* Model evaluation.
* Building a RESTful API.
* Building the web/app layout.
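
The cleaning task in the list above (splitting into words, handling punctuation and case) can be sketched roughly as follows. This is a minimal illustration, not the code in build.py: the function name `clean_and_tokenize` is hypothetical, and splitting on whitespace is an assumption. Vietnamese words often span several whitespace-separated syllables, so a dedicated word segmenter may be a better fit for real data.

```python
import unicodedata
from typing import List


def clean_and_tokenize(text: str) -> List[str]:
    """Lowercase the text, replace punctuation with spaces, split on whitespace."""
    # NFC normalization keeps Vietnamese diacritics as single composed characters,
    # so the same word typed in different ways maps to the same token.
    text = unicodedata.normalize("NFC", text).lower()
    # Treat every Unicode punctuation (P*) or symbol (S*) character as a separator.
    cleaned = "".join(
        " " if unicodedata.category(ch)[0] in ("P", "S") else ch for ch in text
    )
    # Assumption: whitespace splitting stands in for real word segmentation.
    return cleaned.split()


if __name__ == "__main__":
    print(clean_and_tokenize("Xin chào, Việt Nam!"))
    # ['xin', 'chào', 'việt', 'nam']
```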

Check out this link for the RESTful API.

Work flow

Install the packages: pip install -r setup.txt
Download the dataset and extract it into ./data

Run python build.py (or build.py directly) to build the data.
You can also run python build-compressed.py to compress your data.
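
For readers wondering what compressing the built data might look like, here is a minimal sketch using only the standard library. It assumes the data is a plain Python object that is pickled and gzip-compressed. Whether build-compressed.py works this way is not confirmed by this README, and the helper names and the demo file name are hypothetical.

```python
import gzip
import os
import pickle
import tempfile


def save_compressed(obj, path):
    """Pickle obj and write it gzip-compressed to path."""
    with gzip.open(path, "wb") as f:
        pickle.dump(obj, f, protocol=pickle.HIGHEST_PROTOCOL)


def load_compressed(path):
    """Read back an object written by save_compressed."""
    with gzip.open(path, "rb") as f:
        return pickle.load(f)


if __name__ == "__main__":
    # Hypothetical toy data standing in for the built feature lists.
    x_demo = [["xin", "chào"], ["việt", "nam"]]
    with tempfile.TemporaryDirectory() as tmp:
        demo_path = os.path.join(tmp, "X_data_demo.pkl.gz")
        save_compressed(x_demo, demo_path)
        assert load_compressed(demo_path) == x_demo
```

Only load pickle files you created yourself, since unpickling untrusted data can execute arbitrary code.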

Run python train.py (or train.py directly) to train on the full training data.

Run python predict.py (or predict.py directly) to predict the result.
You can also change the algorithm in this script.
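
Once predictions are available, they can be compared against known labels to cover the model evaluation task. The sketch below computes accuracy and per-class precision and recall in plain Python. It is an illustration under stated assumptions, not the repository's evaluation code: the function names and the label values ("sports", "tech") are hypothetical.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_precision_recall(y_true, y_pred):
    """Return {label: (precision, recall)} for every label seen in either list."""
    true_count = Counter(y_true)
    pred_count = Counter(y_pred)
    # Count true positives: predictions that match the true label.
    tp = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        precision = tp[label] / pred_count[label] if pred_count[label] else 0.0
        recall = tp[label] / true_count[label] if true_count[label] else 0.0
        scores[label] = (precision, recall)
    return scores


if __name__ == "__main__":
    y_true = ["sports", "sports", "tech", "tech"]
    y_pred = ["sports", "tech", "tech", "tech"]
    print(accuracy(y_true, y_pred))  # 0.75
    print(per_class_precision_recall(y_true, y_pred))
```

Running the same comparison after switching algorithms gives a quick way to see which one performs better on held-out data.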


fecoderchinh/text-classification
