CMDI-Project

Internship project at China Mobile Development Institute

● Preprocessed input text data and separated words into characters and character-pairs.

● Wrote natural language processing algorithms using machine learning models with sci-kit learn module in Python.

● Applied the algorithm to the training data, to analyze and identify fraudulent observations.

● Fit the model to the validation set and made predictions.

● "cn_stopwords.csv" contains Chinese stopwords.

● "main.py" contains the main algorithm.

● "text.csv" contains training observations.

● "target.csv" contains training targets.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
cn_stopwords.csv		cn_stopwords.csv
main.py		main.py
target.csv		target.csv
text.csv		text.csv