● Preprocessed input text data and separated words into characters and character-pairs.
● Wrote natural language processing algorithms using machine learning models with sci-kit learn module in Python.
● Applied the algorithm to the training data, to analyze and identify fraudulent observations.
● Fit the model to the validation set and made predictions.
● "cn_stopwords.csv" contains Chinese stopwords.
● "main.py" contains the main algorithm.
● "text.csv" contains training observations.
● "target.csv" contains training targets.