What is slangID

In a nutshell: The slangID project tries to detect slang phrases. Something literally no one asked for...

You can train a selection of classifiers, and print out a test set of phrases with the DEMO button. Or you can type a phrase and see what type it is identified as. All the models are pre-trained, but you can re-train if needed.

Challenges

Due to a lack of data, the results, regardless of the classifier used, are not impressive right now. Unknown words are also an issue since the dataset is tiny. Slang phrases with normal words like 'sick' are not accounted for with a sentiment analysis either.

Performance

In total, there are five classifiers you can choose from:

Linear SVM (SVC with linear Kernel)
Decision Tree
Gaussian Naive Bayes
Multinomial Naive Bayes
Logistic Regression

Currently the best performer is the Gaussian Naive Bayes model with an F₁ score of ~65.70%

How to run slangID2

Install Python 3.9 (3.8 and 3.10 is probably fine too, I used 3.9.12).
Install the required packages by running pip install -r requirements.txt in your shell of choice. Make sure you are in the project directory.
And then run python slangID2_Windows.py or python3 slangID2_Linux.py (the difference between both versions is just the font size on some labels and buttons).

Note: It might take a while to load. Be patient.

Screenshot

Source of the data

Most of the phrases come from archive.org's Twitter Stream of June 6th.

Recognition of Open Source use

scikit-learn
pandas

Name		Name	Last commit message	Last commit date
Latest commit History 148 Commits
classifiers		classifiers
misc		misc
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
slangID2_Linux.py		slangID2_Linux.py
slangID2_Windows.py		slangID2_Windows.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is slangID

Challenges

Performance

How to run slangID2

Screenshot

Source of the data

Recognition of Open Source use

About

Releases

Packages

Languages

License

m4cit/slangID2

Folders and files

Latest commit

History

Repository files navigation

What is slangID

Challenges

Performance

How to run slangID2

Screenshot

Source of the data

Recognition of Open Source use

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages