Introducing slangID

In a nutshell: The slangID project tries to detect slang phrases. Something literally no one asked for...

slangID consists of two programs:

slangID_demo.py lets you train a selection of classifiers, and prints out a test set of phrases with their predicted types (slang or normal).
slangID_predict.py lets you also train a selection of classifiers and predict the type of your input.

All the models are pre-trained.

Challenges

Due to a lack of data, the results, regardless of the classifier used, are not good enough right now. Certain bigram slang words like (a) real one are more difficult to resolve since the provided models do not take n-grams into consideration.

How to run slangID_demo and slangID_predict

Install Python 3.9 (3.8 and 3.10 is probably fine too, I used 3.9.12).
Install the required packages by running pip install -r requirements.txt in your shell of choice. Make sure you are in the project directory.
And then run python slangID_demo.py or python slangID_predict.py.
Follow the displayed instructions.

What you will be greeted with when you run slangID_demo

What you will be greeted with when you run slangID_predict

Source of the data

Most of the phrases come from archive.org's Twitter Stream of June 6th, some come from me personally.

Recognition of Open Source use

scikit-learn
pandas

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
classifiers		classifiers
misc		misc
LICENSE		LICENSE
README.md		README.md
ignore_predict.py		ignore_predict.py
requirements.txt		requirements.txt
slangID_demo.py		slangID_demo.py
slangID_predict.py		slangID_predict.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

classifiers

classifiers

misc

misc

LICENSE

LICENSE

README.md

README.md

ignore_predict.py

ignore_predict.py

requirements.txt

requirements.txt

slangID_demo.py

slangID_demo.py

slangID_predict.py

slangID_predict.py

Repository files navigation

Introducing slangID

Challenges

How to run slangID_demo and slangID_predict

What you will be greeted with when you run slangID_demo

What you will be greeted with when you run slangID_predict

Source of the data

Recognition of Open Source use

About

Releases

Packages

Languages

License

m4cit/slangID

Folders and files

Latest commit

History

Repository files navigation

Introducing slangID

Challenges

How to run slangID_demo and slangID_predict

What you will be greeted with when you run slangID_demo

What you will be greeted with when you run slangID_predict

Source of the data

Recognition of Open Source use

About

Topics

Resources

License

Stars

Watchers

Forks

Languages