Spiderman

Spiderman is my NLP thesis project from Rose-Hulman.

For my senior thesis, I explored a new method of emotional classification for text analysis. The corpus was a series of tweets compiled over two weeks. In the most basic terms, I used preclassified test data to build a bidirectional graph with emotional clusters of terms extracted from the tweets by named-entity recognition (NER). Each node in the graph is a term or phrase that occurs in a tweet, and each edge indicates that at least one tweet contained both of the terms it connects. Emotional clusters are close groups of nodes that all share a similar emotion classification.
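The graph construction described above can be sketched roughly as follows. This is a minimal illustration, not the thesis code: it assumes NER has already been run so each tweet arrives as a list of terms with one emotion label, it treats the graph as undirected, and it assigns each term to the emotion it most often appeared under. The function names (`build_term_graph`, `emotional_clusters`) and the sample tweets are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def build_term_graph(labeled_tweets):
    """Build a co-occurrence graph from (terms, emotion) pairs.

    Returns (adjacency, term_emotion). adjacency maps each term to the set
    of terms it shared a tweet with. term_emotion maps each term to the
    emotion it most often appeared under (an assumed labeling rule).
    """
    adjacency = defaultdict(set)
    votes = defaultdict(Counter)
    for terms, emotion in labeled_tweets:
        unique_terms = sorted(set(terms))
        for term in unique_terms:
            adjacency[term]  # touch the key so isolated terms become nodes
            votes[term][emotion] += 1
        # One edge per pair of terms that occur in the same tweet.
        for a, b in combinations(unique_terms, 2):
            adjacency[a].add(b)
            adjacency[b].add(a)
    term_emotion = {t: c.most_common(1)[0][0] for t, c in votes.items()}
    return dict(adjacency), term_emotion


def emotional_clusters(term_emotion):
    """Group terms by their assigned emotion (a simplified notion of cluster)."""
    clusters = defaultdict(set)
    for term, emotion in term_emotion.items():
        clusters[emotion].add(term)
    return dict(clusters)


# Hypothetical preclassified tweets, already reduced to their terms.
tweets = [
    (["puppy", "park"], "joy"),
    (["puppy", "birthday"], "joy"),
    (["traffic", "monday"], "anger"),
    (["traffic", "park"], "anger"),
    (["park", "birthday"], "joy"),
]
graph, term_emotion = build_term_graph(tweets)
clusters = emotional_clusters(term_emotion)
```

Grouping purely by majority label ignores how close the nodes are to one another in the graph, so a fuller version would also need some graph-based clustering step.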

To classify a new tweet, the tweet is parsed into its relevant NER terms, and the average geodesic distance (the shortest-path length in the graph) from those terms to each emotional cluster is calculated. The cluster with the smallest average distance determines the emotional classification of the tweet.
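A rough sketch of that classification step is shown here. It makes several assumptions that the description leaves open: the distance from a term to a cluster is taken as the hop count to the nearest node in that cluster, terms that do not appear in the graph are skipped, and an unreachable cluster counts as infinitely far. The function names, the small graph, and the clusters are all hypothetical.

```python
from collections import deque


def geodesic_distances(graph, source):
    """Breadth-first search: hop counts from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in graph.get(node, ()):
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def classify(graph, clusters, terms):
    """Return (emotion, scores), where scores holds the average distance
    from the tweet's known terms to each cluster. Lowest score wins."""
    known = [t for t in terms if t in graph]
    if not known:
        return None, {}
    totals = {emotion: 0.0 for emotion in clusters}
    for term in known:
        dist = geodesic_distances(graph, term)
        for emotion, members in clusters.items():
            reachable = [dist[m] for m in members if m in dist]
            # Assumed rule: distance to a cluster is distance to its nearest node.
            totals[emotion] += min(reachable) if reachable else float("inf")
    scores = {emotion: total / len(known) for emotion, total in totals.items()}
    return min(scores, key=scores.get), scores


# Hypothetical graph (adjacency sets) and clusters.
graph = {
    "puppy": {"park", "birthday"},
    "birthday": {"puppy", "park"},
    "park": {"puppy", "birthday", "traffic"},
    "traffic": {"park", "monday"},
    "monday": {"traffic"},
}
clusters = {
    "joy": {"puppy", "birthday", "park"},
    "anger": {"traffic", "monday"},
}

# "coffee" is not in the graph, so it is ignored.
label, scores = classify(graph, clusters, ["puppy", "birthday", "coffee"])
print(label, scores)
```

Using the nearest cluster member keeps the example short. Averaging over all members of a cluster, or measuring to a cluster center, would be equally plausible readings of the method.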

There are a variety of pros and cons with this approach, as well as a multitude of enhancements that would greatly improve its accuracy. I'll upload my thesis paper in PDF one of these days.
