Meaning Extraction Helper v2: Built from the ground up in C#
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
LemmaGen_v3.0_PrebuiltCompact
MEH 2
TwitterAwareTokenizerNET
.gitattributes
.gitignore
LICENSE
MEHv2.sln
README.MD

README.MD

Source code for the Meaning Extraction Helper (MEH) program. https://meh.ryanb.cc

Currently in beta.

This is a complete C# rebuild of the original MEH. A lot of the older features have been removed, and implementation of most features in this version (starting with version 2.0.0) are far more efficient, both in terms of methodology as well as their use of the user's hardware.

As with most things, this software stands on the shoulders of giants. The work of others that makes MEH possible includes (but is probably not limited to):

-LemmaGen v3 Prebuilt/Pretrained Lemmatizers: http://lemmatise.ijs.si/

-NewtonSoft JSON.Net: https://github.com/JamesNK/Newtonsoft.Json

-The NLTK Twitter-aware tokenizer: https://www.nltk.org/_modules/nltk/tokenize/casual.html#TweetTokenizer

My C# port: https://github.com/ryanboyd/nltk.tokenize.casual.NET